Edyoda: Data Scientist Program
Edyoda: Data Scientist Program
Edyoda: Data Scientist Program
Program Curriculum
www.edyoda.com hello@edyoda.com
Learning outcomes:
Python Programming
1. Introduction to Python
Useful Python Resources
Python Tools and Utilities
Python Features
2. Python Environment
Local Environment Setup
Downloads and Installations
Setting up Environment Path
3. Executing Python
Interactive Mode
Scripting Mode
Integrated Development Environment
www.edyoda.com hello@edyoda.com
Bitwise Operators
Logical Operators
Membership Operators
Identity Operators
Operators Precedence
8. Python Loops
While loop
For loop
Nested loop
Break control statement
Continue statement
Pass statement
9. Python Numbers
Number type conversion
Mathematical function
Random number function
Trigonometric function
www.edyoda.com hello@edyoda.com
Basic Dictionary operations
Built-in Functions and Methods
Use cases
www.edyoda.com hello@edyoda.com
19. Functional Programming
Lambda
Filter
Map
Functools
21. Collections
Deque
Counter
OrderedDict
ChainMap
www.edyoda.com hello@edyoda.com
Data Analysis
1. Data Quality
Introduction to Data Quality
Handling different Data Quality Issues
3. Understanding of Data
Intro to types of data
Derived Facts/Dimensions
Building dimensions from Facts (Binning)
Granularity of Data
5. Data Modeling
Understanding: Unique Keys, Key References, Cardinality, ER Diagram
Introduction to Data Quality
The Six Dimensions of Data Quality
6. Excel Refresher
Frequently used Excel Functions
Useful Shortcuts for Faster Excel Analysis
Tables in Excel
Data Formatting in Excel
Visualization with Excel
www.edyoda.com hello@edyoda.com
Data Transformations in PowerQuery
8. Power BI Essentials
Overview of Power BI Tools
Handling Data Types and Formats
Handling Special Data Category
Creating Hierarchical Dimensions
KPI Cards
Bar Charts / Column Charts
Filters (Simple vs Complex)
Slicers
Formatting & Aesthetics
Publishing and Sharing your Dashboard
Exploring different Chart Options
Understanding Important Terms in a Given Visual
Pivot/Matrix Tables
Creating Drilldown Reports
Introduction to DAX
Commonly used DAX Functions
Applications of DAX Concepts
Exploring different types of visuals
Publishing Modified Dashboard
9. Probability Theory
Types of Events
Idea of a Random Events
Understanding via Example Datasets
Discrete vs Continous Random Variables
Nominal, Ordinal, Ratio/Interval Data
Basic Probability Theory
Idea of MECE events
Idea of Conditional events / Independent Events
Idea of Bayes Theorem
www.edyoda.com hello@edyoda.com
Mean, Median, Mode, Variance, Co-variance, Standard Deviation etc.
Skewness & Kurtosis of a distribution
Meaning of left, right skewed data
www.edyoda.com hello@edyoda.com
Data Wrangling
2. Essential NumPy
Introduction to NumPy
Creation
Access
Stacking and Splitting
Methods
Broadcasting
Introduction to Pandas
Understanding Series & DataFrames
Loading CSV,JSON
Connecting databases
Descriptive Statistics
Accessing subsets of data - Rows, Columns, Filters
Handling Missing Data
Dropping rows & columns
Handling Duplicates
Function Application - map, apply, groupby, rolling, str
Merge, Join & Concatenate
Stacking, Unstacking & Melting
Pivot-tables
www.edyoda.com hello@edyoda.com
Normalizing JSON
Application - EDA on Employee data, sales data
4. Understanding Visualization:
Machine Learning
Introduction to Preprocessing
www.edyoda.com hello@edyoda.com
StandardScaler
MinMaxScaler
RobustScaler
Normalization
Binarization
Encoding Categorical (Ordinal & Nominal) Features
Imputation
Polynomial Features
Custom Transformer
Text Processing
CountVectorizer
TfIdf
HashingVectorizer
Image using skimage
3. Decision Trees
4. Naive Bayes
www.edyoda.com hello@edyoda.com
5. Composite Estimators using Pipelines & FeatureUnions
Cross Validation
Hyperparameter Tuning
Model Evaluation
Model Persistence
Validation Curves
Learning Curves
7. Feature Selection & Dimensionality Reduction
www.edyoda.com hello@edyoda.com
Application - Credit Risk Prediction
8. Nearest Neighbors
9. Clustering Techniques
www.edyoda.com hello@edyoda.com
11. Support Vector Machines
www.edyoda.com hello@edyoda.com
15. Time Series Modeling
1. Mathematical Aptitude
Percentages
Profit and Loss
Simple Interest and Compound Interest
Work And Time
Probability
Permutation and Combination
Profit and Loss
Time & Speed
Ratios and Proportions
Data Interpretation
www.edyoda.com hello@edyoda.com
Procrastination
Improving Recall
Creating Brain Links
Visual memory & Data Memory
Slow Thinking
3. Computational Thinking
Thinking before Doing/Coding
Problem Identification
Decomposition
Pattern Recognition
Abstraction
Algorithm Design
Computational Thinking Use Case 1
Computational Thinking Use Case 2
4. Technical Puzzles
Why are Puzzles part of interviews?
The Art of solving puzzles
Approach more important than the solution
Puzzles for Vertical Thinking
Puzzles for Horizontal Thinking
www.edyoda.com hello@edyoda.com
Choosing wisely habits to inculcate
Professional Communication
www.edyoda.com hello@edyoda.com
Understanding why GD is so important in personal & professional life
The objective of GD - Collectively making the right decision
5 GD on various topics
Computer Fundamentals
www.edyoda.com hello@edyoda.com
Remote Services
Distributed Systems
Introduction to Data Center Technologies
2. Linux Administration
Introduction to Linux Operating Systems
Basic Linux Commands
File Management and Security
The directory structure of Unix
User Management
Groups
Shell types and basic commands
Permissions
sudo
Systemd Services Start and Stop
Resource Mgmt with systemctl
Process Management (top, ps)
Package Management(yum, apt, rpm)
Managing disks (lsblk, df, mount, umount,du)
File systems
www.edyoda.com hello@edyoda.com
Array
Stack
Queue
Tree
Basic Operations
o Traversing
o Searching
o Sorting
o Hashing
o Insertion
o Deletion
o Merging
Searching techniques
o Binary search
o Linear search
Recursion
Fibonacci series
Sorting Algorithm
o Bubble sort
o Insertion sort
o Selection sort
o Quick sort
o Merge sort
o Bucket sort
4. Database concepts
Introduction to Databases
Entity Relationship Model
Relational Model
Relational Algebra
Normalization
Transactions and Concurrency Control
DBMS Architecture 2-level 3-level
Data Abstraction and Data Independence
Database Objects
Entity-Relationship Model
Generalization
www.edyoda.com hello@edyoda.com
Specialization
Aggregation
Entity Relationship Diagrams
Keys in Relational Model
Candidate key,
Super key
Primary key
Alternate key
Foreign key
Strategies for Schema design
Schema Integration
Data modelling
Star Schema in Data Warehouse modelling
Data Warehouse Modeling
www.edyoda.com hello@edyoda.com
SQL Constraints
Type of Joins
Unions Clause
NULL Values
Indexing
Views
6. Software Engineering
Software Engineering Overview
Features of Good Software:
o Operational Features
o Transitional Features
o Maintenance Features
Software Development:
o Requirement Gathering
o Software Design
o Programming
Software Design
o Design
o Maintenance
o Programming
Programming:
o Coding
o Testing
o Integration
Software Development Life Cycle
o Requirement Gathering
o System Analysis
o Software Design
o Coding
o Testing
o Integration
o Deployment
o Operation and Maintenance
Types of SDLC
o Waterfall model
o Iterative Model
o Spiral model
o V Model
www.edyoda.com hello@edyoda.com
Agile Concepts
DevOps Concepts
Microservices Architecture
Features of Microservices Architecture
Software Requirements
Software Design Basics
Analysis & Design Tools
o Data Flow Diagram
o Flow Chart
Design Strategies
o Function-Oriented Design
o Object-Oriented Design
User Interface Design
o Command Line Interface(CLI)
o Graphical User Interface (GUI)
Design Complexity
Software Testing Overview
o Manual Vs Automated Testing
o Testing Approaches
o Black-box testing
o White-box testing
o Unit Testing
o Integration Testing
o Functionality testing
o Acceptance Testing
o Regression Testing
Quality Control
Deployment Methods
o Blue-Green Deployment
o Rolling Deployment
Software Monitoring
Software Maintenance
7. Tools
Git
o What is Git?
o Installing Git
www.edyoda.com hello@edyoda.com
o First-Time Git Setup
o Git Basics
o Getting a Git Repository
o Recording Changes to the Repository
o Viewing the Commit History
o Undoing Things
o Working with Remotes
o Tagging
o Git Branching
o Basic Branching and Merging
o Branch Management
o Branching Workflows
o Remote Branches
o Rebasing
Putty
o Installation
o Types of connections
o Connecting to a remote server
o Using Auth keys
o Customizing putty
Vim
o Vim Basics
o Insert Mode
o Visual Mode
o Command Mode
o Create and Edit a file
o Search and replace in Vim
o Vim diff
o Copy operations
o .vimrc file
o Vim Commands
www.edyoda.com hello@edyoda.com