0% found this document useful (0 votes)
3 views

Data Engineering Brochure New

The Bosscoder Data Engineering Program offers a comprehensive curriculum designed to equip learners with essential skills in data engineering through hands-on projects and real-world case studies. The program includes modules on core data foundations, big data, ETL, advanced data operations, and system design, with a focus on personalized learning and mentorship from industry experts. Additionally, the program provides support for job placements, connecting learners with top tech companies and offering resources for interview preparation.

Uploaded by

saianil2364
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Data Engineering Brochure New

The Bosscoder Data Engineering Program offers a comprehensive curriculum designed to equip learners with essential skills in data engineering through hands-on projects and real-world case studies. The program includes modules on core data foundations, big data, ETL, advanced data operations, and system design, with a focus on personalized learning and mentorship from industry experts. Additionally, the program provides support for job placements, connecting learners with top tech companies and offering resources for interview preparation.

Uploaded by

saianil2364
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 33

Data Engineering

CURRICULUM

Brochure
Search / Ctrl+ Student’s name

B atc
h : -

Home
Data Engineering
You have access to 8 modules in software engineering course.
Onboarding
Core Data ETL & Data Big Data & Advanced Beginners
Course Foundations Warehousing Cloud Data Ops Lectures 10,000

30% Module Done


30% Module Done
30% Module Done
30% Module Done
30% Module Done

Curriculum 5 Classes Completed 5 Classes Completed 5 Classes Completed 5 Classes Completed 5 Classes Completed

My Mentor
View Module View Module View Module View Module View Module 2100 +

Progress Meet Classes &

assignments
Schedule 60%
uestions
Contests
Q &

answers

Placements 25
Mentors i h p

session
Leader Board
Rewards
Pay Method
Refer & Earn
Doubt Support
Logout

www.bosscoderacademy.com
About Bosscoder
Bosscoder is an outcome-focused upskilling platform for tech professionals.
Our industry-approved approach towards teaching helps our learners upskill
& transition into their dream tech roles. 

Learners enrolled with us are taught, guided, and mentored by engineers


working at top product companies like Google, Microsoft, Amazon, Arcesium,
etc. 

Our Learners have witnessed an industry-leading hike of 120% with 6.25X


ROI (Return on Investment).

About Bosscoder Data


Engineering Program
Bosscoder Data Engineering is an industry-recognised program that focuses
on:

Building the RIGHT skills with Real-life Business Case Studie


Hands-on Projects with Datasets from Real Companie
Revised in 2024 with up-to-date tools & cloud-incorporating projects  

Within our program, you will learn from live classes hosted by instructors
working in Data Engineering roles with top product companies.

1
Table of Contents
Content Page No.

Landscape of Data 3

Why upskill with us? 5

Bosscoder Curriculum 6

Curriculum Deep-dive 7

Projects 16

Meet the Instructors 20

Alumni’s Thoughts about Bosscoder 23

2
Landscape of Data
Today, we're living in a world that's all about data. From the food we eat, the
music we listen to, to the health care services we receive - everything is
being shaped by data. 

And this is where the magic of Data Engineering and machine learning comes
into play.

In healthcare, Companies are using algorithms to predict diseases,


improving patient outcomes.

Financial institutions are harnessing machine learning to detect fraudulent


transactions in real-time.

Even in agriculture, farmers are leveraging Data to optimize yields and


manage resources efficiently.

The list is endless!

However, the best part is data revolution isn't limited to just one or two
industries. It's everywhere. This means, as Data Engineer, the opportunities
are immense.

Data is the new oil. And the ability to extract insights from this data is a
superpower.

According to LinkedIn’s emerged as one of the fastest growing job


requirement globally.

Some benefits of upskilling & transitioning to the Data Engineering domain


include

High Compensation
High Impactful work
Future Growth in the Tech Sector

3
However, learners face many challenges while upskilling → 

1. Lack of exposure to solving real-world business case studies & build


projects while working on real data sets.

2. Feel overwhelmed by the vastness of the Data Engineering domain & not
sure where to start, or where to end.

3. Lack of mentorship & community support to help them provide upskilling &
opportunities to get them successfully transition to Data Engineering roles.

We understand our learner's challenges & to overcome them, we designed


Bosscoder Data Engineering Program as an industry-focused curriculum that
features Business Case-Studies & Datasets from real companies.

Furthermore, post-upskilling, our devoted recruitment team bridges learners


with premier product companies and startups. They facilitate interview
opportunities and aid in a successful transition to a Data Engineering role,
ensuring your skills find their perfect match in the professional realm.

4
Why upskill with us?
Enhance Your Career With These Enriching Aspects of
Bosscoder's Upskilling Program

Structured, Industry-vetted Curriculum

Live Classes with Industry Experts

Hyper-Personalized Learning

Regular 1:1 Mentorship

1.5K+ Bosscoder Learners & Alumni Network

Industry Relevant Projects & Real World Case


Studies

A Dedicated Recruitment Cell

250+ Partner Employers

5
Bosscoder Curriculum
Topics Total Duration

Module 1: Core Data Foundations 6 weeks


Python (Beginner python + Advance Python) 3 weeks
SQL 3 weeks

Module 2: Big Data & Cloud[1] 7 weeks

Big Data Frameworks 2 weeks

Data Streaming 1 weeks

Advanced Cloud Services 4 weeks

Module 3: ETL & Data Warehousing 8 weeks


ETL Pipelines 4 weeks
Data Warehousing 4 weeks

Module 4: Advanced Data Ops 4 weeks

Advanced-Data Engineering 2 weeks

DevOps for Data Engineering 1 weeks

Data Security 1 weeks

6
Bosscoder Curriculum
Topics Total Duration

Module 5: DSA & System Design 8 weeks

DSA 6 weeks

System Design 2 weeks

Module 6: Get Placed as Data Engineer at Top


Until you get Placed
Companies

Building a strong profile -

Applying the right way -

Acing the interview -

7
Module - 1

Core Data Foundations


Duration: 6 Weeks

Build a strong foundation in Python and SQL to tackle data


manipulation, complex algorithms, and database design. From
relational databases to NoSQL, this module equips you with
essential data engineering skills through hands-on coding and
problem-solving.

Topics Covered:

1). Python

Flowcharts, Data Types, Operations


Conditional Statements & Loops
Strings
In-build Data Structures - List, Tuples, Dictionary,
Set, Matrix Algebra, Number System
Basics of Time & Space Complexity
OOPS
Functional Programming
Exception Handling & Module
Python Libraries: Numpy, Pandas, Matplotlib,
Seaborn, Plotly etc.

8
2). SQL
Introduction to Databases & BigQuery Setup
Extracting data using SQL
Functions, Filtering & Subqueries
Joins
GROUP BY & Aggregation
Window Functions
Date and Time Functions & CTEs
Indexes & Partitioning
Normalisation & Transactions
Introduction to NoSQL: MongoDB

USPs of our Delivery


All topics taught in live classes with limited batch size helping in
instant doubt support to accelerate learning.
Assignment (post-lecture) & their evaluation.
Hyper-Personalised: Special focus on the individual with a
constant touch from student success manager & mentor.

9
Module - 2

Big Data & Cloud Services


Duration: 7 Weeks

Explore Big Data technologies like Hadoop and Spark to process


large datasets. Learn distributed computing, real-time data
processing with Apache Kafka, and advanced cloud services like
AWS Glue, preparing you for complex data pipelines.

Topics Covered:

1). Big Data Frameworks


Hadoop
HDFS (Hadoop Distributed File System)
YARN (Yet Another Resource Negotiator)
MapReduce
Apache Spark
Spark core concepts: RDDs, DataFrames, and
SparkSQL
Parallel processing and distributed computing with
Spark
Spark for data transformation, aggregation, and
analytic
Powerful data processing with PySpark for scalable
analytics

12
Distributed Databases
CAP Theorem, consistency, availability, partition
tolerance
Cassandra, HBase: Columnar data stores for large-
scale datasets
Real-World Big Data Pipeline
Design and implement a basic pipeline using
Hadoop or Spark
Data storage, transformations, and querying

2). Data Streaming

Introduction to streaming data


Apache Kafka: Basics
Stream processing with Spark Streaming

3). Advanced Cloud Services

AWS
AWS EMR
OnPrem vs Cloud
HDFS vs S3
What is S3
EC2
Elastic IP
AWS storage, networking
S3 and EBS
AWS Glue
AWS Redshift
13
AZURE
Azure Data Factory
Azure Databricks
Azure Synapse Analytics
Azure Blob Storage

Google Cloud Platform


Bigquery
Pub/sub

Cloud data solutions

Linux
Introduction to Linux
File system navigation
Process Management
Shell Scripting
System configuration and advanced Linux
commands

USPs of our Delivery


Hands-on Learning Experienc
Solve multiple real-life case study problems in live classes &
understand the tradeoffs of each algorithm

14
Module - 3

ETL & Data Warehousing


Duration: 8 Weeks

Learn to design and manage ETL pipelines and data warehouses


using tools like Apache NiFi, AWS Glue, and cloud platforms.
Develop scalable, efficient systems with real-world case studies,
focusing on key architectures like star and snowflake schemas.

Topics Covered:

1). ETL Pipelines

ETL concepts: Extract, Transform, Load


Data ingestion and transformation
Tools: Apache NiFi, AWS Glue

2). Data Warehousing

Star schem
Snowflake schema
Introduction to cloud data warehouses: Redshift,
BigQuery
OLAP vs OLTP

10
USPs of our Delivery

Hyper-personalization: Depending on student-specific learning


pace, multiple revision classes are organized.

Assignments (post-lecture) & their immediate evaluation help to


compare your performance against peers.

The focus is not just to remember maths formulas but to help


learners visualize the intuition behind concepts, enabling them
to identify patterns.

As you work on different business situation & product thinking,


you gain a deeper understanding on what insights are important
& what insights are not important for a particular scenario.

11
Module - 4

Advanced Data Ops


Duration: 4 Weeks

Master advanced data engineering with fault-tolerant system


designs, CI/CD pipelines, and containerization using Jenkins,
Docker, and Kubernetes. Learn to secure data with encryption
and role-based access control.

1). Advanced-Data Engineering

High-availability and fault-tolerant designs


Scalability strategies

2). DevOps for Data Engineering

CI/CD pipelines, Jenkins, Gitlab


Infrastructure as Code: Terraform
Containerization: Docker, Kubernetes

3). Data Security

Data encryption
Authentication and RBAC

15
USPs of our Delivery
Impactful projects like Real-Time Financial Data Processing
for Goldman Sachs, Fraud Detection Data Pipeline for PayPal
and multiple others.
1:1 discussion with your mentor regarding project
improvements.

16
Module - 5

DSA & System Design


Duration: 8 Weeks

Strengthen your skills in Data Structures, Algorithms, and


System Design. Learn to design scalable, fault-tolerant systems
and solve real-world data challenges using event-driven
architectures and efficient processing pipelines.

Topics Covered:

1). DSA

Arrays, hashmaps
Stacks, queues
Trees (binary trees, heaps)
Graphs, sorting (QuickSort, MergeSort)
Time and space complexity

2). System Design

Scalable and fault-tolerant systems


Data warehousing design
Event-driven architecture

17
Why You'll Love This Module
Master DSA and system design for data platforms.
Tackle real-world scalability and system challenges.
Build high-performing, reliable data systems.

18
Module - 6

Get Placed as Data Engineer


at Top Companies
Duration: Until you get Placed

Once you have upskilled yourself to become a great data


engineer, it is important that we now focus on getting you
interview opportunities from diverse companies.

This process is usually in 3 phases:


Build a strong profil
Applying the right way
Acing the interview

We focus on all the above 3 objects in this Module.

Topics that will be covered:

1). Building a strong profile

Resume Creatio
LinkedIn profile optimizatio
Profile creation on other platforms

2). Applying the right way

Opportunities through Bosscoder Collaboration with


250+ companie
Referral to almost all the top product companie
Sharing hiring requirement of different companies
19
3). Acing the interview

On Demand Mock interview


Offline & Online Interview Guideline
Salary Negotiation

Outcome
You getting placed at one of the top tech companies like
Google, Microsoft, Amazon, Apple & sharing us a personal
review of your journey with us.

USPs of our Delivery


Student success manager stay connected with you
throughout your placement journey to ensure you achieve
best outcome
Collaboration with 250+ companies for hiring
Collaboration with consultancies who hire for top data driven
companies
Referrals from our alumni & mentor community for almost all
the companies
Resume reviews, profile building increasing your chances of
getting shortlisted
On demand mock interviews with mentor before a specific
interview
100% support from our team to help you succeeds.

20
Projects

#1
Sales Data ETL Pipeline for Nike
Build a basic ETL pipeline to process Nike’s sales data from CSV
files and APIs. Use Apache Airflow to automate the extraction,
transformation, and loading of sales data into a PostgreSQL
database. Perform basic data aggregations to analyze sales trends
by region and product category, offering insights to improve sales
strategy.

35 hours

#2

Customer Journey Analysis for Netflix


Develop a data pipeline that tracks Netflix user behavior, including
browsing and viewing history. Stream the data using Apache Kafka
and use Apache Flink for real-time analysis of user engagement.
Store the processed data in Google BigQuery, enabling analysis of
how users interact with different content types.

40 hours

21
#3
Inventory Management Optimization for Home
Depot
Design a pipeline to optimize Home Depot’s inventory by ingesting
stock data from databases and supplier APIs. Use Apache NiFi for
data ingestion and Apache Spark for trend analysis. The final results
will be displayed in a real-time dashboard using Google Data Studio,
helping Home Depot adjust inventory levels based on sales trends.

30 hours

#4
Traffic Data Analytics for Uber
Build a simplified pipeline to process real-time Uber traffic data.
Capture traffic data streams using Apache Kafka and use Apache
Spark to compute real-time congestion levels and route
optimization. Store the results in Azure Synapse Analytics for
further analysis and generate route efficiency reports for Uber
drivers.

40 hours

22
#5
Building a Real-Time Ad Analytics Platform for
Facebook
Create a real-time data pipeline to analyze Facebook ad
performance. Use Apache Kafka to collect streaming ad data,
process it using Apache Spark Streaming for real-time metrics, and
store the processed data in Google BigQuery for deeper analysis.
Build an interactive dashboard using Tableau for monitoring ad
performance across various demographics and geographies.

40 hours

#6
Fraud Detection Data Pipeline for PayPal
Develop a fraud detection pipeline that processes real-time
transaction data. Use Apache Kafka for streaming data and process
it using Apache Flink to identify fraudulent patterns. Store the
results in Google BigQuery, enabling PayPal to respond quickly to
potential fraud cases.

40 hours

23
#7
Big Data Pipeline for E-commerce
Personalization for Amazon
Build a big data pipeline that processes customer data at scale to
deliver personalized product recommendations on Amazon. Use
Apache Hadoop for distributed data storage and Spark for data
processing. Implement recommendation algorithms using machine
learning libraries in Spark and integrate the output into Amazon S3
for fast retrieval.

45 hours

#8
Real-Time Financial Data Processing for
Goldman Sachs
Develop a high-frequency trading data pipeline for processing and
analyzing real-time stock market data. Use Apache Kafka for
capturing real-time data streams from financial markets, apply
Apache Flink for complex event processing (CEP), and store the
processed data in AWS Redshift for real-time analysis. Provide
insights to assist in trading strategies.

45 hours

24
Meet Your Instructors &
Mentors
Learn from Industry veterans
Our instructors and mentors are highly rated by working professionals
upskilling with us, they are working at top product companies & have
experience working on industry scale Data Engineering projects. They are
also familiar with the best ways to crack these companies.

Rajat Garg
Co-founder - Bosscoder, Ex - Microsoft
Introducing Rajat Garg, an esteemed graduate of NIT Delhi from
the class of 2019. Rajat embarked on his professional journey at
Microsoft, playing a pivotal role in expanding PowerPoint's web
services, catering to an impressive 200 million monthly active
users. With a solid foundation of six years in teaching, Rajat is
dedicated to education and mentorship.

Manish Garg
Co-founder of Bosscoder, Ex - Samsung, Quoori
Presenting Manish Garg, an IIT Dhanbad alumnus, has had an
impressive tech career. From Samsung's Machine Learning
engineer to collaborating with Android's founder at Essential and
leading Quoori's speech department, his journey is remarkable.
With nine years of teaching experience, Manish is dedicated to
education and mentorship.

25
Meet Your Instructors &
Mentors
Parijat Roy
Senior Data Scientist, Microsoft
Meet Parijat Roy, a seasoned data scientist from Microsoft and
Jadavpur University alum with 8 years of industry experience.
Transitioning from software engineering to data science, Parijat
specializes in Natural Language Processing (NLP) for analyzing
feedback and improving Net Promoter Scores (NPS) for Office
products. His diverse skill set makes him an ideal mentor for
aspiring data scientists.

Sankalp Tomar
Senior Data Scientist at Microsoft
Introducing Sankalp Tomar, with 10 years in the tech industry, now
a Senior Data Scientist at Microsoft. He transitioned from a System
Engineer at Infosys to a key role in Microsoft's Graphics team,
focusing on creating images from text in Office and enhancing
features in Microsoft Edge. Sankalp's diverse expertise and
journey through different tech roles make him an insightful mentor
for aspiring data scientists.

26
Alumni’s Thoughts about

Bosscoder

Pulkit Gupta

“Bosscoder Academy's personalized onboarding, live classes,


mentor sessions, and off-class support helped me to get an
amazing hike. Personal attention assured me that the decision
was worthwhile.”

Ankur Raj

“Engaging live classes, mentor support and mock interviews led to


personal and professional growth. Achieved my dream offer at
Amazon, no words to describe the feeling.”

Akshit Aggarwal

“After joining Bosscoder Academy, I realised that Proper Guidance,


Consistency and practice with projects can help you achieve
anything. As a result, I got Placed, and looking forward to more
achievements.”

Udit Sharma

“I did not have a structured path to follow to kickstart my Data


career, that's when i decided that opting for a structured course,
like Bosscoder’s, would be a good option. As a result, I got placed
at Dassault Systems.”

27
Alumni’s Thoughts about

Bosscoder
Chaitanya

“At first, I was unsure about the exact qualifications needed to


succeed in Data Engineer domain with product-based companies.
Instructors and Mentors at Bosscoder provided clarity on the
same. Live classes, curriculum, and mentorship made my
preparation successful.”

Garima Gogia

“Earlier, I struggled finding the right topics to study, and I did not
have any path to follow. What stood out for me at Bosscoder was
the detailed curriculum, covering all topics. It went from basics to
advanced topics.”

Himanshu Bhaware

“I decided to join Bosscoder primarily to get good with


fundamentals and advanced concepts of Machine Learning.
Bosscoder was really helpful to me in my quest to learn
everything I needed to land a job.”

Karthik P

“I was looking for switch to product based companies and had


started preparing for it. Thats when I came across Bosscoder.
Personal mentors, monthly mock interviews, and valuable
feedback greatly added to my interview preparation.”

28
Alumni’s Thoughts about

Bosscoder
Rohal Kurup

“Dreaming of a product-based company, I joined Bosscoder


Academy. Their excellent curriculum and structured topics of
Machine Learning guided me effectively rather than searching
randomly on YouTube.”

Vamsi Kesav

“I Joined Bosscoder Academy for a detailed Data Engineer


curriculum and top mentorship. Manish's guidance emphasizes
practical approaches for projects. Personal Mentorship helped me
receive an offer letter from BlogVault.”

Ananda

“Since Day 1, I was guided by my mentor and I got a structured


curriculum. The curriculum covers everything which is asked in a
DSML interview and it increases the chances to land a job in a
good company.”

Vritika Chaudhary

“I was always confused about what to study and what not to study
for DSML interviews then I decided to join Bosscoder Academy.
Mentors here are very supportive. Anyone who religiously follows
the classes and is consistent can get his concepts very clear.

29
Alumni’s Thoughts about

Bosscoder
Rakesh

“I was stuck with a service based company with limited exposure


but I didn’t know how to shift. I came to know about Bosscoder
Academy. With constant guidance and moral support from Megha
Ma'am, I was able to crack a good start-up company followed by
more product based companies.”

Arun VS

“Bosscoder Academy's comprehensive syllabus and structured


machine learning cover diverse topics. Live classes, mentor
interaction and peer interaction is the key difference between
learning from online resources vs attending live sessions.”

Shubhankar Singh

“The online classes, as one would expect, are excellent. The


highlight of the course for me was the mentor support program. It
really helped me build the confidence and the eloquence needed
to ace interviews.”

30
BOSSCODER
ACADEMY

Empowering

“out of school” children

For every Bosscoder program enrollment, we


contribute

1% towards the education of ‘out of school’ children.

Changing Lives: Our Impact Stories


2017 2017 Amit's

story
2023 2023
Sandhya's

story
Sandhya dropped out of school & had to do
Amit (here, age -34) studied in school till 5 but

household chores & begging to support her family. has to drop out to earn and support his family.
1st girl in Basti to go out to Pursue education, completed
Amit has now joined us a Community Leader as

12th while working & now pursuing BSW in IGNOU. well as a facilitator in Truck Union Basti.

31
Upskill Now & Make a
Successful Transition to Data
Engineer Roles
Reach out to us at
ask@bosscoderacademy.com 

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy