Data Engineering Brochure New
Data Engineering Brochure New
CURRICULUM
Brochure
Search / Ctrl+ Student’s name
B atc
h : -
Home
Data Engineering
You have access to 8 modules in software engineering course.
Onboarding
Core Data ETL & Data Big Data & Advanced Beginners
Course Foundations Warehousing Cloud Data Ops Lectures 10,000
Curriculum 5 Classes Completed 5 Classes Completed 5 Classes Completed 5 Classes Completed 5 Classes Completed
My Mentor
View Module View Module View Module View Module View Module 2100 +
assignments
Schedule 60%
uestions
Contests
Q &
answers
Placements 25
Mentors i h p
session
Leader Board
Rewards
Pay Method
Refer & Earn
Doubt Support
Logout
www.bosscoderacademy.com
About Bosscoder
Bosscoder is an outcome-focused upskilling platform for tech professionals.
Our industry-approved approach towards teaching helps our learners upskill
& transition into their dream tech roles.
Within our program, you will learn from live classes hosted by instructors
working in Data Engineering roles with top product companies.
1
Table of Contents
Content Page No.
Landscape of Data 3
Bosscoder Curriculum 6
Curriculum Deep-dive 7
Projects 16
2
Landscape of Data
Today, we're living in a world that's all about data. From the food we eat, the
music we listen to, to the health care services we receive - everything is
being shaped by data.
And this is where the magic of Data Engineering and machine learning comes
into play.
However, the best part is data revolution isn't limited to just one or two
industries. It's everywhere. This means, as Data Engineer, the opportunities
are immense.
Data is the new oil. And the ability to extract insights from this data is a
superpower.
High Compensation
High Impactful work
Future Growth in the Tech Sector
3
However, learners face many challenges while upskilling →
2. Feel overwhelmed by the vastness of the Data Engineering domain & not
sure where to start, or where to end.
3. Lack of mentorship & community support to help them provide upskilling &
opportunities to get them successfully transition to Data Engineering roles.
4
Why upskill with us?
Enhance Your Career With These Enriching Aspects of
Bosscoder's Upskilling Program
Hyper-Personalized Learning
5
Bosscoder Curriculum
Topics Total Duration
6
Bosscoder Curriculum
Topics Total Duration
DSA 6 weeks
7
Module - 1
Topics Covered:
1). Python
8
2). SQL
Introduction to Databases & BigQuery Setup
Extracting data using SQL
Functions, Filtering & Subqueries
Joins
GROUP BY & Aggregation
Window Functions
Date and Time Functions & CTEs
Indexes & Partitioning
Normalisation & Transactions
Introduction to NoSQL: MongoDB
9
Module - 2
Topics Covered:
12
Distributed Databases
CAP Theorem, consistency, availability, partition
tolerance
Cassandra, HBase: Columnar data stores for large-
scale datasets
Real-World Big Data Pipeline
Design and implement a basic pipeline using
Hadoop or Spark
Data storage, transformations, and querying
AWS
AWS EMR
OnPrem vs Cloud
HDFS vs S3
What is S3
EC2
Elastic IP
AWS storage, networking
S3 and EBS
AWS Glue
AWS Redshift
13
AZURE
Azure Data Factory
Azure Databricks
Azure Synapse Analytics
Azure Blob Storage
Linux
Introduction to Linux
File system navigation
Process Management
Shell Scripting
System configuration and advanced Linux
commands
14
Module - 3
Topics Covered:
Star schem
Snowflake schema
Introduction to cloud data warehouses: Redshift,
BigQuery
OLAP vs OLTP
10
USPs of our Delivery
11
Module - 4
Data encryption
Authentication and RBAC
15
USPs of our Delivery
Impactful projects like Real-Time Financial Data Processing
for Goldman Sachs, Fraud Detection Data Pipeline for PayPal
and multiple others.
1:1 discussion with your mentor regarding project
improvements.
16
Module - 5
Topics Covered:
1). DSA
Arrays, hashmaps
Stacks, queues
Trees (binary trees, heaps)
Graphs, sorting (QuickSort, MergeSort)
Time and space complexity
17
Why You'll Love This Module
Master DSA and system design for data platforms.
Tackle real-world scalability and system challenges.
Build high-performing, reliable data systems.
18
Module - 6
Resume Creatio
LinkedIn profile optimizatio
Profile creation on other platforms
Outcome
You getting placed at one of the top tech companies like
Google, Microsoft, Amazon, Apple & sharing us a personal
review of your journey with us.
20
Projects
#1
Sales Data ETL Pipeline for Nike
Build a basic ETL pipeline to process Nike’s sales data from CSV
files and APIs. Use Apache Airflow to automate the extraction,
transformation, and loading of sales data into a PostgreSQL
database. Perform basic data aggregations to analyze sales trends
by region and product category, offering insights to improve sales
strategy.
35 hours
#2
40 hours
21
#3
Inventory Management Optimization for Home
Depot
Design a pipeline to optimize Home Depot’s inventory by ingesting
stock data from databases and supplier APIs. Use Apache NiFi for
data ingestion and Apache Spark for trend analysis. The final results
will be displayed in a real-time dashboard using Google Data Studio,
helping Home Depot adjust inventory levels based on sales trends.
30 hours
#4
Traffic Data Analytics for Uber
Build a simplified pipeline to process real-time Uber traffic data.
Capture traffic data streams using Apache Kafka and use Apache
Spark to compute real-time congestion levels and route
optimization. Store the results in Azure Synapse Analytics for
further analysis and generate route efficiency reports for Uber
drivers.
40 hours
22
#5
Building a Real-Time Ad Analytics Platform for
Facebook
Create a real-time data pipeline to analyze Facebook ad
performance. Use Apache Kafka to collect streaming ad data,
process it using Apache Spark Streaming for real-time metrics, and
store the processed data in Google BigQuery for deeper analysis.
Build an interactive dashboard using Tableau for monitoring ad
performance across various demographics and geographies.
40 hours
#6
Fraud Detection Data Pipeline for PayPal
Develop a fraud detection pipeline that processes real-time
transaction data. Use Apache Kafka for streaming data and process
it using Apache Flink to identify fraudulent patterns. Store the
results in Google BigQuery, enabling PayPal to respond quickly to
potential fraud cases.
40 hours
23
#7
Big Data Pipeline for E-commerce
Personalization for Amazon
Build a big data pipeline that processes customer data at scale to
deliver personalized product recommendations on Amazon. Use
Apache Hadoop for distributed data storage and Spark for data
processing. Implement recommendation algorithms using machine
learning libraries in Spark and integrate the output into Amazon S3
for fast retrieval.
45 hours
#8
Real-Time Financial Data Processing for
Goldman Sachs
Develop a high-frequency trading data pipeline for processing and
analyzing real-time stock market data. Use Apache Kafka for
capturing real-time data streams from financial markets, apply
Apache Flink for complex event processing (CEP), and store the
processed data in AWS Redshift for real-time analysis. Provide
insights to assist in trading strategies.
45 hours
24
Meet Your Instructors &
Mentors
Learn from Industry veterans
Our instructors and mentors are highly rated by working professionals
upskilling with us, they are working at top product companies & have
experience working on industry scale Data Engineering projects. They are
also familiar with the best ways to crack these companies.
Rajat Garg
Co-founder - Bosscoder, Ex - Microsoft
Introducing Rajat Garg, an esteemed graduate of NIT Delhi from
the class of 2019. Rajat embarked on his professional journey at
Microsoft, playing a pivotal role in expanding PowerPoint's web
services, catering to an impressive 200 million monthly active
users. With a solid foundation of six years in teaching, Rajat is
dedicated to education and mentorship.
Manish Garg
Co-founder of Bosscoder, Ex - Samsung, Quoori
Presenting Manish Garg, an IIT Dhanbad alumnus, has had an
impressive tech career. From Samsung's Machine Learning
engineer to collaborating with Android's founder at Essential and
leading Quoori's speech department, his journey is remarkable.
With nine years of teaching experience, Manish is dedicated to
education and mentorship.
25
Meet Your Instructors &
Mentors
Parijat Roy
Senior Data Scientist, Microsoft
Meet Parijat Roy, a seasoned data scientist from Microsoft and
Jadavpur University alum with 8 years of industry experience.
Transitioning from software engineering to data science, Parijat
specializes in Natural Language Processing (NLP) for analyzing
feedback and improving Net Promoter Scores (NPS) for Office
products. His diverse skill set makes him an ideal mentor for
aspiring data scientists.
Sankalp Tomar
Senior Data Scientist at Microsoft
Introducing Sankalp Tomar, with 10 years in the tech industry, now
a Senior Data Scientist at Microsoft. He transitioned from a System
Engineer at Infosys to a key role in Microsoft's Graphics team,
focusing on creating images from text in Office and enhancing
features in Microsoft Edge. Sankalp's diverse expertise and
journey through different tech roles make him an insightful mentor
for aspiring data scientists.
26
Alumni’s Thoughts about
Bosscoder
Pulkit Gupta
Ankur Raj
Akshit Aggarwal
Udit Sharma
27
Alumni’s Thoughts about
Bosscoder
Chaitanya
Garima Gogia
“Earlier, I struggled finding the right topics to study, and I did not
have any path to follow. What stood out for me at Bosscoder was
the detailed curriculum, covering all topics. It went from basics to
advanced topics.”
Himanshu Bhaware
Karthik P
28
Alumni’s Thoughts about
Bosscoder
Rohal Kurup
Vamsi Kesav
Ananda
Vritika Chaudhary
“I was always confused about what to study and what not to study
for DSML interviews then I decided to join Bosscoder Academy.
Mentors here are very supportive. Anyone who religiously follows
the classes and is consistent can get his concepts very clear.
29
Alumni’s Thoughts about
Bosscoder
Rakesh
Arun VS
Shubhankar Singh
30
BOSSCODER
ACADEMY
Empowering
story
2023 2023
Sandhya's
story
Sandhya dropped out of school & had to do
Amit (here, age -34) studied in school till 5 but
household chores & begging to support her family. has to drop out to earn and support his family.
1st girl in Basti to go out to Pursue education, completed
Amit has now joined us a Community Leader as
12th while working & now pursuing BSW in IGNOU. well as a facilitator in Truck Union Basti.
31
Upskill Now & Make a
Successful Transition to Data
Engineer Roles
Reach out to us at
ask@bosscoderacademy.com