0% found this document useful (0 votes)
5 views3 pages

Course - Data Engineering

The document outlines a comprehensive curriculum for Data Engineering, covering topics such as the evolution of data engineering, lifecycle phases, data architecture, and technology choices. It includes detailed sessions on data generation, storage, ingestion, transformation, and serving data, along with security considerations and future trends in the field. The curriculum is structured to provide a thorough understanding of both theoretical concepts and practical applications in data engineering.

Uploaded by

lucky chikkala
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
5 views3 pages

Course - Data Engineering

The document outlines a comprehensive curriculum for Data Engineering, covering topics such as the evolution of data engineering, lifecycle phases, data architecture, and technology choices. It includes detailed sessions on data generation, storage, ingestion, transformation, and serving data, along with security considerations and future trends in the field. The curriculum is structured to provide a thorough understanding of both theoretical concepts and practical applications in data engineering.

Uploaded by

lucky chikkala
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Curriculum For Data Engineering:

Data Engineering redefines (2 Hours)


 Evolution of Data Engineering
 Data Engineering Skills and Activities
 Data Engineer Responsibilities
 Data Engineers and Roles
Data Engineering Lifecycle (2 hours)
 Generation: Source Systems
 Storage
 Ingestion
 Transformation
 Serving Data
Data Engineering Lifecycle – Undercurrents (2 Hours)
 Security
 Data Management
 Data Ops
 Data Architecture
 Orchestration
 Software Engineering
Data Architecture – Design and Concepts (2 hours)
 Enterprise Architecture Defined
 Data Architecture Defined
 Principles of Good Data Architecture
 Major Architecture Concepts
Examples and Types of Data Architecture (2 hours)
 Data Warehouse
 Data Lake
 Cloud Data Warehouse
 Lambda Architecture
 Kappa Architecture
Choosing Technologies Across the Data Engineering Lifecycle (2 hours)
 Interoperability
 Fin Ops
 Cost Optimization
 Location
 Build vs Buy
 Today vs Future
 Open-Source Software
 Monolith vs Modular
 Serverless vs Server
Data Architecture – Case Study : Dropbox, Cloudflare, Netflix (2 hours)
Data Generation in Source Systems (4 Hours)
 Sources of Data
 Source System Practical Details
 Message Queues and Event-Streaming Platforms
 Whom You’ll Work With
 Undercurrents and Their Impact on Source Systems
Storge (4 Hours)
 Raw Ingredients of Data Storage
 Data Storage Systems
 Data Engineering Storage Abstractions
 Big ideas and trends
 Undercurrents and Their Impact on Storage
Ingestion (5 Hours)
 Data Pipelines Defined
 Key Engineering Considerations for the Ingestion Phase
o Batch Ingestion Considerations
o Message and Stream Ingestion Considerations
 Ways to Ingest Data
 Whom You’ll Work With
 Undercurrents and Their Impact on Ingestion
Transformation (5 Hours)
 Queries
 Modeling
 Transformations
 Data Pipelines
 Undercurrents and Their Impact on Transformations
Serving Data for Analytics, Machine Learning, and Reverse ETL (4 Hours)
 General Considerations for Serving Data
 Analytics
 Machine Learning
 Reverse ETL
 Ways to Serve Data for Analytics and ML
 Whom You’ll Work With
 Undercurrents and Their Impact on Serving Data
Security and Privacy (2 Hours)
 People
 Processes
 Example Security Policy
 Technology
Future of Data Engineering (2 Hours)
 The Data Engineering Lifecycle Isn’t Going Away
 The Decline of Complexity and the Rise of Easy-to-Use Data Tools
 The Cloud-Scale Data OS and Improved Interoperability
 “Enterprisey” Data Engineering
 Titles and Responsibilities Will Morph
 Moving Beyond the Modern Data Stack, Toward the Live Data Stack
Serialization and Compression (2 Hour)
 Serialization Formats
 Database Storage Engines
 Compression: gzip, bzip2, Snappy, Etc.
Cloud Networking (1 Hour)
 Cloud Network Topology
 CDNs
 The Future of Data Egress Fees
Conclusion (2 Hours)

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy