Course Content
Module 1: Introduction to Data Engineering
5 Topics
What is Data Engineering?
Roles & responsibilities of a Data Engineer
Overview of cloud platforms: AWS, Azure, GCP
Core concepts: ETL vs ELT, batch vs streaming, data lakes vs warehouses
Tools used in cloud data engineering
Module 2: Cloud Fundamentals
4 Topics
Cloud computing models (IaaS, PaaS, SaaS)
Cloud storage basics (object, file, block)
Compute services (VMs, containers, serverless)
Networking and security basics
Module 3: Data Storage Systems in the Cloud
5 Topics
AWS: S3, RDS, Redshift, DynamoDB
Azure: Blob Storage, Data Lake Gen2, SQL DB, Cosmos DB
GCP: Cloud Storage, BigQuery, Firestore
Data partitioning and clustering
Data modeling basics
Module 4: Data Ingestion Tools
3 Topics
File-based ingestion (CSV, JSON, Parquet)
Real-time ingestion tools:
Batch ingestion using:
Module 5: Data Transformation & Processing
5 Topics
ETL vs ELT strategies
Using Apache Spark and PySpark
Databricks on Azure and AWS
SQL-based transformation (BigQuery, Snowflake, Redshift)
Processing data at scale (partitioning, bucketing)
Module 6: Workflow Orchestration
3 Topics
Introduction to Apache Airflow
DAGs, tasks, dependencies
Cloud-native orchestration:
Module 7: Real-Time Data Processing
3 Topics
Concepts: streaming, windowing, watermarking
Tools:
Use cases: real-time dashboards, alerting systems
Module 8: Data Warehousing and Analytics
4 Topics
Redshift, BigQuery, Snowflake, Synapse
Star/snowflake schema design
Writing optimized SQL queries
BI tool integration (Tableau, Power BI, Looker)
Module 9: Data Governance, Security & Monitoring
5 Topics
IAM roles and access control
Data encryption at rest & in transit
Logging and monitoring:
Data quality and validation tools
Introduction to Data Catalogs and Lineage Tracking (e.g., AWS Glue Data Catalog, Purview, Dataplex)
Module 10: DevOps and CI/CD for Data Pipelines
3 Topics
Version control with Git
CI/CD for data workflows (GitHub Actions, Jenkins, Cloud-native tools)
Infrastructure as code: Terraform, CloudFormation
Module 11: Capstone Project
1 Topic
Build an end-to-end cloud data pipeline:
Includes
11 Lessons
41 Topics