GitHub Repository

The GitHub repository is the most important resource for this course. Use it to navigate through all course materials.

https://github.com/DataTalksClub/data-engineering-zoomcamp


How to Use the Repository

  1. Start in the module folder you’re working on
  2. Read the README in that folder for an overview
  3. Follow the links to video lectures
  4. Complete homework assignments
  5. Check the cohort folder for any cohort-specific materials

The repository is your primary navigation tool. Each module README links directly to the relevant videos and resources you need.
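If you prefer to browse a local copy, the first two steps above can be sketched in Python. This is a minimal sketch, not part of the course materials: it assumes you have already cloned the repository, that each overview file is named README.md, and the function name folders_with_readme is hypothetical.

```python
from pathlib import Path


def folders_with_readme(repo_root):
    """Return the names of top-level folders in repo_root that contain a README.md.

    Assumption: overview files are named exactly "README.md".
    """
    root = Path(repo_root)
    return sorted(
        entry.name
        for entry in root.iterdir()
        if entry.is_dir() and (entry / "README.md").is_file()
    )
```

Calling folders_with_readme("data-engineering-zoomcamp") from the directory that holds your clone (the path is an assumption about where you cloned it) would list the folders whose README you can open first.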

Repository Structure

Each module has its own folder, which contains everything you need:

  • Course notes and examples
  • Homework assignments
  • Links to relevant video lectures
  • Code samples and notebooks

Cohorts

The cohorts/ folder contains materials specific to each edition of the course:

2026 Cohort

The 2026 cohort folder contains: