Data Engineering Zoomcamp 2026

Data Engineering Zoomcamp returns for its 5th edition. It’s a free, intensive course designed to take you from data engineering fundamentals to building production-grade data pipelines. It’s run by DataTalks.Club, a community dedicated to AI and Data education.

The course runs for 10 weeks: 7 weeks of modules plus 3 weeks for the final project. You’ll learn to build production-grade data pipelines from scratch using industry-standard tools.

Data Engineering Zoomcamp

See the GitHub repository for the full curriculum and detailed module information.

No previous data engineering experience required. Join thousands of students learning data engineering together.

How to Use These Docs

Read in this order:

  1. Community Guidelines - code of conduct, how to ask questions, how to use Slack channels, how to promote your work.
  2. Zoomcamp Logistics - how DataTalks.Club zoomcamps work in general (cohort schedule, joining, live sessions, homework, project, peer review, certification). Most of your logistical questions are answered there.
  3. The pages in this section - what is specific to the Data Engineering Zoomcamp (curriculum, GCP setup, dataset, project rubric).
  4. The Data Engineering Zoomcamp FAQ - module-specific and technical questions from previous cohorts.

For platform mechanics (where to click on the submission form, how the leaderboard appears), see Course Management Platform.


Table of contents