Final Project
For the cross-course logistics (attempts, deadlines, peer review, certification mechanics), see Final Project (Zoomcamp Logistics).
This page covers what is specific to the Data Engineering Zoomcamp project.
What you build
Build an end-to-end data pipeline with a dashboard using any dataset, tools, and cloud provider you choose.
Over two to three weeks, create a pipeline that:
- Ingests data from your chosen source.
- Processes and stores it in a data lake.
- Moves it to a data warehouse.
- Transforms it for analysis.
- Visualizes results in a dashboard.
Your choices
You decide:
- Dataset (any source you are interested in - see project datasets on GitHub for suggestions).
- Processing approach (batch or stream).
- Cloud provider (AWS, GCP, Azure, or even local with DuckDB).
- Tools (Terraform, Airflow, Kestra, Spark, Kafka, dbt, etc.).
You are not restricted to the technologies covered in the course. If the course used GCP and you want to use AWS, that is fine. Document everything in detail so peer reviewers can understand your choices.
Datasets you cannot use
The NYC taxi dataset is used throughout the course modules and homework. You cannot use it for your project. Pick any other dataset.
Dashboard
Create a dashboard with at least two visualizations. You can use Looker Studio, Streamlit, Metabase, Superset, or any BI tool of your choice.
If your dashboard is private, include a screenshot in the README. A live link is preferred where possible.
Evaluation criteria
Your project will be assessed on:
- Problem description and clarity.
- Cloud and infrastructure-as-code usage.
- Data ingestion pipeline (batch or stream).
- Data warehouse optimization.
- Data transformations.
- Dashboard quality.
- Code reproducibility and documentation.
The full rubric (with point values per criterion) is in the project README on GitHub.
Certification
Submit the project and complete peer reviews. That is the only requirement to graduate.

What is not required:
- Homework scores. Homework is for learning, not grading.
- Live session attendance.
- Leaderboard points.
For when the certificate is issued and how to add it to LinkedIn, see Certification (Zoomcamp Logistics).