What is the Data Engineering Zoomcamp?
Data Engineering Zoomcamp is a free, hands-on curriculum by DataTalks.Club that covers core data engineering topics including batch processing, streaming, orchestration, and analytics workflows.
Use this guide after the free data engineering bootcamp by DataTalks.Club to turn the Bruin module into a working portfolio project: ingestion, SQL/Python transforms, checks, lineage, and a path to production.
The Data Engineering Zoomcamp teaches the moving parts of a modern data stack. The Bruin module shows how those parts fit in one project: ingest data with ingestr, model it in SQL and Python, add quality checks, review lineage, and run the pipeline without stitching a separate scheduler around it.
The Data Engineering Zoomcamp takes you through the full lifecycle of a data engineering project. Across multiple modules, you'll work with tools like Docker, Terraform, BigQuery, Spark, and Kafka - learning how each one fits into a modern data stack.
In the Bruin path, the useful next step is simple: take one dataset, build the ingestion and transformation flow, add checks that would catch bad data, then make the project easy for another engineer to run locally or in CI.
In the Bruin module, you'll build an end-to-end pipeline without spreading the work across separate tools for ingestion, orchestration, transformation, and tests. The point is not to hide the data engineering work. It's to keep the project in one place so dependencies, checks, and deployment are easier to inspect.
If you're evaluating workflow options after the module, compare practical migration paths in our alternatives to Airflow guide.
Data Engineering Zoomcamp is a free, hands-on curriculum by DataTalks.Club that covers core data engineering topics including batch processing, streaming, orchestration, and analytics workflows.
No. The official Zoomcamp curriculum lives on DataTalks.Club. This page is Bruin's companion guide for learners who want to complete the Bruin module and turn the work into a production-style project.
Bruin helps you run ingestion, SQL/Python transforms, and data quality checks with a single workflow, so you can focus on pipeline outcomes instead of stitching many tools together.
Start with one end-to-end ingestion and transformation pipeline. A good project has a real source, a clear destination table, a few checks that would catch broken data, and a README that explains how to run it again.
Join the Bruin project competition, publish your project in the showcase, and iterate on production-ready patterns such as checks, lineage, scheduling, and governance.