
U.S. Crude Oil Production Analytics Pipeline

Arsi Amallah Binhaq

The primary goal of this project is to build a robust and scalable data pipeline that tracks crude oil production across different U.S. states using publicly available data from the Energy Information Administration (EIA). The pipeline handles end-to-end data processing, from ingestion of raw datasets to transformation into analytics-ready tables, enabling analysis of production trends across regions and over time to support data-driven insights in the energy sector.

Bruin Features Used

This project leverages Bruin to simplify and manage the data pipeline using:

Data Ingestion: Integrating raw EIA datasets directly into the data warehouse as part of the pipeline

Data Transformation: SQL-based processing to clean, structure, and aggregate crude oil production data

Pipeline Execution (bruin run): Running the full pipeline with automatic dependency handling

Bruin AI: Assisting in SQL development and accelerating pipeline implementation
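To make the transformation step more concrete, here is a minimal sketch of the kind of SQL aggregation described above: dropping missing observations and rolling monthly production up to one row per state and year. It is not the project's actual code and does not use Bruin's API. The table name `raw_production`, its columns, and the sample values are all hypothetical placeholders (the numbers are invented, not EIA figures), and an in-memory SQLite database stands in for the data warehouse.

```python
import sqlite3

# Placeholder rows: (state, period as YYYY-MM, production).
# These values are invented for illustration only; they are NOT EIA data.
RAW_ROWS = [
    ("TX", "2023-01", 100.0),
    ("TX", "2023-02", 110.0),
    ("ND", "2023-01", 30.0),
    ("ND", "2023-02", None),  # missing observation, filtered out below
    ("TX", "2024-01", 120.0),
]


def build_yearly_production(rows):
    """Load raw rows into SQLite and aggregate them per state and year."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical raw table; the real pipeline's schema may differ.
    conn.execute(
        "CREATE TABLE raw_production (state TEXT, period TEXT, production REAL)"
    )
    conn.executemany("INSERT INTO raw_production VALUES (?, ?, ?)", rows)

    # Clean (drop NULLs), structure (derive year), aggregate (sum per group).
    query = """
        SELECT state,
               CAST(substr(period, 1, 4) AS INTEGER) AS year,
               SUM(production) AS total_production
        FROM raw_production
        WHERE production IS NOT NULL
        GROUP BY state, CAST(substr(period, 1, 4) AS INTEGER)
        ORDER BY state, year
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result


yearly = build_yearly_production(RAW_ROWS)
for state, year, total in yearly:
    print(state, year, total)
```

In a pipeline like the one described, a query of this shape would typically live in its own SQL asset that depends on the ingested raw table, so that running the pipeline executes ingestion before the aggregation.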
