Back to Showcase

UK Retail Analytics Pipeline

An end-to-end data engineering pipeline built with Bruin and DuckDB for the UK Online Retail II dataset (1,067,371 real transactions). The pipeline ingests raw Excel data through a Python asset, cleans it in a staging layer, and produces 5 analytical mart tables covering monthly revenue trends, product performance with return rate analysis, RFM customer segmentation, country analysis, and cancellation tracking. Features 66 automated quality checks across all 7 assets, GitHub Actions CI/CD, and AI analysis via the Bruin AI Data Analyst. Key finding: one product ranks 4th by revenue at £168K but has a 100% return rate.

Share: