
Kairoskop: A Collective Attention Observatory

Carlos Saritama

Mooorning Everyone!!!

The ancient Greeks had two words for time.
• Chronos (χρόνος): the time that passes. Seconds, hours, years.
• Kairos (καιρός): the time that matters. The moment when everything shifts.

For my Data Engineering Zoomcamp 2026 capstone, I built KAIROSKOP, a real-time data observatory that scans the collective attention of civilization to detect Kairos moments as they happen.

The question it answers:
:point_right: What do collective attention patterns reveal about the psychological state of a civilization?

The architecture ingests 4 independent attention streams simultaneously:
• Wikipedia Recent Changes → what humanity is actively rewriting (collective memory)
• Wikipedia Pageviews → what people seek privately after events (reactive attention)
• GDELT 2.0 → what global media decides exists (institutional amplification)
• arXiv RSS → where the formal knowledge frontier is moving (intellectual attention)

When these 4 independent sources converge on the same topic domain within 24 hours, with no coordinating actor, that convergence is what Carl Jung called synchronicity. KAIROSKOP measures it for the first time with real data.
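To make the convergence idea concrete, here is a minimal sketch of how a cross-source score could be computed. The post does not publish the actual formula, so everything here is an assumption for illustration: the source labels, the `synchronicity_score` function, the event tuple layout, and the choice to score a topic as the fraction of the 4 streams that touched it within the last 24 hours.

```python
from datetime import datetime, timedelta

# Hypothetical labels for the four streams described above.
SOURCES = {"wiki_changes", "wiki_pageviews", "gdelt", "arxiv"}
WINDOW = timedelta(hours=24)  # convergence window named in the post


def synchronicity_score(events, category, window_end):
    """Fraction of the four sources that touched `category`
    in the 24 hours ending at `window_end` (assumed definition)."""
    window_start = window_end - WINDOW
    active_sources = {
        source
        for source, cat, ts in events
        if cat == category
        and source in SOURCES
        and window_start < ts <= window_end
    }
    return len(active_sources) / len(SOURCES)


# Made-up sample events: (source, topic category, timestamp).
events = [
    ("wiki_changes", "climate", datetime(2026, 3, 1, 8, 0)),
    ("wiki_pageviews", "climate", datetime(2026, 3, 1, 12, 0)),
    ("gdelt", "climate", datetime(2026, 3, 1, 20, 0)),
    ("arxiv", "climate", datetime(2026, 3, 2, 6, 0)),
    ("gdelt", "sports", datetime(2026, 3, 2, 7, 0)),
]
now = datetime(2026, 3, 2, 7, 30)

print(synchronicity_score(events, "climate", now))  # all four sources active
print(synchronicity_score(events, "sports", now))   # only one source active
```

In a streaming setting the same logic would more likely live in a windowed aggregation over the enriched events, but the plain-Python version shows the shape of the metric: a score of 1.0 means all four streams landed on the same topic domain inside one 24-hour window.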
The philosophical metrics built on top of the pipeline:
:bar_chart: Synchronicity Score: cross-source convergence (Jung)
:mirror: Shadow Index: what society seeks privately vs what media amplifies (Freud/Lacan)
:brain: Consciousness Level: developmental altitude of collective attention (Ken Wilber)
:satellite_antenna: Medium Dominance: which layer leads the narrative (McLuhan)
:herb: Noospheric Density: total volume of active collective thought (Teilhard de Chardin)

The full stack:
:zap: Kafka (4 topics, real-time streaming)
:zap: Spark Structured Streaming (enrichment + metric computation)
:cloud: GCP: GCS Data Lake + BigQuery (partitioned by date, clustered by source + category)
:wrench: Bruin for transformations, orchestration, quality checks, and lineage
:building_construction: Terraform for infrastructure as code
:bar_chart: Looker Studio dashboard (2 tiles: categorical distribution + synchronicity time series)

What Bruin changed for me: I've used dbt + Airflow stacks before. Bruin replaced both with a single tool that gave me column-level lineage, quality assertions, and orchestration out of the box, without the overhead of managing two separate systems. The AI Data Analyst let me query my philosophical metrics in natural language and immediately see whether the Shadow Index was behaving as theorised.

The result is a pipeline that would have taken 3 weeks to build with a traditional stack. Built in days.

Data engineering isn't just infrastructure. At its best, it's a lens through which we make the invisible visible. KAIROSKOP makes the mind of civilization visible.

:link: GitHub:

#DataEngineering #Kafka #Spark #BigQuery #Bruin #DEZoomcamp2026 #CollectiveIntelligence #DataScience
