Criteo + Bruin

Source

Ingest Criteo data into your warehouse with incremental loading, quality checks, and full lineage. Defined in YAML, version-controlled in Git.

For business teams

What you get

  • Cross-channel ad reporting

    See Criteo spend alongside Google Ads, Facebook, and every other channel — in one place, updated automatically.

  • True ROAS, not estimated

    Join Criteo spend with actual revenue from Stripe or your CRM. Know your real return on ad spend, not what the ad platform tells you.

  • No more manual exports

    Stop downloading CSVs from Criteo. Stakeholders get fresh data every morning without asking anyone.

  • Catch budget anomalies early

    Quality checks flag unexpected spend spikes or zero-impression campaigns before they burn budget.
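
The "true ROAS" idea above is plain arithmetic once spend and revenue sit in the same warehouse: total revenue divided by total spend per campaign. The sketch below is illustrative only and is not Bruin code. It assumes, hypothetically, that revenue rows from Stripe or a CRM already carry a `campaign_id` to join on; the row shapes and the `true_roas` helper are invented for the example.

```python
from collections import defaultdict


def true_roas(spend_rows, revenue_rows):
    """Return revenue / spend per campaign, joining on campaign_id."""
    spend = defaultdict(float)
    revenue = defaultdict(float)
    for row in spend_rows:
        spend[row["campaign_id"]] += row["spend"]
    for row in revenue_rows:
        revenue[row["campaign_id"]] += row["revenue"]
    # ROAS is undefined when nothing was spent, so report None there.
    return {
        cid: (revenue[cid] / total if total > 0 else None)
        for cid, total in spend.items()
    }


# Hypothetical sample rows: Criteo spend and attributed revenue.
spend_rows = [
    {"campaign_id": "c1", "spend": 100.0},
    {"campaign_id": "c1", "spend": 50.0},
    {"campaign_id": "c2", "spend": 0.0},
]
revenue_rows = [
    {"campaign_id": "c1", "revenue": 450.0},
]

roas = true_roas(spend_rows, revenue_rows)
print(roas)
```

In a warehouse the same calculation would typically be a SQL join between the Criteo table and the revenue table; how revenue is attributed to a campaign is a modelling decision this sketch does not cover.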

For data & engineering teams

How it works

  • Incremental sync with lookback

    Bruin re-fetches recent days on each sync, so conversions that Criteo attributes late are picked up. Set the lookback days in the connection URI; no custom logic is needed.

  • YAML-defined, Git-versioned

    Your Criteo pipeline is a YAML file. Review in PRs, deploy with CI/CD, roll back with git revert.

  • Column-level quality checks

    Validate spend, impressions, and clicks with not_null, unique, and custom SQL checks. Pipeline stops on failure.

  • Multi-destination support

    Land Criteo data in BigQuery, Snowflake, Redshift, or DuckDB. Switch destinations by changing one line.
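
To make the lookback idea from the first bullet concrete, here is a minimal sketch of how a lookback window is commonly computed. It is a generic illustration, not Bruin's implementation: the `sync_window` helper and its parameter names are hypothetical, and the actual name of the lookback setting in the connection URI is not shown on this page.

```python
from datetime import date, timedelta


def sync_window(last_synced, today, lookback_days):
    """Return the inclusive (start, end) date range to re-fetch.

    Starting `lookback_days` before the last sync means rows that
    changed late (for example, late-attributed conversions) are
    fetched again and can overwrite the earlier values.
    """
    if lookback_days < 0:
        raise ValueError("lookback_days must be non-negative")
    start = last_synced - timedelta(days=lookback_days)
    return start, today


start, end = sync_window(date(2024, 3, 10), date(2024, 3, 11), lookback_days=7)
print(start, end)
```

A longer lookback re-reads more days per run in exchange for picking up later corrections; the right value depends on the attribution window configured in Criteo.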

Before you start

Criteo account with API access
API credentials from Management Center

Step 1

Add your Criteo connection

Connect using Criteo API credentials. Add this to your Bruin environment file — credentials are stored securely and referenced by name in your pipeline YAML.

Parameters

  • client_id: Criteo API client ID
  • client_secret: Criteo API client secret

connections:
  criteo:
    type: criteo
    uri: "criteo://client_id:client_secret"

Step 2

Create your pipeline

Define a YAML asset that tells Bruin what to pull from Criteo and where to land it. This file lives in your Git repo — reviewable, version-controlled, and deployable with CI/CD.

Available tables

campaigns, ad_sets, ads, statistics, audiences

name: raw.criteo_campaigns
type: ingestr

parameters:
  source_connection: criteo
  source_table: 'campaigns'
  destination: bigquery

# Syncs campaign spend, impressions, clicks,
# and conversions incrementally.
# Backfill: bruin run --start-date 2024-01-01

Step 3

Add quality checks

Validate Criteo data on every sync. Catch negative spend, impossible click-to-impression ratios, and missing campaign IDs before they reach your reports.

  • Catch negative ad spend before it reaches reports
  • Validate impressions >= clicks on every sync
  • Flag campaigns with missing IDs or null spend

columns:
  - name: campaign_id
    checks:
      - name: not_null
  - name: spend
    checks:
      - name: not_null
  - name: impressions
    checks:
      - name: not_null

custom_checks:
  - name: no negative ad spend
    query: |
      SELECT COUNT(*) = 0
      FROM raw.criteo_campaigns
      WHERE spend < 0
  - name: impressions >= clicks
    query: |
      SELECT COUNT(*) = 0
      FROM raw.criteo_campaigns
      WHERE clicks > impressions
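
To see what the two custom check queries evaluate to, the sketch below runs them unchanged against a tiny in-memory SQLite table. This is only a local illustration of the SQL logic; it does not use Bruin, the sample rows are made up, and SQLite stands in for the warehouse (boolean results come back as 0 or 1 there, which may differ on other engines).

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Attach a second in-memory database named "raw" so the
# schema-qualified table name from the checks resolves.
conn.execute("ATTACH DATABASE ':memory:' AS raw")
conn.execute(
    "CREATE TABLE raw.criteo_campaigns "
    "(campaign_id TEXT, spend REAL, impressions INTEGER, clicks INTEGER)"
)
conn.executemany(
    "INSERT INTO raw.criteo_campaigns VALUES (?, ?, ?, ?)",
    [
        ("c1", 120.5, 1000, 40),
        ("c2", -5.0, 500, 10),  # negative spend: violates the first check
        ("c3", 30.0, 200, 20),
    ],
)

# The same queries as in the custom_checks section above.
checks = {
    "no negative ad spend":
        "SELECT COUNT(*) = 0 FROM raw.criteo_campaigns WHERE spend < 0",
    "impressions >= clicks":
        "SELECT COUNT(*) = 0 FROM raw.criteo_campaigns WHERE clicks > impressions",
}

# Each query returns a single truthy/falsy value: truthy means no bad rows.
results = {
    name: bool(conn.execute(query).fetchone()[0])
    for name, query in checks.items()
}
print(results)
```

With the sample rows, the negative-spend check evaluates to false because of `c2`, while the impressions check evaluates to true.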

Step 4

Run it

One command. Bruin connects to Criteo, pulls data incrementally, runs your quality checks, and lands clean data in your warehouse. If a check fails, the pipeline stops, so bad data never reaches downstream tables or dashboards.

  • Backfill historical data with --start-date
  • Schedule with cron or trigger from CI/CD
  • Full lineage from Criteo to your dashboards

$ bruin run .
Running pipeline...

  criteo_campaigns
    ✓ Fetched 2,847 new records
    ✓ Quality: campaign_id not_null     PASSED
    ✓ Quality: spend not_null           PASSED
    ✓ Quality: no negative ad spend     PASSED
    ✓ Loaded into bigquery

  Completed in 12s
