Gusto + Bruin

Source

Ingest Gusto data into your warehouse with incremental loading, quality checks, and full lineage. Defined in YAML, version-controlled in Git.

For business teams

What you get

People analytics beyond HR tools
Join Gusto data with finance and project data. See fully-loaded team cost, hiring ROI, and attrition trends.
Headcount planning with real data
Combine Gusto org data with budget and project data. Plan headcount based on actual numbers, not estimates.
Compliance-ready data
Quality checks validate that required fields are present, records are consistent, and org hierarchy is valid.
Faster reporting cycles
Gusto data syncs automatically. HR and finance get fresh data without waiting for someone to pull a report.

For data & engineering teams

How it works

Automatic schema handling
Bruin detects Gusto schema changes and handles them automatically. No manual migration scripts.
YAML-defined, Git-versioned
Your Gusto pipeline is a YAML file. Review in PRs, deploy with CI/CD, roll back with git revert.
Hierarchy validation
Custom SQL checks validate manager-employee relationships and catch orphaned records in Gusto org data.
Incremental sync
Only sync new and changed Gusto records. Full org structure stays in sync without re-processing everything.

Before you start

Gusto account with API access

OAuth application registered

Company admin permissions

Step 1

Add your Gusto connection

Connect using Gusto OAuth credentials. Add this to your Bruin environment file, credentials are stored securely and referenced by name in your pipeline YAML.

Parameters

client_idGusto OAuth client ID
client_secretGusto OAuth client secret
refresh_tokenOAuth refresh token

connections:
  gusto:
    type: gusto
    uri: "gusto://client_id:client_secret?refresh_token=token"

Step 2

Create your pipeline

Define a YAML asset that tells Bruin what to pull from Gusto and where to land it. This file lives in your Git repo, reviewable, version-controlled, and deployable with CI/CD.

Available tables

employeespayrollsbenefitsdepartmentslocationscontractors

name: raw.gusto_employees
type: ingestr

parameters:
  source_connection: gusto
  source_table: 'employees'
  destination: bigquery

# Syncs employee records, org structure,
# and HR data with automatic schema handling.

Step 3

Add quality checks

Validate Gusto data on every sync. Catch invalid org hierarchies, missing departments, and orphaned employee records automatically.

Validate manager-employee hierarchy is valid

Catch employees with null departments

Ensure employee IDs are unique across syncs

columns:
  - name: employee_id
    checks:
      - name: not_null
      - name: unique
  - name: email
    checks:
      - name: not_null
  - name: department
    checks:
      - name: not_null

custom_checks:
  - name: no employees without manager
    query: |
      SELECT COUNT(*) = 0
      FROM raw.gusto_employees
      WHERE department IS NOT NULL
        AND manager_id IS NULL
        AND role != 'CEO'

Step 4

Run it

One command. Bruin connects to Gusto, pulls data incrementally, runs your quality checks, and lands clean data in your warehouse. If a check fails, the pipeline stops, bad data never reaches downstream.

Backfill historical data with --start-date

Schedule with cron or trigger from CI/CD

Full lineage from Gusto to your dashboards

$ bruin run .

Running pipeline...

  gusto_employees
    ✓ Fetched 2,847 new records
    ✓ Quality: campaign_id not_null     PASSED
    ✓ Quality: spend not_null           PASSED
    ✓ Quality: no negative ad spend     PASSED
    ✓ Loaded into bigquery

  Completed in 12s