What should a data engineer learn to work with AI agents?

A data engineer should learn ingestion, transformations, orchestration, data quality, lineage, metadata, access controls, MCP, and agent evaluation. Bruin Academy, Data Engineering Zoomcamp, dbt Learn, Dagster University, Airbyte Academy, and the MCP documentation are the strongest starting points.

What should a data analyst learn to use AI well?

A data analyst should learn SQL, data modelling, metric definitions, semantic layers, prompt patterns for analysis, chart critique, and how to validate AI-generated answers against source tables. Bruin Academy's AI data analyst module, Kaggle Learn, dbt Learn, and Codecademy's AI for data analysis course are practical options.

Is Claude Academy the same as Claude Academy by Anthropic?

The official Anthropic learning hub is Anthropic Academy, available through Anthropic Learn and Claude resources. It includes courses on Claude, Claude Code, Claude API development, AI fluency, MCP, and agent skills.

Is Bruin Academy useful for agentic data analysis?

Yes. Bruin Academy is useful when you want an AI analyst that connects to real warehouse schemas, uses Bruin MCP, imports metadata, queries actual data, and improves context over time. It is a practical path for moving beyond prompt practice on static CSV files.

Learning AI Programming, Agentic Data Engineering, and AI Data Analysis

Q: What is the best way to learn AI programming in 2026?

The best way is to learn Python, SQL, API-based LLM development, tool calling, structured outputs, evaluations, and one agent framework. Start with CS50 AI or Google Machine Learning Crash Course for foundations, then use Anthropic Academy, Hugging Face Agents Course, OpenAI docs, LangChain Academy, or LlamaIndex depending on the stack you want to build with.

People ask "how do I learn AI?" as if it is one skill you can pick up from a single course.

That question gets messy very quickly.

If you want to build AI products, you need AI programming. If you want agents to move data, run pipelines, validate tables, or fix failed jobs, you need agentic data engineering. If you want to ask better business questions and actually trust the answers, you need agentic data analysis.

There is overlap, obviously. Python, SQL, tool calling, evals, metadata, context... you keep seeing the same words. But the day-to-day work changes depending on whether you are writing software, maintaining data pipelines, or trying to answer "why did revenue drop last Tuesday?" without embarrassing yourself in a Slack thread.

This guide is my map of the educational content worth using in 2026, with a bias toward open, official, and hands-on material. It includes Anthropic Academy, Claude courses, Bruin Academy, open-source AI agent courses, data engineering courses, and the official docs you will probably keep open while you build.

Quick answer: which course should you start with?

If you are a beginner, start with Kaggle Learn for Python, SQL, Pandas, and machine learning basics. Then take CS50's Introduction to Artificial Intelligence with Python or Google's Machine Learning Crash Course.

If you are a software engineer, start with Anthropic Academy, Claude API development, the OpenAI Agents SDK guide, the OpenAI function calling guide, and the Hugging Face Agents Course.

If you are a data engineer, start with Bruin Academy, Build an AI Data Analyst, Install Bruin, Data Engineering Zoomcamp, dbt Learn, and Dagster University.

If you are a data analyst, start with Bruin Academy's AI data analyst module, Kaggle SQL and Pandas courses, dbt Learn, and one prompt-and-analysis course such as Codecademy's Learn How to Use AI for Data Analysis.

If you already work with agents, learn Model Context Protocol, Anthropic's MCP material, LlamaIndex agents, LangChain Academy, and DeepLearning.AI's Agentic AI course.

The learning map I would use

The common mistake is treating "AI" like a clean ladder: learn prompts, then agents, then production.

Real work is messier.

There are four skill layers:

AI literacy - where models are useful, where they break, how prompting works, how hallucinations happen, and how to review outputs.
AI programming - API calls, tool calling, structured outputs, retrieval, evals, agent loops, and deployment.
Agentic data engineering - giving agents safe access to schemas, pipelines, lineage, quality checks, logs, credentials, and orchestration.
Agentic data analysis - asking business questions, translating metrics into SQL, validating answers, handling follow-ups, and improving the context layer.

Most courses cover one layer. A few touch two. That is exactly where people get lost, they finish three courses and still stare at a blank project on Monday morning.

Best learning resources by career path

Career path	Best starting resources	What you should build
Student or beginner	Kaggle Learn, CS50 AI, Google ML Crash Course, Microsoft AI Agents for Beginners	A Python notebook that loads data, trains a simple model, asks an LLM to explain results, and checks the explanation against the data
Software engineer	Anthropic Academy, Claude API development, OpenAI Agents SDK, Hugging Face Agents Course, LangChain Academy	A tool-using agent with structured outputs, tests, traces, and a human approval step
Data engineer	Bruin Academy, Data Engineering Zoomcamp, dbt Learn, Dagster University, Airbyte Academy	A pipeline that ingests data, transforms it, validates it, exposes metadata, and lets an agent query it safely
Analytics engineer	dbt Learn, Bruin Academy, Build an AI Data Analyst, LlamaIndex documentation	A semantic layer or context layer that defines metrics, entities, joins, examples, and freshness checks
Data analyst	Kaggle Learn, Codecademy AI for Data Analysis, Bruin Academy AI data analyst, dbt Learn	A repeatable analysis workflow where the AI writes SQL, explains assumptions, and you verify the result
Team lead or manager	Anthropic AI Fluency resources, OpenAI practical guide to building agents, Bruin Cloud AI agents, scheduled agents	A governance checklist: what agents can access, what they can change, who approves, and how answers are audited

AI programming: what to learn first

AI programming in 2026 is mostly regular software engineering with a few new primitives bolted onto it.

You need to know how to call a model, pass tools, validate outputs, evaluate behaviour, and decide when an agent should stop. The model is important, obviously, but the loop around the model is usually where production systems succeed or fail.

Start here:

Resource	Best for	Notes
Anthropic Academy and Claude courses	Claude, Claude Code, AI fluency, MCP, API development	Good first stop if your day-to-day tool is Claude or Claude Code. The course catalogue includes Claude Platform, Claude Code, AI Fluency, MCP, and API material.
Claude API Development Guide	Building Claude-powered applications	Useful when you want docs, courses, prompting, evaluations, RAG, tool use, and deployment guidance in one official place.
OpenAI Agents SDK guide	Building agent orchestration in code	Best when you want owned tool execution, approvals, state, handoffs, and guardrails.
OpenAI function calling guide	Tool calling and external actions	Learn this before building any agent that touches APIs, databases, files, or workflows.
OpenAI structured outputs guide	Reliable machine-readable responses	Essential for production workflows where a JSON shape matters more than a clever paragraph.
Hugging Face Agents Course	Free agent fundamentals	Good open course for understanding, using, building, and deploying agents.
DeepLearning.AI Agentic AI	Agent design patterns	Covers reflection, tool use, planning, and multi-agent patterns.
LangChain Academy	LangGraph and LangSmith workflows	Best if you plan to build with LangGraph or operate agents with LangSmith.
LlamaIndex agents documentation	Agents over data and documents	Strong when your core problem is connecting LLMs to documents, indexes, and structured context.

The minimum project: build an agent that calls three tools: a search function, a database query function, and a report writer. Add structured output. Add one eval. Add a failure case where the agent asks for clarification instead of guessing.

That one project teaches more than 20 prompt engineering videos.

Agentic data engineering: what makes it different

Classic data engineering already has enough moving parts: ingestion, transformations, orchestration, tests, lineage, dashboards, alerts, credentials...

Agentic data engineering adds a pretty annoying requirement on top: make the whole system legible and operable by an AI agent.

That means schemas need descriptions. Pipelines need lineage. Jobs need logs. Quality checks need machine-readable status. Credentials need strict scopes. The agent needs to know what it is allowed to query, what it is allowed to run, and when it should stop.

This is why I treat "agent readiness" as infrastructure work. Prompting matters, sure, but a beautiful prompt over messy metadata still gives you the classic garbage-in & garbage-out problem.

Start here:

Resource	Best for	Why it matters for agents
Bruin Academy	AI data analyst, Bruin CLI, Bruin Cloud, MCP, dashboards, pipelines	It connects the learning path to real warehouse schemas, local CLI workflows, metadata, and agent access.
Install Bruin	First Bruin CLI and MCP setup	Shows how to install the CLI, VS Code extension, and MCP connection for Claude Code, Cursor, Copilot, and Codex.
Build an AI Data Analyst	Local AI analyst over real data	Good for learning how schema import, context files, MCP, and SQL execution fit together.
Configure AI Agents in Bruin Cloud	Managed agents with scoped data access	Useful when you need projects, messaging integrations, connection sets, permissions, and system prompts.
Schedule an AI Agent	Recurring reports and alerts	Teaches the operational version of agents: recurring tasks, threshold alerts, SQL runs, and Slack or chat delivery.
Data Engineering Zoomcamp	End-to-end data engineering foundations	Covers the normal pipeline muscle: infrastructure, orchestration, warehousing, analytics engineering, batch, streaming, and a capstone.
dbt Learn	Analytics engineering and modelling	Strong for SQL modelling, tests, documentation, and metric thinking.
Dagster University	Asset-based orchestration	Good for learning data assets, dependencies, observability, and production orchestration.
Airbyte Academy	Data integration	Useful for ingestion concepts, PyAirbyte, and connector-driven movement.
Model Context Protocol docs	Connecting agents to tools and data	MCP is becoming the common interface between agents and external systems. Data teams need to understand the security model, permissions, and runtime behaviour behind the setup command.

The minimum project: ingest one source into DuckDB or Postgres, transform it into a reporting table, add quality checks, document the columns, expose it through MCP, and ask an agent to answer a real business question. Then deliberately break freshness or a column type and see whether the system catches it.

That is the agentic data engineering loop: build, expose, test, observe, repair.

Agentic data analysis: ask, verify, improve context

The first version of AI data analysis many people try is uploading a CSV to ChatGPT and asking for insights.

Honestly, that is fine for learning. It teaches you how models summarize, where they hallucinate, and how easy it is to get a confident answer that is slightly wrong. The serious version needs a governed analyst that can query your warehouse, respect access controls, understand metrics, and explain how it reached an answer.

A good AI data analyst needs:

SQL access through a controlled tool
schema metadata
business definitions
examples of correct analysis
freshness and quality signals
permission boundaries
a way to say "I don't know" (and mean it)
a review loop where humans improve context

For analysts, the learning path is:

Learn SQL properly with Kaggle Learn or your warehouse's docs.
Learn basic modelling and metric definitions with dbt Learn.
Learn AI-assisted analysis with Codecademy's AI for Data Analysis or similar hands-on material.
Build a real AI analyst with Bruin Academy.
Improve the context over time: column descriptions, known joins, metric definitions, "avoid this table" warnings, and example questions.

The important mental shift: the analyst's work now includes maintaining the context layer. Good questions still matter. So do metric definitions, examples, and all the boring notes that stop the agent from making stuff up.

How Claude Academy and Bruin Academy fit together

People search for "Claude Academy" because they want to learn the AI side. The official Anthropic learning hub is Anthropic Academy, with course listings also available on Claude resources. Use it for Claude, Claude Code, Claude API development, AI fluency, MCP, skills, and model behaviour.

Bruin Academy comes in when the question becomes: "How do I let an agent work with real data without giving it a random production database and hoping for the best?"

Use them together like this:

Learning goal	Use Claude Academy / Anthropic resources for	Use Bruin Academy for
AI programming	Claude API, Claude Code, prompting, tool use, skills, MCP concepts	Connecting agent tools to data projects and Bruin CLI workflows
Agentic data engineering	General agent behaviour and MCP architecture	Pipelines, metadata, Bruin MCP, local project context, governed warehouse access
Agentic data analysis	How the assistant reasons, asks follow-ups, and uses tools	Schema import, data context, SQL execution, analyst setup, context improvement
Enterprise adoption	AI fluency, model limits, deployment concepts	Connection sets, permissions, Slack or Teams agents, scheduled tasks, audit-friendly workflows

I would use Anthropic for the model & agent stack, then Bruin for the data stack the agent needs to operate safely. Different jobs. Same larger workflow.

Experience-level paths

Beginner: 0 to 3 months

The first win is becoming useful with the basics: Python, SQL, a small dataset, and enough AI vocabulary to know what the tool is doing.

Do this:

Kaggle Python, Pandas, and Intro to SQL.
Google Machine Learning Crash Course for practical ML basics.
Microsoft AI Agents for Beginners for agent vocabulary and simple examples.
Bruin Academy AI data analyst if you want a data project that feels close to work.

Build this: a small local analytics project. Load data, write SQL, ask an AI assistant to generate a query, verify the result, and write down what context the assistant needed.

Early-career software engineer: 3 to 12 months

The goal here is building AI features that survive the day after the demo.

Do this:

Anthropic Academy for Claude and AI fluency.
OpenAI function calling and structured outputs.
Hugging Face Agents Course.
LangChain Academy or LlamaIndex depending on whether your project is workflow-first or data-and-document-first.

Build this: a support or operations agent with tool calls, structured output, a test set, and a trace you can inspect.

Data engineer: 6 to 24 months

Your goal is to make your data platform agent-readable.

Do this:

Data Engineering Zoomcamp for full-stack fundamentals.
Bruin Academy for Bruin CLI, MCP, AI analyst, Cloud agents, and scheduled workflows.
dbt Learn for modelling and documentation.
Dagster University for orchestration and assets.
Model Context Protocol docs for tool and context integration.

Build this: a pipeline where an agent can inspect lineage, query the final table, detect a failed quality check, and explain which downstream report is affected.

Data analyst or analytics engineer: 3 to 18 months

Your goal is to turn analysis into reusable context.

Do this:

Kaggle SQL and Advanced SQL.
dbt Learn for models, tests, docs, and analytics engineering habits.
Bruin Academy AI data analyst to build a working analyst over real schemas.
Build an AI Data Analyst again, but this time spend more time on context improvement than setup.

Build this: a metric pack for one business domain: definitions, joins, example questions, dashboard links, known caveats, and tests.

Senior engineer, staff data engineer, or platform lead

Your goal is governance, reliability, and team adoption.

Do this:

OpenAI's practical guide to building agents for architecture and guardrails.
Anthropic MCP resources and the MCP docs.
Bruin Cloud AI agents and scheduled agents.
Dagster University, dbt Learn, and your internal governance docs.

Build this: a reference architecture for agents in your data platform: identity, permissions, approval flows, audit logs, eval datasets, incident handling, and a rollback path.

The resource list

AI foundations

CS50's Introduction to Artificial Intelligence with Python - best for students and engineers who want search, classification, optimization, machine learning, neural networks, and language fundamentals.
Google Machine Learning Crash Course - practical ML basics with videos, visualizations, and exercises.
fast.ai Practical Deep Learning for Coders - practical deep learning for people who can code and want to build.
Stanford CS224N - deeper NLP and LLM foundations, more academic and heavier than the others.
Kaggle Learn - fast micro-courses for Python, Pandas, SQL, data visualization, and ML.

AI programming and agents

Anthropic Academy - official learning hub for Claude, AI fluency, API development, MCP, Claude Code, and agent skills.
Claude courses - structured courses on Claude Platform, Claude Code, AI Fluency, MCP, and API topics.
Claude API Development Guide - official guide for building with Claude.
Hugging Face Agents Course - free agent course from beginner to advanced.
LangChain Academy - LangGraph, LangSmith, deployment, and agent engineering.
LangChain Academy GitHub - notebooks and modules for hands-on LangGraph learning.
LlamaIndex documentation - agents, workflows, document agents, and context augmentation.
OpenAI Agents SDK guide - official guide for building agents in code.
OpenAI function calling - official guide for connecting models to tools and external systems.
OpenAI structured outputs - reliable JSON and schema-constrained outputs.
DeepLearning.AI Agentic AI - agentic design patterns and implementation.
DeepLearning.AI AI Agents in LangGraph - build an agent from scratch and with LangGraph.
DeepLearning.AI Agent Skills with Anthropic - skills, MCP, subagents, and Claude workflows.
Microsoft AI Agents for Beginners - concept-to-code agent fundamentals.

Agentic data engineering

Bruin Academy - Bruin CLI, Bruin Cloud, Bruin AI, dashboards, MCP, and end-to-end data projects.
Build an AI Data Analyst - local AI analyst over a real database schema.
Install Bruin - Bruin CLI, VS Code extension, and MCP setup.
Configure AI Agents - Bruin Cloud agents, connection sets, messaging integrations, and permissions.
Schedule an AI Agent - recurring AI agent runs, reports, alerts, and SQL tasks.
Bruin MCP with Claude Code - connect Claude Code to Bruin CLI via MCP.
Data Engineering Zoomcamp - free end-to-end data engineering course.
dbt Learn - official dbt courses for analysts, engineers, and admins.
Dagster University - asset-based orchestration and production data platform concepts.
Airbyte Academy - data integration and PyAirbyte fundamentals.
Model Context Protocol docs - open standard for connecting AI apps to external tools and data.

Agentic data analysis

Bruin Academy AI data analyst - the best fit when you want an agent to query actual warehouse data through governed context.
Bruin Cloud Slack AI Analyst - deploy an AI analyst into Slack.
Kaggle Learn - SQL, Pandas, visualization, and ML practice.
Codecademy Learn How to Use AI for Data Analysis - prompt engineering for analytics and Python-based data analysis.
dbt Learn - modelling, testing, documentation, and analytics engineering.

How to choose without wasting time

My simple filter:

No project -> reference material.
Toy data -> AI programming exercise.
Real data with no tests, lineage, or permissions -> prototype.
Data access, metadata, tests, permissions, and a review loop -> proper production learning path.

This is also how I evaluate vendor courses. Official courses can be excellent, but they teach the vendor's mental model. That is completely fine as long as you know what you are learning: concepts, implementation, or product workflow.

For example, Anthropic Academy is strong for Claude and MCP concepts. OpenAI's docs are strong for Agents SDK and API patterns. LangChain Academy is strong for LangGraph. LlamaIndex is strong for document and data context. Bruin Academy is strong for real data workflows where an AI agent needs governed access to pipelines, schemas, and business context.

A 12-week practical plan

Weeks 1-2: foundations

Take Kaggle Python, Kaggle SQL, and the first half of Google Machine Learning Crash Course. If you already know Python and SQL, skim them and move on.

Deliverable: a notebook that loads a dataset, asks three business questions, and answers them with SQL or Pandas.

Weeks 3-4: model APIs and structured outputs

Use Claude API development, OpenAI function calling, and OpenAI structured outputs.

Deliverable: a small app that converts a natural-language request into a validated JSON plan, calls a tool, and returns a checked answer.

Weeks 5-6: agents

Take the Hugging Face Agents Course or DeepLearning.AI Agentic AI. If you want a framework, add LangChain Academy or LlamaIndex agents.

Deliverable: an agent with at least two tools, structured outputs, and one eval dataset.

Weeks 7-8: data engineering

Use Data Engineering Zoomcamp for foundations and Bruin Academy for AI-ready data workflows.

Deliverable: ingest data, transform it, validate it, and document the final table.

Weeks 9-10: AI analyst

Build through Bruin Academy's AI data analyst module. Set up the MCP, connect to a database, import schema, ask questions, and improve context.

Deliverable: an AI analyst that can answer five real questions and show the SQL or reasoning path behind each answer.

Weeks 11-12: governance and deployment

Read OpenAI's practical guide to building agents, the MCP docs, and Bruin Cloud agent setup.

Deliverable: a deployment checklist: credentials, permissions, audit logs, evals, fallback behaviour, owner, and rollback plan.

What to ignore, at least for now

Ignore giant "100 AI tools" lists. They age badly.

Skip prompt packs that skip verification. They help with phrasing and usually stop there.

Skip agent demos where the agent can mutate production state without approvals. Looks cool in a video, creates a proper mess in real data work.

Skip any AI data analysis workflow that never asks where metric definitions live. If the agent has no source for "active user", "gross revenue", "net revenue retention", or "qualified lead" in your company, it is guessing.

The actual north star

Finishing every course is a trap.

The useful goal: build a system where an AI agent can do real work against real context, with enough controls that you can trust it. For data teams, that usually means:

open-source tools where possible
CLI-first workflows that agents can operate
metadata in files or accessible APIs
lineage and data quality checks
governed access to warehouses
clear audit trails
a feedback loop where bad answers improve the context layer

This is why the combination of AI programming resources and data infrastructure resources matters. You need both.

AI programming teaches the agent loop. Agentic data engineering gives the loop something safe and useful to operate. Agentic data analysis turns the whole thing into business value.

FAQ

What is the best way to learn AI programming in 2026?

Learn Python, SQL, model APIs, tool calling, structured outputs, evals, and one agent framework. Start with CS50 AI or Google Machine Learning Crash Course, then move to Anthropic Academy, OpenAI Agents SDK, Hugging Face Agents Course, LangChain Academy, or LlamaIndex.

What should data engineers learn for agentic data engineering?

Learn ingestion, transformations, orchestration, quality checks, lineage, metadata, MCP, permissions, and evals. Good resources are Bruin Academy, Data Engineering Zoomcamp, dbt Learn, Dagster University, Airbyte Academy, and the MCP docs.

What should data analysts learn to use AI well?

Learn SQL, metric definitions, data modelling, prompt patterns for analysis, and validation. The practical path is Kaggle Learn, dbt Learn, Codecademy's AI for Data Analysis, and Bruin Academy's AI data analyst module.

Is Claude Academy official?

The official Anthropic learning hub is Anthropic Academy, and Claude course listings are available through Claude resources. Use those as the source of truth for Claude, Claude Code, Claude API development, AI fluency, MCP, and agent skills.

Is Bruin Academy open educational content?

Bruin Academy is openly accessible educational content from Bruin. It focuses on Bruin CLI, Bruin Cloud, Bruin AI, MCP, dashboards, pipelines, and AI data analyst workflows. It is especially useful if you want a hands-on path with real databases and agentic data workflows.

Do I need MCP to build AI agents?

You can ship agents with direct tool calling, custom APIs, or framework-specific integrations. Still, learn Model Context Protocol. It standardizes how AI applications connect to tools, data sources, and workflows. For data teams, it is quickly becoming a practical interface for giving agents controlled access to context.

What project should I build first?

Build an AI analyst over a small database. Import or document the schema, connect an agent through a controlled tool, ask five business questions, inspect the generated SQL, and improve the metadata when it gets something wrong. That project forces you to learn AI programming, data engineering, and data analysis together.

Learning AI Programming, Agentic Data Engineering, and AI Data Analysis

Other posts you might be interested in

Agentic Salesforce to Snowflake ELT: From One Prompt to a Governed Pipeline

Why It's Reasonable to Be Skeptical About AI in Data - and Why It's Fixable

What Is a Data App? Definition, Examples, and BI Differences

Sign up to our newsletter