Education
18 min read

Learning AI Programming, Agentic Data Engineering, and AI Data Analysis

A practical guide to the best open and official courses for AI programming, agentic data engineering, and AI data analysis - organized by career path, experience level, and project goal.

Arsalan Noorafkan

Developer Advocate

People ask "how do I learn AI?" as if it is one skill you can pick up from a single course.

That question gets messy very quickly.

If you want to build AI products, you need AI programming. If you want agents to move data, run pipelines, validate tables, or fix failed jobs, you need agentic data engineering. If you want to ask better business questions and actually trust the answers, you need agentic data analysis.

There is overlap, obviously. Python, SQL, tool calling, evals, metadata, context... you keep seeing the same words. But the day-to-day work changes depending on whether you are writing software, maintaining data pipelines, or trying to answer "why did revenue drop last Tuesday?" without embarrassing yourself in a Slack thread.

This guide is my map of the educational content worth using in 2026, with a bias toward open, official, and hands-on material. It includes Anthropic Academy, Claude courses, Bruin Academy, open-source AI agent courses, data engineering courses, and the official docs you will probably keep open while you build.

Quick answer: which course should you start with?

If you are a beginner, start with Kaggle Learn for Python, SQL, Pandas, and machine learning basics. Then take CS50's Introduction to Artificial Intelligence with Python or Google's Machine Learning Crash Course.

If you are a software engineer, start with Anthropic Academy, Claude API development, the OpenAI Agents SDK guide, the OpenAI function calling guide, and the Hugging Face Agents Course.

If you are a data engineer, start with Bruin Academy, Build an AI Data Analyst, Install Bruin, Data Engineering Zoomcamp, dbt Learn, and Dagster University.

If you are a data analyst, start with Bruin Academy's AI data analyst module, Kaggle SQL and Pandas courses, dbt Learn, and one prompt-and-analysis course such as Codecademy's Learn How to Use AI for Data Analysis.

If you already work with agents, learn Model Context Protocol, Anthropic's MCP material, LlamaIndex agents, LangChain Academy, and DeepLearning.AI's Agentic AI course.

The learning map I would use

The common mistake is treating "AI" like a clean ladder: learn prompts, then agents, then production.

Real work is messier.

There are four skill layers:

  1. AI literacy - where models are useful, where they break, how prompting works, how hallucinations happen, and how to review outputs.
  2. AI programming - API calls, tool calling, structured outputs, retrieval, evals, agent loops, and deployment.
  3. Agentic data engineering - giving agents safe access to schemas, pipelines, lineage, quality checks, logs, credentials, and orchestration.
  4. Agentic data analysis - asking business questions, translating metrics into SQL, validating answers, handling follow-ups, and improving the context layer.

Most courses cover one layer. A few touch two. That is exactly where people get lost, they finish three courses and still stare at a blank project on Monday morning.

Best learning resources by career path

Career pathBest starting resourcesWhat you should build
Student or beginnerKaggle Learn, CS50 AI, Google ML Crash Course, Microsoft AI Agents for BeginnersA Python notebook that loads data, trains a simple model, asks an LLM to explain results, and checks the explanation against the data
Software engineerAnthropic Academy, Claude API development, OpenAI Agents SDK, Hugging Face Agents Course, LangChain AcademyA tool-using agent with structured outputs, tests, traces, and a human approval step
Data engineerBruin Academy, Data Engineering Zoomcamp, dbt Learn, Dagster University, Airbyte AcademyA pipeline that ingests data, transforms it, validates it, exposes metadata, and lets an agent query it safely
Analytics engineerdbt Learn, Bruin Academy, Build an AI Data Analyst, LlamaIndex documentationA semantic layer or context layer that defines metrics, entities, joins, examples, and freshness checks
Data analystKaggle Learn, Codecademy AI for Data Analysis, Bruin Academy AI data analyst, dbt LearnA repeatable analysis workflow where the AI writes SQL, explains assumptions, and you verify the result
Team lead or managerAnthropic AI Fluency resources, OpenAI practical guide to building agents, Bruin Cloud AI agents, scheduled agentsA governance checklist: what agents can access, what they can change, who approves, and how answers are audited

AI programming: what to learn first

AI programming in 2026 is mostly regular software engineering with a few new primitives bolted onto it.

You need to know how to call a model, pass tools, validate outputs, evaluate behaviour, and decide when an agent should stop. The model is important, obviously, but the loop around the model is usually where production systems succeed or fail.

Start here:

ResourceBest forNotes
Anthropic Academy and Claude coursesClaude, Claude Code, AI fluency, MCP, API developmentGood first stop if your day-to-day tool is Claude or Claude Code. The course catalogue includes Claude Platform, Claude Code, AI Fluency, MCP, and API material.
Claude API Development GuideBuilding Claude-powered applicationsUseful when you want docs, courses, prompting, evaluations, RAG, tool use, and deployment guidance in one official place.
OpenAI Agents SDK guideBuilding agent orchestration in codeBest when you want owned tool execution, approvals, state, handoffs, and guardrails.
OpenAI function calling guideTool calling and external actionsLearn this before building any agent that touches APIs, databases, files, or workflows.
OpenAI structured outputs guideReliable machine-readable responsesEssential for production workflows where a JSON shape matters more than a clever paragraph.
Hugging Face Agents CourseFree agent fundamentalsGood open course for understanding, using, building, and deploying agents.
DeepLearning.AI Agentic AIAgent design patternsCovers reflection, tool use, planning, and multi-agent patterns.
LangChain AcademyLangGraph and LangSmith workflowsBest if you plan to build with LangGraph or operate agents with LangSmith.
LlamaIndex agents documentationAgents over data and documentsStrong when your core problem is connecting LLMs to documents, indexes, and structured context.

The minimum project: build an agent that calls three tools: a search function, a database query function, and a report writer. Add structured output. Add one eval. Add a failure case where the agent asks for clarification instead of guessing.

That one project teaches more than 20 prompt engineering videos.

Agentic data engineering: what makes it different

Classic data engineering already has enough moving parts: ingestion, transformations, orchestration, tests, lineage, dashboards, alerts, credentials...

Agentic data engineering adds a pretty annoying requirement on top: make the whole system legible and operable by an AI agent.

That means schemas need descriptions. Pipelines need lineage. Jobs need logs. Quality checks need machine-readable status. Credentials need strict scopes. The agent needs to know what it is allowed to query, what it is allowed to run, and when it should stop.

This is why I treat "agent readiness" as infrastructure work. Prompting matters, sure, but a beautiful prompt over messy metadata still gives you the classic garbage-in & garbage-out problem.

Start here:

ResourceBest forWhy it matters for agents
Bruin AcademyAI data analyst, Bruin CLI, Bruin Cloud, MCP, dashboards, pipelinesIt connects the learning path to real warehouse schemas, local CLI workflows, metadata, and agent access.
Install BruinFirst Bruin CLI and MCP setupShows how to install the CLI, VS Code extension, and MCP connection for Claude Code, Cursor, Copilot, and Codex.
Build an AI Data AnalystLocal AI analyst over real dataGood for learning how schema import, context files, MCP, and SQL execution fit together.
Configure AI Agents in Bruin CloudManaged agents with scoped data accessUseful when you need projects, messaging integrations, connection sets, permissions, and system prompts.
Schedule an AI AgentRecurring reports and alertsTeaches the operational version of agents: recurring tasks, threshold alerts, SQL runs, and Slack or chat delivery.
Data Engineering ZoomcampEnd-to-end data engineering foundationsCovers the normal pipeline muscle: infrastructure, orchestration, warehousing, analytics engineering, batch, streaming, and a capstone.
dbt LearnAnalytics engineering and modellingStrong for SQL modelling, tests, documentation, and metric thinking.
Dagster UniversityAsset-based orchestrationGood for learning data assets, dependencies, observability, and production orchestration.
Airbyte AcademyData integrationUseful for ingestion concepts, PyAirbyte, and connector-driven movement.
Model Context Protocol docsConnecting agents to tools and dataMCP is becoming the common interface between agents and external systems. Data teams need to understand the security model, permissions, and runtime behaviour behind the setup command.

The minimum project: ingest one source into DuckDB or Postgres, transform it into a reporting table, add quality checks, document the columns, expose it through MCP, and ask an agent to answer a real business question. Then deliberately break freshness or a column type and see whether the system catches it.

That is the agentic data engineering loop: build, expose, test, observe, repair.

Agentic data analysis: ask, verify, improve context

The first version of AI data analysis many people try is uploading a CSV to ChatGPT and asking for insights.

Honestly, that is fine for learning. It teaches you how models summarize, where they hallucinate, and how easy it is to get a confident answer that is slightly wrong. The serious version needs a governed analyst that can query your warehouse, respect access controls, understand metrics, and explain how it reached an answer.

A good AI data analyst needs:

  • SQL access through a controlled tool
  • schema metadata
  • business definitions
  • examples of correct analysis
  • freshness and quality signals
  • permission boundaries
  • a way to say "I don't know" (and mean it)
  • a review loop where humans improve context

For analysts, the learning path is:

  1. Learn SQL properly with Kaggle Learn or your warehouse's docs.
  2. Learn basic modelling and metric definitions with dbt Learn.
  3. Learn AI-assisted analysis with Codecademy's AI for Data Analysis or similar hands-on material.
  4. Build a real AI analyst with Bruin Academy.
  5. Improve the context over time: column descriptions, known joins, metric definitions, "avoid this table" warnings, and example questions.

The important mental shift: the analyst's work now includes maintaining the context layer. Good questions still matter. So do metric definitions, examples, and all the boring notes that stop the agent from making stuff up.

How Claude Academy and Bruin Academy fit together

People search for "Claude Academy" because they want to learn the AI side. The official Anthropic learning hub is Anthropic Academy, with course listings also available on Claude resources. Use it for Claude, Claude Code, Claude API development, AI fluency, MCP, skills, and model behaviour.

Bruin Academy comes in when the question becomes: "How do I let an agent work with real data without giving it a random production database and hoping for the best?"

Use them together like this:

Learning goalUse Claude Academy / Anthropic resources forUse Bruin Academy for
AI programmingClaude API, Claude Code, prompting, tool use, skills, MCP conceptsConnecting agent tools to data projects and Bruin CLI workflows
Agentic data engineeringGeneral agent behaviour and MCP architecturePipelines, metadata, Bruin MCP, local project context, governed warehouse access
Agentic data analysisHow the assistant reasons, asks follow-ups, and uses toolsSchema import, data context, SQL execution, analyst setup, context improvement
Enterprise adoptionAI fluency, model limits, deployment conceptsConnection sets, permissions, Slack or Teams agents, scheduled tasks, audit-friendly workflows

I would use Anthropic for the model & agent stack, then Bruin for the data stack the agent needs to operate safely. Different jobs. Same larger workflow.

Experience-level paths

Beginner: 0 to 3 months

The first win is becoming useful with the basics: Python, SQL, a small dataset, and enough AI vocabulary to know what the tool is doing.

Do this:

  1. Kaggle Python, Pandas, and Intro to SQL.
  2. Google Machine Learning Crash Course for practical ML basics.
  3. Microsoft AI Agents for Beginners for agent vocabulary and simple examples.
  4. Bruin Academy AI data analyst if you want a data project that feels close to work.

Build this: a small local analytics project. Load data, write SQL, ask an AI assistant to generate a query, verify the result, and write down what context the assistant needed.

Early-career software engineer: 3 to 12 months

The goal here is building AI features that survive the day after the demo.

Do this:

  1. Anthropic Academy for Claude and AI fluency.
  2. OpenAI function calling and structured outputs.
  3. Hugging Face Agents Course.
  4. LangChain Academy or LlamaIndex depending on whether your project is workflow-first or data-and-document-first.

Build this: a support or operations agent with tool calls, structured output, a test set, and a trace you can inspect.

Data engineer: 6 to 24 months

Your goal is to make your data platform agent-readable.

Do this:

  1. Data Engineering Zoomcamp for full-stack fundamentals.
  2. Bruin Academy for Bruin CLI, MCP, AI analyst, Cloud agents, and scheduled workflows.
  3. dbt Learn for modelling and documentation.
  4. Dagster University for orchestration and assets.
  5. Model Context Protocol docs for tool and context integration.

Build this: a pipeline where an agent can inspect lineage, query the final table, detect a failed quality check, and explain which downstream report is affected.

Data analyst or analytics engineer: 3 to 18 months

Your goal is to turn analysis into reusable context.

Do this:

  1. Kaggle SQL and Advanced SQL.
  2. dbt Learn for models, tests, docs, and analytics engineering habits.
  3. Bruin Academy AI data analyst to build a working analyst over real schemas.
  4. Build an AI Data Analyst again, but this time spend more time on context improvement than setup.

Build this: a metric pack for one business domain: definitions, joins, example questions, dashboard links, known caveats, and tests.

Senior engineer, staff data engineer, or platform lead

Your goal is governance, reliability, and team adoption.

Do this:

  1. OpenAI's practical guide to building agents for architecture and guardrails.
  2. Anthropic MCP resources and the MCP docs.
  3. Bruin Cloud AI agents and scheduled agents.
  4. Dagster University, dbt Learn, and your internal governance docs.

Build this: a reference architecture for agents in your data platform: identity, permissions, approval flows, audit logs, eval datasets, incident handling, and a rollback path.

The resource list

AI foundations

AI programming and agents

Agentic data engineering

Agentic data analysis

How to choose without wasting time

My simple filter:

  • No project -> reference material.
  • Toy data -> AI programming exercise.
  • Real data with no tests, lineage, or permissions -> prototype.
  • Data access, metadata, tests, permissions, and a review loop -> proper production learning path.

This is also how I evaluate vendor courses. Official courses can be excellent, but they teach the vendor's mental model. That is completely fine as long as you know what you are learning: concepts, implementation, or product workflow.

For example, Anthropic Academy is strong for Claude and MCP concepts. OpenAI's docs are strong for Agents SDK and API patterns. LangChain Academy is strong for LangGraph. LlamaIndex is strong for document and data context. Bruin Academy is strong for real data workflows where an AI agent needs governed access to pipelines, schemas, and business context.

A 12-week practical plan

Weeks 1-2: foundations

Take Kaggle Python, Kaggle SQL, and the first half of Google Machine Learning Crash Course. If you already know Python and SQL, skim them and move on.

Deliverable: a notebook that loads a dataset, asks three business questions, and answers them with SQL or Pandas.

Weeks 3-4: model APIs and structured outputs

Use Claude API development, OpenAI function calling, and OpenAI structured outputs.

Deliverable: a small app that converts a natural-language request into a validated JSON plan, calls a tool, and returns a checked answer.

Weeks 5-6: agents

Take the Hugging Face Agents Course or DeepLearning.AI Agentic AI. If you want a framework, add LangChain Academy or LlamaIndex agents.

Deliverable: an agent with at least two tools, structured outputs, and one eval dataset.

Weeks 7-8: data engineering

Use Data Engineering Zoomcamp for foundations and Bruin Academy for AI-ready data workflows.

Deliverable: ingest data, transform it, validate it, and document the final table.

Weeks 9-10: AI analyst

Build through Bruin Academy's AI data analyst module. Set up the MCP, connect to a database, import schema, ask questions, and improve context.

Deliverable: an AI analyst that can answer five real questions and show the SQL or reasoning path behind each answer.

Weeks 11-12: governance and deployment

Read OpenAI's practical guide to building agents, the MCP docs, and Bruin Cloud agent setup.

Deliverable: a deployment checklist: credentials, permissions, audit logs, evals, fallback behaviour, owner, and rollback plan.

What to ignore, at least for now

Ignore giant "100 AI tools" lists. They age badly.

Skip prompt packs that skip verification. They help with phrasing and usually stop there.

Skip agent demos where the agent can mutate production state without approvals. Looks cool in a video, creates a proper mess in real data work.

Skip any AI data analysis workflow that never asks where metric definitions live. If the agent has no source for "active user", "gross revenue", "net revenue retention", or "qualified lead" in your company, it is guessing.

The actual north star

Finishing every course is a trap.

The useful goal: build a system where an AI agent can do real work against real context, with enough controls that you can trust it. For data teams, that usually means:

  • open-source tools where possible
  • CLI-first workflows that agents can operate
  • metadata in files or accessible APIs
  • lineage and data quality checks
  • governed access to warehouses
  • clear audit trails
  • a feedback loop where bad answers improve the context layer

This is why the combination of AI programming resources and data infrastructure resources matters. You need both.

AI programming teaches the agent loop. Agentic data engineering gives the loop something safe and useful to operate. Agentic data analysis turns the whole thing into business value.

FAQ

What is the best way to learn AI programming in 2026?

Learn Python, SQL, model APIs, tool calling, structured outputs, evals, and one agent framework. Start with CS50 AI or Google Machine Learning Crash Course, then move to Anthropic Academy, OpenAI Agents SDK, Hugging Face Agents Course, LangChain Academy, or LlamaIndex.

What should data engineers learn for agentic data engineering?

Learn ingestion, transformations, orchestration, quality checks, lineage, metadata, MCP, permissions, and evals. Good resources are Bruin Academy, Data Engineering Zoomcamp, dbt Learn, Dagster University, Airbyte Academy, and the MCP docs.

What should data analysts learn to use AI well?

Learn SQL, metric definitions, data modelling, prompt patterns for analysis, and validation. The practical path is Kaggle Learn, dbt Learn, Codecademy's AI for Data Analysis, and Bruin Academy's AI data analyst module.

Is Claude Academy official?

The official Anthropic learning hub is Anthropic Academy, and Claude course listings are available through Claude resources. Use those as the source of truth for Claude, Claude Code, Claude API development, AI fluency, MCP, and agent skills.

Is Bruin Academy open educational content?

Bruin Academy is openly accessible educational content from Bruin. It focuses on Bruin CLI, Bruin Cloud, Bruin AI, MCP, dashboards, pipelines, and AI data analyst workflows. It is especially useful if you want a hands-on path with real databases and agentic data workflows.

Do I need MCP to build AI agents?

You can ship agents with direct tool calling, custom APIs, or framework-specific integrations. Still, learn Model Context Protocol. It standardizes how AI applications connect to tools, data sources, and workflows. For data teams, it is quickly becoming a practical interface for giving agents controlled access to context.

What project should I build first?

Build an AI analyst over a small database. Import or document the schema, connect an agent through a controlled tool, ask five business questions, inspect the generated SQL, and improve the metadata when it gets something wrong. That project forces you to learn AI programming, data engineering, and data analysis together.