People ask "how do I learn AI?" as if it is one skill you can pick up from a single course.
That question gets messy very quickly.
If you want to build AI products, you need AI programming. If you want agents to move data, run pipelines, validate tables, or fix failed jobs, you need agentic data engineering. If you want to ask better business questions and actually trust the answers, you need agentic data analysis.
There is overlap, obviously. Python, SQL, tool calling, evals, metadata, context... you keep seeing the same words. But the day-to-day work changes depending on whether you are writing software, maintaining data pipelines, or trying to answer "why did revenue drop last Tuesday?" without embarrassing yourself in a Slack thread.
This guide is my map of the educational content worth using in 2026, with a bias toward open, official, and hands-on material. It includes Anthropic Academy, Claude courses, Bruin Academy, open-source AI agent courses, data engineering courses, and the official docs you will probably keep open while you build.
If you are a beginner, start with Kaggle Learn for Python, SQL, Pandas, and machine learning basics. Then take CS50's Introduction to Artificial Intelligence with Python or Google's Machine Learning Crash Course.
If you are a software engineer, start with Anthropic Academy, Claude API development, the OpenAI Agents SDK guide, the OpenAI function calling guide, and the Hugging Face Agents Course.
If you are a data engineer, start with Bruin Academy, Build an AI Data Analyst, Install Bruin, Data Engineering Zoomcamp, dbt Learn, and Dagster University.
If you are a data analyst, start with Bruin Academy's AI data analyst module, Kaggle SQL and Pandas courses, dbt Learn, and one prompt-and-analysis course such as Codecademy's Learn How to Use AI for Data Analysis.
If you already work with agents, learn Model Context Protocol, Anthropic's MCP material, LlamaIndex agents, LangChain Academy, and DeepLearning.AI's Agentic AI course.
The common mistake is treating "AI" like a clean ladder: learn prompts, then agents, then production.
Real work is messier.
There are four skill layers:
- AI literacy - where models are useful, where they break, how prompting works, how hallucinations happen, and how to review outputs.
- AI programming - API calls, tool calling, structured outputs, retrieval, evals, agent loops, and deployment.
- Agentic data engineering - giving agents safe access to schemas, pipelines, lineage, quality checks, logs, credentials, and orchestration.
- Agentic data analysis - asking business questions, translating metrics into SQL, validating answers, handling follow-ups, and improving the context layer.
Most courses cover one layer. A few touch two. That is exactly where people get lost, they finish three courses and still stare at a blank project on Monday morning.
| Career path | Best starting resources | What you should build |
|---|
| Student or beginner | Kaggle Learn, CS50 AI, Google ML Crash Course, Microsoft AI Agents for Beginners | A Python notebook that loads data, trains a simple model, asks an LLM to explain results, and checks the explanation against the data |
| Software engineer | Anthropic Academy, Claude API development, OpenAI Agents SDK, Hugging Face Agents Course, LangChain Academy | A tool-using agent with structured outputs, tests, traces, and a human approval step |
| Data engineer | Bruin Academy, Data Engineering Zoomcamp, dbt Learn, Dagster University, Airbyte Academy | A pipeline that ingests data, transforms it, validates it, exposes metadata, and lets an agent query it safely |
| Analytics engineer | dbt Learn, Bruin Academy, Build an AI Data Analyst, LlamaIndex documentation | A semantic layer or context layer that defines metrics, entities, joins, examples, and freshness checks |
| Data analyst | Kaggle Learn, Codecademy AI for Data Analysis, Bruin Academy AI data analyst, dbt Learn | A repeatable analysis workflow where the AI writes SQL, explains assumptions, and you verify the result |
| Team lead or manager | Anthropic AI Fluency resources, OpenAI practical guide to building agents, Bruin Cloud AI agents, scheduled agents | A governance checklist: what agents can access, what they can change, who approves, and how answers are audited |
AI programming in 2026 is mostly regular software engineering with a few new primitives bolted onto it.
You need to know how to call a model, pass tools, validate outputs, evaluate behaviour, and decide when an agent should stop. The model is important, obviously, but the loop around the model is usually where production systems succeed or fail.
Start here:
| Resource | Best for | Notes |
|---|
| Anthropic Academy and Claude courses | Claude, Claude Code, AI fluency, MCP, API development | Good first stop if your day-to-day tool is Claude or Claude Code. The course catalogue includes Claude Platform, Claude Code, AI Fluency, MCP, and API material. |
| Claude API Development Guide | Building Claude-powered applications | Useful when you want docs, courses, prompting, evaluations, RAG, tool use, and deployment guidance in one official place. |
| OpenAI Agents SDK guide | Building agent orchestration in code | Best when you want owned tool execution, approvals, state, handoffs, and guardrails. |
| OpenAI function calling guide | Tool calling and external actions | Learn this before building any agent that touches APIs, databases, files, or workflows. |
| OpenAI structured outputs guide | Reliable machine-readable responses | Essential for production workflows where a JSON shape matters more than a clever paragraph. |
| Hugging Face Agents Course | Free agent fundamentals | Good open course for understanding, using, building, and deploying agents. |
| DeepLearning.AI Agentic AI | Agent design patterns | Covers reflection, tool use, planning, and multi-agent patterns. |
| LangChain Academy | LangGraph and LangSmith workflows | Best if you plan to build with LangGraph or operate agents with LangSmith. |
| LlamaIndex agents documentation | Agents over data and documents | Strong when your core problem is connecting LLMs to documents, indexes, and structured context. |
The minimum project: build an agent that calls three tools: a search function, a database query function, and a report writer. Add structured output. Add one eval. Add a failure case where the agent asks for clarification instead of guessing.
That one project teaches more than 20 prompt engineering videos.
Classic data engineering already has enough moving parts: ingestion, transformations, orchestration, tests, lineage, dashboards, alerts, credentials...
Agentic data engineering adds a pretty annoying requirement on top: make the whole system legible and operable by an AI agent.
That means schemas need descriptions. Pipelines need lineage. Jobs need logs. Quality checks need machine-readable status. Credentials need strict scopes. The agent needs to know what it is allowed to query, what it is allowed to run, and when it should stop.
This is why I treat "agent readiness" as infrastructure work. Prompting matters, sure, but a beautiful prompt over messy metadata still gives you the classic garbage-in & garbage-out problem.
Start here:
| Resource | Best for | Why it matters for agents |
|---|
| Bruin Academy | AI data analyst, Bruin CLI, Bruin Cloud, MCP, dashboards, pipelines | It connects the learning path to real warehouse schemas, local CLI workflows, metadata, and agent access. |
| Install Bruin | First Bruin CLI and MCP setup | Shows how to install the CLI, VS Code extension, and MCP connection for Claude Code, Cursor, Copilot, and Codex. |
| Build an AI Data Analyst | Local AI analyst over real data | Good for learning how schema import, context files, MCP, and SQL execution fit together. |
| Configure AI Agents in Bruin Cloud | Managed agents with scoped data access | Useful when you need projects, messaging integrations, connection sets, permissions, and system prompts. |
| Schedule an AI Agent | Recurring reports and alerts | Teaches the operational version of agents: recurring tasks, threshold alerts, SQL runs, and Slack or chat delivery. |
| Data Engineering Zoomcamp | End-to-end data engineering foundations | Covers the normal pipeline muscle: infrastructure, orchestration, warehousing, analytics engineering, batch, streaming, and a capstone. |
| dbt Learn | Analytics engineering and modelling | Strong for SQL modelling, tests, documentation, and metric thinking. |
| Dagster University | Asset-based orchestration | Good for learning data assets, dependencies, observability, and production orchestration. |
| Airbyte Academy | Data integration | Useful for ingestion concepts, PyAirbyte, and connector-driven movement. |
| Model Context Protocol docs | Connecting agents to tools and data | MCP is becoming the common interface between agents and external systems. Data teams need to understand the security model, permissions, and runtime behaviour behind the setup command. |
The minimum project: ingest one source into DuckDB or Postgres, transform it into a reporting table, add quality checks, document the columns, expose it through MCP, and ask an agent to answer a real business question. Then deliberately break freshness or a column type and see whether the system catches it.
That is the agentic data engineering loop: build, expose, test, observe, repair.
The first version of AI data analysis many people try is uploading a CSV to ChatGPT and asking for insights.
Honestly, that is fine for learning. It teaches you how models summarize, where they hallucinate, and how easy it is to get a confident answer that is slightly wrong. The serious version needs a governed analyst that can query your warehouse, respect access controls, understand metrics, and explain how it reached an answer.
A good AI data analyst needs:
- SQL access through a controlled tool
- schema metadata
- business definitions
- examples of correct analysis
- freshness and quality signals
- permission boundaries
- a way to say "I don't know" (and mean it)
- a review loop where humans improve context
For analysts, the learning path is:
- Learn SQL properly with Kaggle Learn or your warehouse's docs.
- Learn basic modelling and metric definitions with dbt Learn.
- Learn AI-assisted analysis with Codecademy's AI for Data Analysis or similar hands-on material.
- Build a real AI analyst with Bruin Academy.
- Improve the context over time: column descriptions, known joins, metric definitions, "avoid this table" warnings, and example questions.
The important mental shift: the analyst's work now includes maintaining the context layer. Good questions still matter. So do metric definitions, examples, and all the boring notes that stop the agent from making stuff up.
People search for "Claude Academy" because they want to learn the AI side. The official Anthropic learning hub is Anthropic Academy, with course listings also available on Claude resources. Use it for Claude, Claude Code, Claude API development, AI fluency, MCP, skills, and model behaviour.
Bruin Academy comes in when the question becomes: "How do I let an agent work with real data without giving it a random production database and hoping for the best?"
Use them together like this:
| Learning goal | Use Claude Academy / Anthropic resources for | Use Bruin Academy for |
|---|
| AI programming | Claude API, Claude Code, prompting, tool use, skills, MCP concepts | Connecting agent tools to data projects and Bruin CLI workflows |
| Agentic data engineering | General agent behaviour and MCP architecture | Pipelines, metadata, Bruin MCP, local project context, governed warehouse access |
| Agentic data analysis | How the assistant reasons, asks follow-ups, and uses tools | Schema import, data context, SQL execution, analyst setup, context improvement |
| Enterprise adoption | AI fluency, model limits, deployment concepts | Connection sets, permissions, Slack or Teams agents, scheduled tasks, audit-friendly workflows |
I would use Anthropic for the model & agent stack, then Bruin for the data stack the agent needs to operate safely. Different jobs. Same larger workflow.
The first win is becoming useful with the basics: Python, SQL, a small dataset, and enough AI vocabulary to know what the tool is doing.
Do this:
- Kaggle Python, Pandas, and Intro to SQL.
- Google Machine Learning Crash Course for practical ML basics.
- Microsoft AI Agents for Beginners for agent vocabulary and simple examples.
- Bruin Academy AI data analyst if you want a data project that feels close to work.
Build this: a small local analytics project. Load data, write SQL, ask an AI assistant to generate a query, verify the result, and write down what context the assistant needed.
The goal here is building AI features that survive the day after the demo.
Do this:
- Anthropic Academy for Claude and AI fluency.
- OpenAI function calling and structured outputs.
- Hugging Face Agents Course.
- LangChain Academy or LlamaIndex depending on whether your project is workflow-first or data-and-document-first.
Build this: a support or operations agent with tool calls, structured output, a test set, and a trace you can inspect.
Your goal is to make your data platform agent-readable.
Do this:
- Data Engineering Zoomcamp for full-stack fundamentals.
- Bruin Academy for Bruin CLI, MCP, AI analyst, Cloud agents, and scheduled workflows.
- dbt Learn for modelling and documentation.
- Dagster University for orchestration and assets.
- Model Context Protocol docs for tool and context integration.
Build this: a pipeline where an agent can inspect lineage, query the final table, detect a failed quality check, and explain which downstream report is affected.
Your goal is to turn analysis into reusable context.
Do this:
- Kaggle SQL and Advanced SQL.
- dbt Learn for models, tests, docs, and analytics engineering habits.
- Bruin Academy AI data analyst to build a working analyst over real schemas.
- Build an AI Data Analyst again, but this time spend more time on context improvement than setup.
Build this: a metric pack for one business domain: definitions, joins, example questions, dashboard links, known caveats, and tests.
Your goal is governance, reliability, and team adoption.
Do this:
- OpenAI's practical guide to building agents for architecture and guardrails.
- Anthropic MCP resources and the MCP docs.
- Bruin Cloud AI agents and scheduled agents.
- Dagster University, dbt Learn, and your internal governance docs.
Build this: a reference architecture for agents in your data platform: identity, permissions, approval flows, audit logs, eval datasets, incident handling, and a rollback path.
- CS50's Introduction to Artificial Intelligence with Python - best for students and engineers who want search, classification, optimization, machine learning, neural networks, and language fundamentals.
- Google Machine Learning Crash Course - practical ML basics with videos, visualizations, and exercises.
- fast.ai Practical Deep Learning for Coders - practical deep learning for people who can code and want to build.
- Stanford CS224N - deeper NLP and LLM foundations, more academic and heavier than the others.
- Kaggle Learn - fast micro-courses for Python, Pandas, SQL, data visualization, and ML.
- Anthropic Academy - official learning hub for Claude, AI fluency, API development, MCP, Claude Code, and agent skills.
- Claude courses - structured courses on Claude Platform, Claude Code, AI Fluency, MCP, and API topics.
- Claude API Development Guide - official guide for building with Claude.
- Hugging Face Agents Course - free agent course from beginner to advanced.
- LangChain Academy - LangGraph, LangSmith, deployment, and agent engineering.
- LangChain Academy GitHub - notebooks and modules for hands-on LangGraph learning.
- LlamaIndex documentation - agents, workflows, document agents, and context augmentation.
- OpenAI Agents SDK guide - official guide for building agents in code.
- OpenAI function calling - official guide for connecting models to tools and external systems.
- OpenAI structured outputs - reliable JSON and schema-constrained outputs.
- DeepLearning.AI Agentic AI - agentic design patterns and implementation.
- DeepLearning.AI AI Agents in LangGraph - build an agent from scratch and with LangGraph.
- DeepLearning.AI Agent Skills with Anthropic - skills, MCP, subagents, and Claude workflows.
- Microsoft AI Agents for Beginners - concept-to-code agent fundamentals.
- Bruin Academy - Bruin CLI, Bruin Cloud, Bruin AI, dashboards, MCP, and end-to-end data projects.
- Build an AI Data Analyst - local AI analyst over a real database schema.
- Install Bruin - Bruin CLI, VS Code extension, and MCP setup.
- Configure AI Agents - Bruin Cloud agents, connection sets, messaging integrations, and permissions.
- Schedule an AI Agent - recurring AI agent runs, reports, alerts, and SQL tasks.
- Bruin MCP with Claude Code - connect Claude Code to Bruin CLI via MCP.
- Data Engineering Zoomcamp - free end-to-end data engineering course.
- dbt Learn - official dbt courses for analysts, engineers, and admins.
- Dagster University - asset-based orchestration and production data platform concepts.
- Airbyte Academy - data integration and PyAirbyte fundamentals.
- Model Context Protocol docs - open standard for connecting AI apps to external tools and data.
My simple filter:
- No project -> reference material.
- Toy data -> AI programming exercise.
- Real data with no tests, lineage, or permissions -> prototype.
- Data access, metadata, tests, permissions, and a review loop -> proper production learning path.
This is also how I evaluate vendor courses. Official courses can be excellent, but they teach the vendor's mental model. That is completely fine as long as you know what you are learning: concepts, implementation, or product workflow.
For example, Anthropic Academy is strong for Claude and MCP concepts. OpenAI's docs are strong for Agents SDK and API patterns. LangChain Academy is strong for LangGraph. LlamaIndex is strong for document and data context. Bruin Academy is strong for real data workflows where an AI agent needs governed access to pipelines, schemas, and business context.
Take Kaggle Python, Kaggle SQL, and the first half of Google Machine Learning Crash Course. If you already know Python and SQL, skim them and move on.
Deliverable: a notebook that loads a dataset, asks three business questions, and answers them with SQL or Pandas.
Use Claude API development, OpenAI function calling, and OpenAI structured outputs.
Deliverable: a small app that converts a natural-language request into a validated JSON plan, calls a tool, and returns a checked answer.
Take the Hugging Face Agents Course or DeepLearning.AI Agentic AI. If you want a framework, add LangChain Academy or LlamaIndex agents.
Deliverable: an agent with at least two tools, structured outputs, and one eval dataset.
Use Data Engineering Zoomcamp for foundations and Bruin Academy for AI-ready data workflows.
Deliverable: ingest data, transform it, validate it, and document the final table.
Build through Bruin Academy's AI data analyst module. Set up the MCP, connect to a database, import schema, ask questions, and improve context.
Deliverable: an AI analyst that can answer five real questions and show the SQL or reasoning path behind each answer.
Read OpenAI's practical guide to building agents, the MCP docs, and Bruin Cloud agent setup.
Deliverable: a deployment checklist: credentials, permissions, audit logs, evals, fallback behaviour, owner, and rollback plan.
Ignore giant "100 AI tools" lists. They age badly.
Skip prompt packs that skip verification. They help with phrasing and usually stop there.
Skip agent demos where the agent can mutate production state without approvals. Looks cool in a video, creates a proper mess in real data work.
Skip any AI data analysis workflow that never asks where metric definitions live. If the agent has no source for "active user", "gross revenue", "net revenue retention", or "qualified lead" in your company, it is guessing.
Finishing every course is a trap.
The useful goal: build a system where an AI agent can do real work against real context, with enough controls that you can trust it. For data teams, that usually means:
- open-source tools where possible
- CLI-first workflows that agents can operate
- metadata in files or accessible APIs
- lineage and data quality checks
- governed access to warehouses
- clear audit trails
- a feedback loop where bad answers improve the context layer
This is why the combination of AI programming resources and data infrastructure resources matters. You need both.
AI programming teaches the agent loop. Agentic data engineering gives the loop something safe and useful to operate. Agentic data analysis turns the whole thing into business value.
Learn Python, SQL, model APIs, tool calling, structured outputs, evals, and one agent framework. Start with CS50 AI or Google Machine Learning Crash Course, then move to Anthropic Academy, OpenAI Agents SDK, Hugging Face Agents Course, LangChain Academy, or LlamaIndex.
Learn ingestion, transformations, orchestration, quality checks, lineage, metadata, MCP, permissions, and evals. Good resources are Bruin Academy, Data Engineering Zoomcamp, dbt Learn, Dagster University, Airbyte Academy, and the MCP docs.
Learn SQL, metric definitions, data modelling, prompt patterns for analysis, and validation. The practical path is Kaggle Learn, dbt Learn, Codecademy's AI for Data Analysis, and Bruin Academy's AI data analyst module.
The official Anthropic learning hub is Anthropic Academy, and Claude course listings are available through Claude resources. Use those as the source of truth for Claude, Claude Code, Claude API development, AI fluency, MCP, and agent skills.
Bruin Academy is openly accessible educational content from Bruin. It focuses on Bruin CLI, Bruin Cloud, Bruin AI, MCP, dashboards, pipelines, and AI data analyst workflows. It is especially useful if you want a hands-on path with real databases and agentic data workflows.
You can ship agents with direct tool calling, custom APIs, or framework-specific integrations. Still, learn Model Context Protocol. It standardizes how AI applications connect to tools, data sources, and workflows. For data teams, it is quickly becoming a practical interface for giving agents controlled access to context.
Build an AI analyst over a small database. Import or document the schema, connect an agent through a controlled tool, ask five business questions, inspect the generated SQL, and improve the metadata when it gets something wrong. That project forces you to learn AI programming, data engineering, and data analysis together.