Choose this when
You have an AI app or workflow that needs reliability, cost control, latency, security, and observability.
AI learning path
Move from demos to AI systems that can be monitored, debugged, paid for, and improved.
You have an AI app or workflow that needs reliability, cost control, latency, security, and observability.
You can plan a production stack for prompts, models, retrieval, evals, tracing, deployment, and operations.
Move on when you can answer: what changed, what broke, who noticed, and how will we prevent it?
Do
This is the route through the topic. Watch and open the material inside the step where it is used.
Step 1
Capture inputs, outputs, traces, costs, latency, user feedback, and model versions.
Watch here
Full Stack Deep Learning
System-level view of shipping ML and AI products.
Open here
Book · Chip Huyen · Intermediate to advanced
You are moving from demos to production systems.
Open resourceStep 2
Use evals, canaries, fallbacks, and rollback plans for prompt and model changes.
Watch here
MLOps Community
Use this for deployment, operations, and production system concerns.
Open here
Course videos · Full Stack Deep Learning · Intermediate to advanced
You want the whole lifecycle of ML and AI product development.
Open resourceStep 3
Feed production failures back into data, prompts, retrieval, and product decisions.
Watch here
Chip Huyen
Connects evals, systems, and production engineering judgment.
Open here
Open source tool and docs · Arize AI · Intermediate
You need to trace, inspect, and evaluate LLM app behavior.
Open resourceDocs and cookbooks · Langfuse · Intermediate
You need production LLM tracing, scoring, and prompt operations.
Open resourceAdd tracing and a release checklist to one AI workflow, then use a real failure to improve it.
Reference
Step 1
Book · Chip Huyen · Intermediate to advanced
You are moving from demos to production systems.
Step 2
Course videos · Full Stack Deep Learning · Intermediate to advanced
You want the whole lifecycle of ML and AI product development.
Step 3
Open source tool and docs · Arize AI · Intermediate
You need to trace, inspect, and evaluate LLM app behavior.
Step 3
Docs and cookbooks · Langfuse · Intermediate
You need production LLM tracing, scoring, and prompt operations.
Intermediate to advanced
Use the book page and related essays as a production engineering path.
View educator
Intermediate to advanced
Read the applied ML and LLM systems posts.
View educator
Intermediate to advanced
Read the evals guide and build a small test set for your own app.
View educator
Beginner to intermediate
Browse the How I AI interviews and copy the workflows that match your role.
View educator
Intermediate to advanced
Watch AI Engineer talks for production patterns and tool choices.
View educator
Beginner to advanced
Start with ChatGPT Prompt Engineering for Developers, then pick a RAG or agents course.
View educator