Langfuse starting video
Langfuse Docs · Observability, Prompt management, Evals, Tracing
AI education source
Langfuse Docs
Good operational material for tracing, scoring, and improving production LLM apps.
Start with: Instrument a toy app with traces, then add scores and eval datasets.
Start with the educator-specific video, then use the related topic videos to fill in prerequisites and adjacent skills.
Langfuse Docs · Observability, Prompt management, Evals, Tracing
Weights & Biases · evals, llm apps, observability
Arize AI · evals, observability, tracing
Promptfoo · evals, prompt testing, red teaming
Hamel Husain and Shreya Shankar · evals, product, llm reliability
Teams shipping LLM applications should start here when they need observability, prompt management, evals, and tracing. The strongest fit is a learner who wants material in these formats: docs, cookbooks, open source tool.
Instrument a toy app with traces, then add scores and eval datasets. After that, open one related resource below and write down the exact workflow, concept, or implementation pattern you want to apply.
Good operational material for tracing, scoring, and improving production LLM apps. Use this profile when you are comparing educators by topic, level, format, and practical usefulness rather than browsing random AI content.
Compare the skill coverage, the starting recommendation, and the related videos. If you need observability, search the directory for that skill and shortlist three profiles before committing to a course, book, or playlist.
| Resource | Kind | Level | Use when |
|---|---|---|---|
|
LLM Evals
Hamel Husain
|
Guide | Intermediate | Your AI app needs quality checks before users see it. |
|
W&B LLM Evaluation Course
Weights & Biases
|
Free course | Intermediate | You need to debug and measure LLM app quality. |
|
Phoenix by Arize
Arize AI
|
Open source tool and docs | Intermediate | You need to trace, inspect, and evaluate LLM app behavior. |
|
Langfuse Docs
Langfuse
|
Docs and cookbooks | Intermediate | You need production LLM tracing, scoring, and prompt operations. |
|
Promptfoo Intro
Promptfoo
|
Open source docs | Intermediate | You need regression tests for prompts, models, and LLM outputs. |
|
AI Evals for Engineers & PMs
Hamel Husain and Shreya Shankar
|
Cohort course | Intermediate | You are shipping AI features and need a serious evaluation workflow. |
|
Hamel's AI evals guides
Hamel Husain
|
Guides | Intermediate to advanced | Use this when you want Hamel Husain's material for evals and related AI skills. |
|
AI Evals for Engineers and PMs
Shreya Shankar
|
Course | Intermediate | Use this when you want Shreya Shankar's material for evals and related AI skills. |
|
W&B Courses
Weights & Biases
|
Free courses | Intermediate | Use this when you want Weights & Biases's material for llm apps and related AI skills. |
|
Phoenix
Arize AI
|
Docs | Intermediate | Use this when you want Arize AI's material for observability and related AI skills. |