AI education source

Langfuse

Langfuse Docs

Good operational material for tracing, scoring, and improving production LLM apps.

Start with: Instrument a toy app with traces, then add scores and eval datasets.

Videos

Start with the educator-specific video, then use the related topic videos to fill in prerequisites and adjacent skills.

Langfuse starting video

Langfuse Docs · Observability, Prompt management, Evals, Tracing

W&B LLM Evaluation Course

Weights & Biases · evals, llm apps, observability

Phoenix by Arize

Arize AI · evals, observability, tracing

Promptfoo Intro

Promptfoo · evals, prompt testing, red teaming

AI Evals for Engineers & PMs

Hamel Husain and Shreya Shankar · evals, product, llm reliability

Skills

Learner questions

Who should learn from Langfuse?

Teams shipping LLM applications should start here when they need observability, prompt management, evals, and tracing. The strongest fit is a learner who wants material in these formats: docs, cookbooks, open source tool.

What should I do first?

Instrument a toy app with traces, then add scores and eval datasets. After that, open one related resource below and write down the exact workflow, concept, or implementation pattern you want to apply.

What problem does this help with?

Good operational material for tracing, scoring, and improving production LLM apps. Use this profile when you are comparing educators by topic, level, format, and practical usefulness rather than browsing random AI content.

How do I compare this with other educators?

Compare the skill coverage, the starting recommendation, and the related videos. If you need observability, search the directory for that skill and shortlist three profiles before committing to a course, book, or playlist.

Related resources

Resource	Kind	Level	Use when
LLM Evals Hamel Husain	Guide	Intermediate	Your AI app needs quality checks before users see it.
W&B LLM Evaluation Course Weights & Biases	Free course	Intermediate	You need to debug and measure LLM app quality.
Phoenix by Arize Arize AI	Open source tool and docs	Intermediate	You need to trace, inspect, and evaluate LLM app behavior.
Langfuse Docs Langfuse	Docs and cookbooks	Intermediate	You need production LLM tracing, scoring, and prompt operations.
Promptfoo Intro Promptfoo	Open source docs	Intermediate	You need regression tests for prompts, models, and LLM outputs.
AI Evals for Engineers & PMs Hamel Husain and Shreya Shankar	Cohort course	Intermediate	You are shipping AI features and need a serious evaluation workflow.
Hamel's AI evals guides Hamel Husain	Guides	Intermediate to advanced	Use this when you want Hamel Husain's material for evals and related AI skills.
AI Evals for Engineers and PMs Shreya Shankar	Course	Intermediate	Use this when you want Shreya Shankar's material for evals and related AI skills.
W&B Courses Weights & Biases	Free courses	Intermediate	Use this when you want Weights & Biases's material for llm apps and related AI skills.
Phoenix Arize AI	Docs	Intermediate	Use this when you want Arize AI's material for observability and related AI skills.