AI education source

Arize AI

Phoenix

Useful for debugging and evaluating LLM applications once you move beyond prototypes.

Start with: Try Phoenix tracing on a small RAG or agent app.

Videos

Educator videos are listed first. Similar videos are labelled and included when they cover the same skills or adjacent topics.

Phoenix by Arize

Educator video

Arize AI · evals, observability, tracing

LLM evaluation with W&B

Similar video

Weights & Biases · evals, llm apps, observability, mlops

Promptfoo red teaming

Similar video

Promptfoo · evals, prompt testing, red teaming, security

Learner questions

Who should learn from Arize AI?

AI engineers and ML teams should start here when they need observability, evals, tracing, and RAG debugging. It best suits learners who prefer material in these formats: docs, an open source tool, and examples.

What should I do first?

Try Phoenix tracing on a small RAG or agent app. After that, open one related resource below and write down the exact workflow, concept, or implementation pattern you want to apply.

What problem does this help with?

Useful for debugging and evaluating LLM applications once you move beyond prototypes. Use this profile when you are comparing educators by topic, level, format, and practical usefulness rather than browsing random AI content.

How do I compare this with other educators?

Compare the skill coverage, the starting recommendation, and the related videos. If you need observability, search the directory for that skill and shortlist three profiles before committing to a course, book, or playlist.

Related resources

Resource | Kind | Level | Use when
LLM Evals (Hamel Husain) | Guide | Intermediate | Your AI app needs quality checks before users see it.
W&B LLM Evaluation Course (Weights & Biases) | Free course | Intermediate | You need to debug and measure LLM app quality.
Phoenix by Arize (Arize AI) | Open source tool and docs | Intermediate | You need to trace, inspect, and evaluate LLM app behavior.
Langfuse Docs (Langfuse) | Docs and cookbooks | Intermediate | You need production LLM tracing, scoring, and prompt operations.
Promptfoo Intro (Promptfoo) | Open source docs | Intermediate | You need regression tests for prompts, models, and LLM outputs.
AI Evals for Engineers & PMs (Hamel Husain and Shreya Shankar) | Cohort course | Intermediate | You are shipping AI features and need a serious evaluation workflow.
Hamel's AI evals guides (Hamel Husain) | Guides | Intermediate to advanced | Use this when you want Hamel Husain's material for evals and related AI skills.
AI Evals for Engineers and PMs (Shreya Shankar) | Course | Intermediate | Use this when you want Shreya Shankar's material for evals and related AI skills.
W&B Courses (Weights & Biases) | Free courses | Intermediate | Use this when you want Weights & Biases's material for LLM apps and related AI skills.
Phoenix (Arize AI) | Docs | Intermediate | Use this when you want Arize AI's material for observability and related AI skills.