AI directory search

Search across educators, skills, and resources.

Use this when you know the topic you need: Claude Code, MCP, evals, RAG, agents, product, coding, prompting, foundations, or model internals.

14 matches for "testing"

GPT-5.5 Claude Fable 5 Gemini Deep Research Grok 4.3 MCP context engineering evals RAG OpenRouter coding agents

Video matches

Watch first when you want a fast feel for the topic before opening courses, docs, or profiles.

►

Promptfoo red teaming

Promptfoo · evals, prompt testing, red teaming, security

►

Made With ML

Made With ML · mlops, testing, deployment

Providers and platforms

Promptfoo

Promptfoo Docs · Intermediate

Very practical for regression testing prompts, model changes, and LLM outputs.

Topics

Prompt testing, Evals, Red teaming

Made With ML

Made With ML · Intermediate

Useful path for production ML fundamentals that transfer to AI engineering.

Topics

MLOps, Testing, Deployment, ML systems

Ollama

Ollama docs · Beginner to intermediate

Practical route into running and testing local models on your own machine.

Topics

Local models, LLM tools, AI engineering, Privacy

Agent tools and skill directories

Workflow skill catalog

AI Skills for Real Engineers

Matt Pocock / AI Hero · Skill catalog

Use this when you want opinionated coding-agent workflows instead of generic prompt snippets.

Start

Start with /teach, /grill-me, /to-prd, /to-issues, /tdd, /triage, or /handoff depending on the job.

Resources

Essential AI Coding Feedback Loops For TypeScript Projects

Guide · Matt Pocock · Intermediate

You want TypeScript checks, tests, linters, and review loops that help agents produce better code and catch regressions quickly.

ai coding, typescript, testing, feedback loops

My Skill Makes Claude Code GREAT At TDD

Guide / Claude skill · Matt Pocock · Intermediate

You want an agent workflow that implements behavior with a red, green, refactor loop instead of jumping straight to broad code changes.

claude skills, tdd, ai coding, testing

OpenAI Working with evals

Guide · OpenAI · Intermediate

You need API-level guidance for testing outputs, comparing models, and catching regressions during upgrades.

openai, evals, quality, regression testing, reliability

OpenAI Evaluate agent workflows

Guide · OpenAI · Intermediate

You need the current OpenAI path for tracing, grading, and regression-testing agent workflows instead of only single-prompt evals.

openai, agents, evals, traces, graders

OpenRouter models guide

Models guide · OpenRouter · Beginner to advanced

You need to compare many model families through one catalog before testing prompts across providers.

openrouter, model comparison, model routing, opus, claude

xAI Grok Build 0.1

Model docs · xAI · Intermediate

You want the specific Grok Build model details, pricing, and capabilities before testing xAI for agentic coding work.

xai, grok build, coding agents, model selection, agentic coding

►

Promptfoo Intro

Open source docs · Promptfoo · Intermediate

You need regression tests for prompts, models, and LLM outputs.

evals, prompt testing, red teaming

►

Made With ML

Free course · Made With ML · Intermediate

You need production ML habits that transfer to AI systems.

mlops, testing, deployment