AI directory search

Search across educators, skills, and resources.

Use this when you know the topic you need: Claude Code, MCP, evals, RAG, agents, product, coding, prompting, foundations, or model internals.

3 matches for "RLHF"

Educators

Good route into how models are trained after pretraining.

Skills

RLHF, Post-training, Alignment

Resources

RLHF Book

Book · Nathan Lambert · Advanced

You want to understand post-training and preference optimization.

rlhf, alignment, post-training

RLHF Book

Book · Nathan Lambert · Advanced

Use this when you want Nathan Lambert's material for rlhf and related AI skills.

RLHF, Post-training, Alignment