Nathan Lambert
RLHF Book · Advanced
Good route into how models are trained after pretraining.
Skills
RLHF, Post-training, Alignment
AI directory search
Use this when you know the topic you need: Claude Code, MCP, evals, RAG, agents, product, coding, prompting, foundations, or model internals.
5 matches for "Post-training"
RLHF Book · Advanced
Good route into how models are trained after pretraining.
Skills
RLHF, Post-training, Alignment
Hugging Face Learn · Beginner to advanced
One of the best free ecosystems for learning open-source AI by building with models, datasets, spaces, agents, context engineering, and MCP workflows.
Topics
Agents, Context engineering, MCP, Transformers, Post-training, Open models
Book · Nathan Lambert · Advanced
You want to understand post-training and preference optimization.
rlhf, alignment, post-training
Free course · Hugging Face · Intermediate
You want a current structured course on instruction tuning, fine-tuning, and evaluation around compact open models.
fine-tuning, post-training, open models, evaluation, smollm
Book · Nathan Lambert · Advanced
Use this when you want Nathan Lambert's material for rlhf and related AI skills.
RLHF, Post-training, Alignment