Lesson 17 of 20 · Smart Helpers
Fine-tuning and RLHF
After pre-training, models are further fine-tuned using reinforcement learning from human feedback (RLHF). Human reviewers rank candidate responses, a reward model learns those preferences, and the model is then optimized to prefer helpful, safe outputs, as sketched in the code below.
- RLHF uses human rankings of model outputs as a training signal to improve the model.
- Fine-tuning aligns a model's behavior with human preferences for helpfulness and safety.
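To make the ranking step concrete, here is a minimal, hypothetical sketch in PyTorch of how a reward model can be trained from human preference pairs. The tiny model, embedding size, and random data are made up for illustration; this is not a full RLHF pipeline.

```python
import torch
import torch.nn as nn

class TinyRewardModel(nn.Module):
    """Toy stand-in for an LLM-based reward model: maps a response embedding to a scalar score."""
    def __init__(self, dim=16):
        super().__init__()
        self.score = nn.Linear(dim, 1)

    def forward(self, x):
        return self.score(x).squeeze(-1)

reward_model = TinyRewardModel()
optimizer = torch.optim.Adam(reward_model.parameters(), lr=1e-3)

# Pretend embeddings for responses a human ranked higher (chosen) vs. lower (rejected).
chosen = torch.randn(8, 16)    # batch of 8 preferred-response embeddings (illustrative only)
rejected = torch.randn(8, 16)  # batch of 8 rejected-response embeddings

for step in range(100):
    r_chosen = reward_model(chosen)
    r_rejected = reward_model(rejected)
    # Pairwise ranking loss: push the chosen response's score above the rejected one's.
    loss = -torch.nn.functional.logsigmoid(r_chosen - r_rejected).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# The trained reward model then supplies the reward signal used to fine-tune the LLM with RL (e.g., PPO).
```

The key idea is the pairwise loss: rather than labeling any single response as "good", the reward model only needs humans to say which of two responses is better.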
Think about it
What does it mean to fine-tune an LLM?
