RLHF (Reinforcement Learning from Human Feedback)

A training technique that uses human evaluators to rank AI outputs, teaching the model to produce responses humans actually prefer. The discipline behind why modern AI engines sound coherent and helpful rather than technically correct and useless.

← Back to Glossary