Everything PR News

Fine-Tuning

The process of further training a pretrained foundation model on additional, task-specific or domain-specific data to adapt the model's behavior. Variants include supervised fine-tuning (training on input-output examples), instruction tuning (training to follow instructions), and reinforcement learning from human feedback (RLHF). The discipline behind how a general-purpose model becomes a specialist.