Extracting target signal
Original Claud
1
Mentions
6.0K
Views
“Grouped with original ChatGPT as models trained using base model + supervised finetuning + reward model + RL/optimization.”