Reinforcement Fine-Tuning—12 Days of OpenAI Day 2
名人访谈·2024-12-09 11:57
Hi, everyone. My name is Mark, and I lead research at OpenAI. Yesterday, we took O1 out of preview, and we launched it in Chat GPT. We're soon going to launch it in the API. If you haven't been following O1, it's our latest series of model improvements that allow the models to think for a while before they come back with a response. Today, we're really excited to preview our latest advancement in our model customization program. It'll let users fine-tune O1 on their own datasets. And again, this isn't stand ...