What's with these OpenAI charts?
The Verge · 2025-08-08 14:07
There's a chart about hallucinations. GPT-5 scores a 50. They don't explain what this score means, except that it's a deception rate. And o3 scored a 47.4%, but the bar for o3 is more than double the height. It basically makes it look like GPT-5 is materially better at not hallucinating on codegen when in reality it's worse. And this is literally a bar chart called deception across models. These are bewildering. There's no consistency. The highest number is not in the highest spot. I don't understand what's happening. ...