Before launching, GPT-4o broke records on chatbot leaderboard under a secret name

May 13, 2024

0 275 Less than a minute

Enlarge (credit: Getty Images)

On Monday, OpenAI employee William Fedus confirmed on X that a mysterious chat-topping AI chatbot known as “gpt-chatbot” that had been undergoing testing on LMSYS’s Chatbot Arena and frustrating experts was, in fact, OpenAI’s newly announced GPT-4o AI model. He also revealed that GPT-4o had topped the Chatbot Arena leaderboard, achieving the highest documented score ever.

“GPT-4o is our new state-of-the-art frontier model. We’ve been testing a version on the LMSys arena as im-also-a-good-gpt2-chatbot,” Fedus tweeted.

Chatbot Arena is a website where visitors converse with two random AI language models side by side without knowing which model is which, then choose which model gives the best response. It’s a perfect example of vibe-based AI benchmarking, as AI researcher Simon Willison calls it.

Read 8 remaining paragraphs | Comments

Before launching, GPT-4o broke records on chatbot leaderboard under a secret name

Leave a Reply Cancel reply

Microsoft’s VASA-1 can deepfake a person with one photo and one audio track

Universities Must Defend Their Independence by Rejecting Trump’s “Compact”

Circumcision, Tylenol, and Autism? RFK Jr. Misses the Cut

Microsoft warns of new “Payroll Pirate” scam stealing employees’ direct deposits

Maria Corina Machado, Venezuelan Champion of Freedom, Wins the Nobel Peace Prize

Friday Feature: Arrows Christian Academy

iPhone keyboard for blind to shut down as maker cites Apple “abuse” of developers

Users fume after My Cloud network breach locks them out of their data

Author discovers AI-generated counterfeit books written in her name on Amazon

Authorities bust SIM-swap ring they say took millions from the rich and famous