OpenAI employees publicly accuse xAI of publishing misleading Grok3 benchmark results

PANews | Feb 23, 2025 03:11

According to a report by Jin Shi, an OpenAI employee recently publicly accused xAI, Elon Musk's AI company, of publishing misleading benchmark results for its latest AI model, Grok3. In response, xAI co-founder Igor Babuschkin insisted that the company had done nothing wrong. According to xAI's chart, two versions of Grok3 (Grok3 Reasoning Beta and Grok3 mini Reasoning) outperformed OpenAI's strongest currently available model, o3-mini-high, on AIME 2025. However, OpenAI employees quickly pointed out on X that xAI's chart did not include o3-mini-high's AIME 2025 score at "cons@64". The term is short for "consensus@64": the model gets 64 attempts at each problem, and the answer it gives most often is taken as its final answer, which tends to raise benchmark scores. Babuschkin argued on X that OpenAI has published similarly misleading benchmark charts in the past, although those charts compared the performance of OpenAI's own models.
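To make the dispute over "cons@64" concrete, the following is a minimal sketch of how a consensus (majority-vote) score can differ from a single-attempt score on the same set of model outputs. The function names and the sample data are illustrative assumptions, not xAI's or OpenAI's evaluation code, and the numbers are not real AIME results.

```python
from collections import Counter

def single_attempt_score(samples, answers):
    """Fraction of problems where the first sampled answer is correct."""
    return sum(s[0] == a for s, a in zip(samples, answers)) / len(answers)

def consensus_score(samples, answers, k):
    """Fraction of problems where the most common of the first k
    sampled answers is correct (the idea behind cons@k)."""
    correct = 0
    for s, a in zip(samples, answers):
        # Majority vote over the first k attempts for this problem.
        majority, _ = Counter(s[:k]).most_common(1)[0]
        correct += majority == a
    return correct / len(answers)

# Made-up data: 2 problems, 5 sampled answers each (hypothetical values).
samples = [[7, 42, 42, 42, 13], [10, 10, 3, 10, 10]]
answers = [42, 10]

print(single_attempt_score(samples, answers))  # 0.5
print(consensus_score(samples, answers, k=5))  # 1.0
```

In this toy case the same outputs score 0.5 on a single attempt and 1.0 under majority voting, which is why a chart that mixes the two settings across models can be hard to compare fairly.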
