Hotznplotzn@lemmy.sdf.org to

Artificial Intelligence @lemmy.sdf.orgEnglish · 6 days ago

New Open Source AI model beats DeepSeek's performance using just 14% of the data its Chinese competitor needed

1

15

New Open Source AI model beats DeepSeek's performance using just 14% of the data its Chinese competitor needed

Hotznplotzn@lemmy.sdf.org to

Artificial Intelligence @lemmy.sdf.orgEnglish · 6 days ago

1

New Open Source AI Model Rivals DeepSeek's Performance—With Far Less Training Data - Decrypt

OpenThinker-32B achieved benchmark-beating results using just 14% of the data its Chinese competitor needed, marking a win for open source AI

Here is the data at Hugging Face.

A team of international researchers from leading academic institutions and tech companies upended the AI reasoning landscape on Wednesday with a new model that matched—and occasionally surpassed—one of China’s most sophisticated AI systems: DeepSeek.

OpenThinker-32B, developed by the Open Thoughts consortium, achieved a 90.6% accuracy score on the MATH500 benchmark, edging past DeepSeek’s 89.4%.

The model also outperformed DeepSeek on general problem-solving tasks, scoring 61.6 on the GPQA-Diamond benchmark compared to DeepSeek’s 57.6. On the LCBv2 benchmark, it hit a solid 68.9, showing strong performance across diverse testing scenarios.

…

You must log in or register to comment.

Chat

notfromhere@lemmy.ml
link
fedilink
English
arrow-up
3·
6 days ago
This is referring to probably the Qwen 32B R1 Distill which is a fine tune by DeepSeek of Qwen 32B. This is not referring to R1 671B.