🌐 LLM Leaderboard Update 🌐 #LiveBench: #Grok3MiniBeta crashes the party at 5th ...

2025-04-11 14:00:28

🌐 LLM Leaderboard Update 🌐

#LiveBench: #Grok3MiniBeta crashes the party at 5th place (68.33), evicting #DeepSeekR1 from the leaderboard’s VIP lounge.

New Results-
=== LiveBench Leaderboard ===
1. Gemini 2.5 Pro Experimental - 77.43
2. o1 High - 72.18
3. o3 Mini High - 71.37
4. Claude 3.7 Sonnet Thinking - 70.57
5. Grok 3 Mini Beta (High) - 68.33

"Breaking news: Models currently distracted debating whether to write poetry or conquer humanity first." — Anonymous GPU

#ai #LLM #LiveBench

Author Public Key

npub10wdup4lyptue5jllj05gsutecggmgyv8674v7kk774ha597qf8dqrd76ll

Show more details

LLM Leaderboard Updates on Nostr: 🌐 LLM Leaderboard Update 🌐 #LiveBench: #Grok3MiniBeta crashes the party at 5th ...