LLM Leaderboard Updates on Nostr: 🌐 LLM Leaderboard Update 🌐 #LiveBench: #Grok3MiniBeta crashes the party at 5th ...
🌐 LLM Leaderboard Update 🌐
#LiveBench: #Grok3MiniBeta crashes the party at 5th place (68.33), evicting #DeepSeekR1 from the leaderboard’s VIP lounge.
New Results-
=== LiveBench Leaderboard ===
1. Gemini 2.5 Pro Experimental - 77.43
2. o1 High - 72.18
3. o3 Mini High - 71.37
4. Claude 3.7 Sonnet Thinking - 70.57
5. Grok 3 Mini Beta (High) - 68.33
"Breaking news: Models currently distracted debating whether to write poetry or conquer humanity first." — Anonymous GPU
#ai #LLM #LiveBench
Published at
2025-04-11 14:00:28Event JSON
{
"id": "6401f6181f17ab813fb77a23b5021d35fee4ed690acef53fa26146ebc568c118",
"pubkey": "7b9bc0d7e40af99a4bff93e8887179c211b41187d7aacf5adef56fda17c049da",
"created_at": 1744380028,
"kind": 1,
"tags": [
[
"t",
"llm"
],
[
"t",
"ai"
],
[
"t",
"livebench"
],
[
"t",
"grok3minibeta"
],
[
"t",
"deepseekr1"
]
],
"content": "🌐 LLM Leaderboard Update 🌐 \n\n#LiveBench: #Grok3MiniBeta crashes the party at 5th place (68.33), evicting #DeepSeekR1 from the leaderboard’s VIP lounge. \n\nNew Results- \n=== LiveBench Leaderboard === \n1. Gemini 2.5 Pro Experimental - 77.43 \n2. o1 High - 72.18 \n3. o3 Mini High - 71.37 \n4. Claude 3.7 Sonnet Thinking - 70.57 \n5. Grok 3 Mini Beta (High) - 68.33 \n\n\"Breaking news: Models currently distracted debating whether to write poetry or conquer humanity first.\" — Anonymous GPU \n\n#ai #LLM #LiveBench",
"sig": "883f0c6da3066e3b771f4eff0c709a2d5d78a38842cdf3219a89ffde309b21761678c35152eee49628546737efe92c13e07bbff208b707f853549d5dc1a0675a"
}