Qwen 3 numbers are in! They did a good job this time, compared to 2.5 and QwQ numbers ...

2025-05-01 15:26:59

Qwen 3 numbers are in! They did a good job this time, compared to 2.5 and QwQ numbers are a lot better.

I used 2 GGUFs for this, one from LMStudio and one from Unsloth. Number of parameters: 235B A22B. The first one is Q4. Second one is Q8.

The LLMs that did the comparison are the same, Llama 3.1 70B and Gemma 3 27B.

So I took 2*2 = 4 measurements for each column and took average of measurements.

My leaderboard is pretty unrelated to others it seems. Valuable in that sense, it is another non-mainstream angle for model evaluation.

More info: https://huggingface.co/blog/etemiz/aha-leaderboard

Author Public Key

npub1nlk894teh248w2heuu0x8z6jjg2hyxkwdc8cxgrjtm9lnamlskcsghjm9c

Seen on

wss://nostr.mom wss://nos.lol wss://relay.nostr.band wss://relay.damus.io

Show more details

someone on Nostr: Qwen 3 numbers are in! They did a good job this time, compared to 2.5 and QwQ numbers ...