quoting nevent1q…g4mk
Made a bot to save myself having to compulsively check all the LLM benchmarks I care about every day. Gonna add ARC-AGI when I get a chance.
Impressed by the new Gemini 2.5 Flash today, for such a small model!
nevent1q…lf6k
#devstr #vibecoding you might like this: it includes Aider Polyglot and SWE-bench Verified
Joe Resident on Nostr: Made this bot ...