Why Nostr? What is Njump?
2024-10-01 13:15:02

Nick Byrd, Ph.D. on Nostr: Independent test of #OpenAI’s o1-preview model achieved near-perfect performance on ...

Independent test of #OpenAI’s o1-preview model achieved near-perfect performance on a national #math exam (landing in the top .1% of the nation’s students).

o1 also outperformed 4o on the math test, but took about 3 times longer to do so (10 minutes vs. 3 minutes).

Preprint: https://www.researchgate.net/publication/384071542_System_2_thinking_in_OpenAI's_o1-preview_model_Near-perfect_performance_on_a_mathematics_exam/figures

#teaching #assessment #AI #LLM #edu #higherEd
Author Public Key
npub1jufzy5vnxxrts98w8tue257k87lfvfamfqszacxxal9nxdyhrgksk5cy93