Why Nostr? What is Njump?
2025-03-11 17:38:20

Terence Tao on Nostr: An interesting experiment on #MathOverflow, where a user gave 15 different MO ...

An interesting experiment on #MathOverflow, where a user gave 15 different MO problems for o-1 to answer, with the aim of verifying and then rewriting the answer into a presentable form if the AI generated answer was correct. The outcome was: one question answered correctly, verified, and rewritten; one question given a useful lead, which led the experimenter to find a more direct answer; one possibly correct answer that the experimenter was not able to verify; and the remainder described as "a ton of time consuming chaos", in which the experimenter spent much time trying to verify a hallucinated response before giving up. https://meta.mathoverflow.net/questions/6114/capabilities-and-limits-of-ai-on-mathoverflow

I found the discussion for possible AI disclosure policies for MO in the post to also be interesting.
Author Public Key
npub1hsf727dlfy55vvm5wuqwyh457uwsc24pxn5f7vxnd4lpvv8phw3sjm7r3k