Why Nostr? What is Njump?
2025-03-10 17:46:18
in reply to

Terence Tao on Nostr: nprofile1q…8g23r Actually, I think the problem of untrustworthy sources on the ...

Actually, I think the problem of untrustworthy sources on the internet (particularly on social media) predates generative AI, though certainly AI bots and "deepfake" images exacerbate the issue. With or without generative AI, it has become increasingly important to know how to independently verify information.

In the specific realm of pure mathematics, though, there is a potential solution to this problem by directing generative AI output to pass through a formal proof assistant to obtain a guarantee of correctness. At present, the experiments in this direction are only capable of resolving low-level undergraduate problems (such as computing a definite integral) by this approach, and it is still not clear whether the high-level conceptual component of a LLM-generated answer to a mathematical question can be captured by such formal languages; but I would imagine that requiring the LLM to formally verify at least some of the finer details of their output would significantly increase their broader reliability. (A similar phenomenon has already been observed in LLM-based solutions to Math Olympiad type challenges, in which models which do not directly attempt to answer the question, but instead create code in a more reliable language such as Python to solve the problem, significantly outperform pure LLM models.)
Author Public Key
npub1hsf727dlfy55vvm5wuqwyh457uwsc24pxn5f7vxnd4lpvv8phw3sjm7r3k