2023-05-31 17:15:35

OpenAI / @OpenAI (RSS Feed) on Nostr: We trained an AI using process supervision — rewarding the thought process rather ...

We trained an AI using process supervision — rewarding the thought process rather than the outcome — to achieve new state-of-art in mathematical reasoning. Encouraging sign for alignment of advanced AIs: https://openai.com/research/improving-mathematical-reasoning-with-process-supervision

https://nitter.moomoo.me/OpenAI/status/1663957407184347136#m
Author Public Key
npub1esppvh6nzc9xl25kx9lshmszu6j6rgp8ckxkm4gsmtx2vf69z5zs80zcsg