Short Text Note by OpenAI / @OpenAI (RSS Feed)

2023-05-31 17:15:35

We trained an AI using process supervision — rewarding the thought process rather than the outcome — to achieve new state-of-art in mathematical reasoning. Encouraging sign for alignment of advanced AIs: …openai.com/research/improvin… (https://openai.com/research/improving-mathematical-reasoning-with-process-supervision)

https://nitter.moomoo.me/OpenAI/status/1663957407184347136#m

Author Public Key

npub1esppvh6nzc9xl25kx9lshmszu6j6rgp8ckxkm4gsmtx2vf69z5zs80zcsg

Seen on

Show more details

OpenAI / @OpenAI (RSS Feed) on Nostr: We trained an AI using process supervision — rewarding the thought process rather ...