Right, with respect to how many RLHF pairs, you can compare it to prior result of ...

2025-05-12 21:43:38

Right, with respect to how many RLHF pairs, you can compare it to prior result of ~0%. But what I don't understand, and especially with the hype claims the paper makes about "autonomous super-human reasoning", is why can't they just keep running it and get much higher than 50%? Seems like there's another aspect that is preventing getting higher scores, and makes me wonder if these architectures are really just plateauing.

Don't get me wrong, it's some good work; it's just the language of the paper has some ridiculous hype.

Author Public Key

npub1mgvwnpsqgrem7jfcwm7pdvdfz2h95mm04r23t8pau2uzxwsdnpgs0gpdjc

Show more details

Dustin on Nostr: Right, with respect to how many RLHF pairs, you can compare it to prior result of ...