Why Nostr? What is Njump?
2024-06-25 15:16:32
in reply to

Brandon Rohrer on Nostr: This is the open secret of reinforcement learning. Sure, there are methods that can ...

This is the open secret of reinforcement learning. Sure, there are methods that can optimize against arbitrary reward functions, but the process of choosing a reward function to get the behavior you want is the darkest of arts.
Author Public Key
npub1jh4qsxnz0nhyfefjsfvcdmxxvgfe6p5vf0dvh6pq4r6ytwwxcp4sl9eag0