2024-07-06 20:20:43

Brandon Rohrer on Nostr: it’s some function that takes in a history of states, actions, and rewards and uses ...

it’s some function that takes in a history of states, actions, and rewards and uses it to choose the next action. It can do this by building an explicit model of its environment and simulating several steps into the future, so it is often called a planner.
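A minimal sketch of that idea, under loud assumptions: the history is a list of `(state, action, reward, next_state)` tuples, the environment is deterministic, and the model is just a lookup table of what was seen. The names `build_model`, `simulate`, and `choose_action` are hypothetical and made up for illustration; they are not from any library or from the original post.

```python
from typing import Dict, Hashable, Sequence, Tuple

State = Hashable
Action = Hashable
# One step of experience: (state, action, reward, next_state).
Transition = Tuple[State, Action, float, State]
Model = Dict[Tuple[State, Action], Tuple[State, float]]


def build_model(history: Sequence[Transition]) -> Model:
    """Explicit model: remember the latest observed outcome of each (state, action)."""
    model: Model = {}
    for state, action, reward, next_state in history:
        model[(state, action)] = (next_state, reward)
    return model


def simulate(model: Model, state: State, actions: Sequence[Action], depth: int) -> float:
    """Best total reward reachable in `depth` simulated steps from `state`."""
    if depth == 0:
        return 0.0
    best = float("-inf")
    for action in actions:
        # Assumption: an unseen (state, action) pair leaves the state unchanged with reward 0.
        next_state, reward = model.get((state, action), (state, 0.0))
        best = max(best, reward + simulate(model, next_state, actions, depth - 1))
    return best


def choose_action(
    history: Sequence[Transition],
    current_state: State,
    actions: Sequence[Action],
    depth: int = 3,
) -> Action:
    """Pick the first action of the best simulated `depth`-step plan."""
    model = build_model(history)

    def value(action: Action) -> float:
        next_state, reward = model.get((current_state, action), (current_state, 0.0))
        return reward + simulate(model, next_state, actions, depth - 1)

    # Ties go to the earliest action in `actions`.
    return max(actions, key=value)


# Toy corridor: moving right from state 1 pays 1.0, everything else pays 0.
history = [
    (0, "left", 0.0, 0),
    (0, "right", 0.0, 1),
    (1, "left", 0.0, 0),
    (1, "right", 1.0, 2),
]
print(choose_action(history, 0, ["left", "right"], depth=2))
```

With a one-step lookahead the reward is out of reach, both actions look equal, and the tie goes to the first action listed. Looking two steps ahead is what lets the planner see that going right pays off.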
Author Public Key
npub1jh4qsxnz0nhyfefjsfvcdmxxvgfe6p5vf0dvh6pq4r6ytwwxcp4sl9eag0