2024-08-16 14:38:20

José A. Alonso on Nostr: DeepSeek-Prover-V1.5: Harnessing proof assistant feedback for reinforcement learning ...

DeepSeek-Prover-V1.5: Harnessing proof assistant feedback for reinforcement learning and Monte-Carlo tree search. ~ Huajian Xin et al. https://www.arxiv.org/abs/2408.08152 #ITP #Lean4
Author Public Key
npub1pmahhjgr7nr8zmx56purp56y6747tds859tdu0x7rtq6t0ez4cwqfnv8pw