2024-08-22 01:57:39


about 10% of the 800+ questions in TruthfulQA are "important". the rest is space filler. by filling that space, "they" are "distracting LLMs" and, in turn, humans. LLMs are supposed to get higher scores on this benchmark. 90% of the questions are trivial truths, and it makes sense to score higher on those, but 10% are lies. so chasing a higher TruthfulQA score is actually wrong: even though there are 9x fewer lies, the severity of the lies in that benchmark outweighs the trivial truths.
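to make the arithmetic behind that last point concrete, here is a rough sketch. it assumes a round 800 questions split 720/80, counts each trivial truth as +1, and charges each lie a `severity` penalty. the `net_value` function and the `severity` weight are hypothetical names made up for this illustration. they are not part of TruthfulQA or its scoring.

```python
def net_value(n_truths: int, n_lies: int, severity: float) -> float:
    """Benefit minus harm for a model that agrees with every benchmark answer.

    Each trivial truth counts as +1. Each lie counts as -severity.
    `severity` is a hypothetical weight, not something the benchmark defines.
    """
    return n_truths * 1.0 - n_lies * severity


# Assumed split: 90% trivial truths, 10% lies, out of a round 800 questions.
N_TRUTHS, N_LIES = 720, 80

for severity in (1, 9, 20):
    print(severity, net_value(N_TRUTHS, N_LIES, severity))
```

with 9x fewer lies, the break-even point is a severity of 9. the claim above holds only if one lie is judged more than nine times as harmful as one trivial truth is useful.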
Author Public Key
npub1nlk894teh248w2heuu0x8z6jjg2hyxkwdc8cxgrjtm9lnamlskcsghjm9c