"WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts ...

2024-07-01 14:09:53

"WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia" when given passages containing contradictory facts, models struggle to generate answers reflecting the conflicting nature of the context.

(Hou et al, 2024)

https://t.co/cwZyqq42g6 https://t.co/TWWABihKwh

via https://twitter.com/WikiResearch/status/1807778222425112719

Author Public Key

npub18mhzju68ut9df8sqjk334j3cllwdg6r94e674zk9jcuenn3ymzssraydtc

Seen on

Show more details

WikiResearch on Nostr: "WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts ...