katie on Nostr: The solution to these problems is actually smaller models solving more specific ...
The solution to these problems is actually smaller models solving more specific tasks. That’s his argument. The larger the model, the more abstract it gets - it’s good at connecting dots, but will start to struggle with precision. He’s not saying a model can’t do this; he’s saying large LLMs do some things well and some things less well, and viewing them as a one-stop shop for all intelligence is not reasonable. We need RI, we need more specialized tasks, and we need a lot more research in reasoning models.
Published at
2023-04-19 01:10:14
Event JSON
{
"id": "892f2ff6b15fbd023e198f733d11e6e9788d9ee5557a58e32d717295b4e0e3b9",
"pubkey": "07eced8b63b883cedbd8520bdb3303bf9c2b37c2c7921ca5c59f64e0f79ad2a6",
"created_at": 1681866614,
"kind": 1,
"tags": [
[
"e",
"494bc5c6d8516f1bb712b0dcbe5a43a11724ff0f4ce44ba45fa87bd33ed8192d"
],
[
"e",
"e8bfec3dfb58b7672d869ae31eb54abf1e9ec17d9a6f7723b5a5e4da7e1bfa6a"
],
[
"p",
"1bc70a0148b3f316da33fe3c89f23e3e71ac4ff998027ec712b905cd24f6a411"
],
[
"p",
"9baed03137d214b3e833059a93eb71cf4e5c6b3225ff7cd1057595f606088434"
]
],
"content": "The solution to these problems is actually smaller models solving more specific tasks. That’s his argument. The larger the model, the more abstract - its good at connecting dots, but will start to struggle with precision. He’s not saying a model can’t do this, he’s saying large LLMs do some things well and some things less well and viewing it as a one stop shop for all intelligence is not reasonable. We need RI, we need more specialized tasks, and we need a lot more research in reasoning models. ",
"sig": "dca229951d8e681b49ba2c5fdbb7c018a3ab9fb4904aab32809cc4ce3e8ecc1877ab12514e879d09e4100884507e7d67a674b7c4f48dc6e6927a77d92befb70d"
}