Prof. Emily M. Bender(she/her) on Nostr: Pro-tip: If someone claims a model is trained on "the entire internet" they don't ...
Pro-tip: If someone claims a model is trained on "the entire internet" they don't actually know what they're talking about and don't understand a) data b) dataset documentation and therefore c) how to reason about the model they're describing.
The internet is not a single thing you can go and download somewhere.
Published at
2023-08-15 00:06:37Event JSON
{
"id": "35cc6c9479508d1b91ce534a38ff33cc8c7df9da5a5baa97ed1374a8daf107c2",
"pubkey": "13ec9fd5058a18cd097d105fd6ef43759e37d5915b1c01ed36acf0ef5a3e6f2a",
"created_at": 1692057997,
"kind": 1,
"tags": [
[
"proxy",
"https://dair-community.social/users/emilymbender/statuses/110890712951225530",
"activitypub"
]
],
"content": "Pro-tip: If someone claims a model is trained on \"the entire internet\" they don't actually know what they're talking about and don't understand a) data b) dataset documentation and therefore c) how to reason about the model they're describing.\n\nThe internet is not a single thing you can go and download somewhere.",
"sig": "5775175f3b80b2ce2ce1f423ff5152d7877d56c707718ce2e21b24d1822c72a889868c219ffda4317b64901a3722b9b3c8af4a7d01a43c94447f9a12d05b8de9"
}