neigsendoig on Nostr:
That sounds like a Common Crawl model, which is not a good dataset to use AI on. At least it's "open-source" (It's ethical source, let's be real here).
Maybe I could interest you in taking a look at non-Common Crawl models like DeepSeek R1 if reasoning modeling is being planned?
Published at
2025-05-01 23:47:57
Event JSON
{
"id": "32d2b28f9929ed585d82a1341e6551dad77cdf6e18d8006cf40652b6a4471bda",
"pubkey": "d9a329afabb263a8af216838eb45e1f04cc3000704464947c4b907e1bef580d7",
"created_at": 1746143277,
"kind": 1,
"tags": [
[
"e",
"79e39030dfb709a56794d0b5b90cea1f1face473fd1a4b1105e8e4a0e6dfa674",
"wss://relay.primal.net",
"root"
],
[
"e",
"b6d6eaad95ec162fb4c4694dbeb89101669dc531f3ba3c111d1feac848dd9217",
"wss://relay.primal.net",
"reply"
],
[
"p",
"7dc38be721c89e9fd382d12555f45aa17efea3c89e9130c81c320f5b4f44066d"
],
[
"p",
"d9a329afabb263a8af216838eb45e1f04cc3000704464947c4b907e1bef580d7"
],
[
"p",
"8ea485266b2285463b13bf835907161c22bb3da1e652b443db14f9cee6720a43"
]
],
"content": "That sounds like a Common Crawl model, which is not a good dataset to use AI on. At least it's \"open-source\" (It's ethical source, let's be real here).\n\nMaybe I could interest you in taking a look at non-Common Crawl models like DeepSeek R1 if reasoning modeling is being planned?",
"sig": "6ca034d13a0b90f4783ea4fd2151b797b09150054f57e88deaf393ba76c28d2ff4835bef05744c61cb8a34de7c471a896f4aecf6530df03c462b0e72f44303b5"
}
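
For anyone reading the raw event, here is a minimal Python sketch of how its fields might be interpreted. It assumes the "root" and "reply" markers on the "e" tags follow the usual marked-tag convention for threaded replies, that the "p" tags list the pubkeys being notified, and that the "Published at" time is shown in UTC. The `summarize` helper and the `EVENT_JSON` name are made up for illustration and are not part of any Nostr library; the event is trimmed to the fields used here.

```python
import json
from datetime import datetime, timezone

# Trimmed copy of the event above (content and sig omitted for brevity).
EVENT_JSON = """
{
  "created_at": 1746143277,
  "kind": 1,
  "tags": [
    ["e", "79e39030dfb709a56794d0b5b90cea1f1face473fd1a4b1105e8e4a0e6dfa674", "wss://relay.primal.net", "root"],
    ["e", "b6d6eaad95ec162fb4c4694dbeb89101669dc531f3ba3c111d1feac848dd9217", "wss://relay.primal.net", "reply"],
    ["p", "7dc38be721c89e9fd382d12555f45aa17efea3c89e9130c81c320f5b4f44066d"],
    ["p", "d9a329afabb263a8af216838eb45e1f04cc3000704464947c4b907e1bef580d7"],
    ["p", "8ea485266b2285463b13bf835907161c22bb3da1e652b443db14f9cee6720a43"]
  ]
}
"""


def summarize(event):
    """Hypothetical helper: pull thread position and timestamp out of a note."""
    # created_at is a Unix timestamp in seconds; render it in UTC (assumption).
    published = datetime.fromtimestamp(event["created_at"], tz=timezone.utc)
    thread = {}    # marker ("root" / "reply") -> referenced event id
    mentions = []  # pubkeys listed in "p" tags
    for tag in event["tags"]:
        if tag[0] == "e" and len(tag) >= 4:
            thread[tag[3]] = tag[1]
        elif tag[0] == "p" and len(tag) >= 2:
            mentions.append(tag[1])
    return {
        "published": published.strftime("%Y-%m-%d %H:%M:%S"),
        "root": thread.get("root"),
        "reply": thread.get("reply"),
        "mentions": mentions,
    }


summary = summarize(json.loads(EVENT_JSON))
print(summary["published"])
print("thread root:", summary["root"])
print("replying to:", summary["reply"])
print("pubkeys tagged:", len(summary["mentions"]))
```

Read this way, the note is a reply inside a thread: the "root" tag points at the first post of the conversation and the "reply" tag at the specific note being answered.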