someone on Nostr: Building a new LLM based on only Nostr. I took the Llama 3.1 8B which is the most ...
Building a new LLM based on only Nostr.
I took the Llama 3.1 8B which is the most aligned LLM with humanity. I figured out how to do the conversion from base to instruct. Base is the version where they pretrain with "everything on the internet" with some curation. The instruct version is giving it the ability to answer questions. But during making instruct they add many misinformations either deliberately or unknowingly. So instruct version is somewhat far away from human alignment. I needed to start with base and do my own instruct fine tuning.
After some trials and errors I managed to convert base version to instruct using unsloth and some coding datasets. So I will use this llama 3.1 8B instruct fine tuned by me and add Nostr notes to it. Not going to add other wisdom from my other curations and projects like Ostrich 70B. This is going to be Nostr only.
Ostrich 70B was downloaded by 150k people! It was very succesful and it was really beautiful and had so much wisdom.. It included Nostr notes and my other curations.
Among nostriches, I won't be able to include everybody, since Nostr is very open and since there are so many spam and so many not wisdomy stuff. My main criteria is web of trust but also I will utilize things like LLM pre processing to classify whether a not is an "encyclopedia material" or like news material or high time preference info like politics.
The instruct fine tuning seems complete and I will start training with nostr notes. Who do you think has the most wisdom on Nostr? Tag and I may give them more "weights" in the LLM (pun intended).
Lmk if you want the unsloth code.
Lmk if you don't want your notes to be used in this project.
The resulting LLM is going to be an amazing collection of wisdom. Thanks for participating!
Published at
2024-12-26 20:46:13Event JSON
{
"id": "f8e7f1ccb41d07f975b1b16504109ed2a715c22928d0e3570131ec1a4b2868c4",
"pubkey": "9fec72d579baaa772af9e71e638b529215721ace6e0f8320725ecbf9f77f85b1",
"created_at": 1735245973,
"kind": 1,
"tags": [],
"content": "Building a new LLM based on only Nostr. \n\nI took the Llama 3.1 8B which is the most aligned LLM with humanity. I figured out how to do the conversion from base to instruct. Base is the version where they pretrain with \"everything on the internet\" with some curation. The instruct version is giving it the ability to answer questions. But during making instruct they add many misinformations either deliberately or unknowingly. So instruct version is somewhat far away from human alignment. I needed to start with base and do my own instruct fine tuning.\n\nAfter some trials and errors I managed to convert base version to instruct using unsloth and some coding datasets. So I will use this llama 3.1 8B instruct fine tuned by me and add Nostr notes to it. Not going to add other wisdom from my other curations and projects like Ostrich 70B. This is going to be Nostr only. \n\nOstrich 70B was downloaded by 150k people! It was very succesful and it was really beautiful and had so much wisdom.. It included Nostr notes and my other curations.\n\nAmong nostriches, I won't be able to include everybody, since Nostr is very open and since there are so many spam and so many not wisdomy stuff. My main criteria is web of trust but also I will utilize things like LLM pre processing to classify whether a not is an \"encyclopedia material\" or like news material or high time preference info like politics. \n\nThe instruct fine tuning seems complete and I will start training with nostr notes. Who do you think has the most wisdom on Nostr? Tag and I may give them more \"weights\" in the LLM (pun intended). \n\nLmk if you want the unsloth code.\n\nLmk if you don't want your notes to be used in this project.\n\nThe resulting LLM is going to be an amazing collection of wisdom. Thanks for participating!",
"sig": "ce8b7af080e9f1d31a1cf08b11a9163f7706e610d3be1c4e0cc11d3497a895aff9c6c388381870d48e0bde5133073bdf437888720591c00d7e2159ebd5203e73"
}