Matty on Nostr: Do LLMs load their entire model in VRAM?
Published at 2024-12-16 00:04:06

Event JSON
{
  "id": "ee0d889dcacad37776bd20a7eef6068199ae64c2a4f1a351e27bb098bcf3fc92",
  "pubkey": "8b347916be2cb3ab9687c9eb78a8d05224c045bce5b416bdd50169965eb0f45c",
  "created_at": 1734307446,
  "kind": 1,
  "tags": [
    [
      "p",
      "ec98af9bf345d05a4059d2b0687bc2dfb9a420f0baa8027b1b778ae9cae3e384",
      "wss://relay.mostr.pub"
    ],
    [
      "e",
      "d2886563448700032bc651747d4932431a612f2c941c0831ab0b46a5899b9d79",
      "wss://relay.mostr.pub",
      "reply"
    ],
    [
      "proxy",
      "https://nicecrew.digital/objects/84c9f662-7ea5-4757-ac79-6055cc12facb",
      "activitypub"
    ]
  ],
  "content": "Do LLMs load their entire model in VRAM?",
  "sig": "86772afd033f97fdbb8c11f196f2c98d37c8c4baf909a8074fe06c60f62f54be25096f864396def25a8b13a70d4fdf017066f08ff0660a184e8764c25add83af"
}
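
The JSON above is a standard Nostr kind-1 text note. Per NIP-01, the `id` field is the SHA-256 hash of a canonical serialization of the other fields (`[0, pubkey, created_at, kind, tags, content]`, JSON-encoded with no extra whitespace), so any client can recompute it to check the event's integrity. A minimal sketch in Python (the function name is illustrative, not part of any Nostr library):

```python
import hashlib
import json

def nostr_event_id(pubkey, created_at, kind, tags, content):
    # NIP-01 canonical form: a JSON array [0, pubkey, created_at, kind,
    # tags, content] with no whitespace between tokens and non-ASCII
    # characters left unescaped. The event id is its SHA-256 hex digest.
    serialized = json.dumps(
        [0, pubkey, created_at, kind, tags, content],
        separators=(",", ":"),
        ensure_ascii=False,
    )
    return hashlib.sha256(serialized.encode("utf-8")).hexdigest()
```

Feeding in the `pubkey`, `created_at`, `kind`, `tags`, and `content` values from the event above should reproduce its `id`; the `sig` is then a Schnorr signature over that id, verifiable against the `pubkey`.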