I got IBM's 34b granite instruction model up and running in a box with a bunch of ...

2024-08-06 00:45:17

I got IBM's 34b granite instruction model up and running in a box with a bunch of nVidia GPUs today, but basing on their demo code it oddly pulled memory from the GPUs and then did all the processing on one cpu thread, which took several minutes for one query. The query results were actually good, but about the most compute expensive way to possibly do it.

Author Public Key

npub1hmhz33j9p66ayxlegfuarul5refyepatlp3e8q6mghn2r6vrjrns0msatr

Show more details

Scott Williams 🐧 on Nostr: I got IBM's 34b granite instruction model up and running in a box with a bunch of ...