npub12lx9hrk3glyf6w4rceguw29qjxlqfvelznkrrlm67c32a7s5u8yqf5mpam (npub12lx…mpam) Wouldn't using int8 if you can increase performance?
The thing about the 8 GPUs is just bad programming. I understand how it happened. They just built software for their hardware platform. However if they were to ever change hardware they'd have to rewrite large chunks of their code. Better to just do it correctly from the start.