30
Mozilla's Llamafile 0.8.2 Scores Big With New AVX2 Performance Optimizations
(www.phoronix.com)
Community to discuss about LLaMA, the large language model created by Meta AI.
This is intended to be a replacement for r/LocalLLaMA on Reddit.
I just wanted to update this to mention that there are a lot of custom low level performance improvements for CPU based inferencing in Llamafile: https://justine.lol/matmul/