Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mi
Python6722apache-2.0
2 days ago
gpullmpytorch
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language mo
Go98471mit
gemmagemma2go