Ollama
Notes
Release notes
v0.31.1
· recent
What's Changed
- mlx: tighten up gemma4 moe loading code by @pdevine in https://github.com/ollama/ollama/pull/16964
- mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen
- llama.cpp: bump to b9840 @dhiltgen
- improved gemma4 MTP performance @jessegross
Full Changelog: https://github.com/ollama/ollama/compare/v0.31.0...v0.31.1