Ollama

ollama/ollama last check 2026-07-02 20:01 UTC 193 releases recent

Notes

site

Release notes

v0.31.1 · recent

What's Changed

mlx: tighten up gemma4 moe loading code by @pdevine in https://github.com/ollama/ollama/pull/16964
mlx: bump to latest version to include new small batch matmul kernel @jessegross @dhiltgen
llama.cpp: bump to b9840 @dhiltgen
improved gemma4 MTP performance @jessegross

Full Changelog: https://github.com/ollama/ollama/compare/v0.31.0...v0.31.1