Ollama

ollama/ollama last check 2026-06-18 22:01 UTC 191 releases recent

Notes

Release notes

v0.6.3 · 1y+

What's Changed

New sliding window attention optimizations for Gemma 3, improving inference speed and memory allocation for long context windows.
Improved loading speed of Gemma 3
ollama create will now return the name of unsupported architectures
Fixed error talloc->buffer_id >= 0 when running a model
Fixed (int)sched->hash_set.size >= graph->n_nodes + graph->n_leafs error when running a model
ollama create will now automatically select the right template when importing Gemma 3 from safetensors
ollama show -v will now correctly render boolean values as true or false

@rylativity made their first contribution in https://github.com/ollama/ollama/pull/9874

Full Changelog: https://github.com/ollama/ollama/compare/v0.6.2...v0.6.3