Ollama
Release notes
v0.6.3
What's Changed
- New sliding window attention optimizations for Gemma 3, improving inference speed and memory allocation for long context windows.
- Improved loading speed of Gemma 3
- `ollama create` will now return the name of unsupported architectures
- Fixed error `talloc->buffer_id >= 0` when running a model
- Fixed `(int)sched->hash_set.size >= graph->n_nodes + graph->n_leafs` error when running a model
- `ollama create` will now automatically select the right template when importing Gemma 3 from safetensors
- `ollama show -v` will now correctly render boolean values as `true` or `false`
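For readers unfamiliar with the sliding window attention mentioned in the Gemma 3 item above, the following is a minimal, generic sketch of the idea and not Ollama's implementation. The function names and the window size of 3 are illustrative assumptions. Each query position attends only to itself and the most recent earlier positions inside the window, rather than to the entire prefix.

```python
def sliding_window_mask(seq_len: int, window: int) -> list[list[bool]]:
    """Build a causal sliding-window mask (hypothetical helper for illustration).

    mask[i][j] is True when query position i may attend to key position j,
    i.e. when j is one of the `window` most recent positions up to and including i.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    return [
        [i - window < j <= i for j in range(seq_len)]
        for i in range(seq_len)
    ]


def attended_pairs(mask: list[list[bool]]) -> int:
    """Count the (query, key) pairs that the mask allows."""
    return sum(sum(row) for row in mask)


# A window as long as the sequence reduces to an ordinary causal mask.
full = sliding_window_mask(8, 8)
# A short window (3 is an arbitrary choice) restricts each query to nearby keys.
local = sliding_window_mask(8, 3)

print(attended_pairs(full), attended_pairs(local))
```

Because a windowed layer never looks further back than the window, the keys and values it needs to retain are bounded by the window size instead of the full context length, which is the general reason this technique can reduce memory use on long contexts.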
New Contributors
- @rylativity made their first contribution in https://github.com/ollama/ollama/pull/9874
Full Changelog: https://github.com/ollama/ollama/compare/v0.6.2...v0.6.3