Ollama

Release notes
v0.6.3

What's Changed

  • New sliding window attention optimizations for Gemma 3, improving inference speed and memory allocation for long context windows.
  • Improved loading speed of Gemma 3
  • ollama create will now return the name of unsupported architectures
  • Fixed the error "talloc->buffer_id >= 0" when running a model
  • Fixed the error "(int)sched->hash_set.size >= graph->n_nodes + graph->n_leafs" when running a model
  • ollama create will now automatically select the right template when importing Gemma 3 from safetensors
  • ollama show -v will now correctly render boolean values as true or false
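
For readers unfamiliar with the sliding window attention mentioned in the first item, the idea can be sketched in a few lines. This is only an illustration of the general technique, not Ollama's or Gemma 3's actual implementation; the function names and the window size below are hypothetical and chosen for readability.

```python
# Illustrative sketch of sliding window attention (not Ollama's code).
# Assumption: a causal model where each token attends only to itself and
# the (window - 1) tokens before it.

def sliding_window_mask(seq_len, window):
    """Return a seq_len x seq_len boolean mask.

    mask[i][j] is True when query position i may attend to key position j,
    i.e. when j is not in the future and is within the last `window` tokens.
    """
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


def cached_positions(seq_len, window):
    """Number of key/value entries such a layer needs for the newest token."""
    return min(seq_len, window)


mask = sliding_window_mask(6, 3)
for row in mask:
    # 'x' marks an allowed key position, '.' a masked one.
    print("".join("x" if allowed else "." for allowed in row))
```

Because each row of the mask contains at most `window` allowed positions, a layer of this kind only needs to keep the most recent `window` keys and values rather than the full context, which is the general reason the technique can reduce memory use for long context windows.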

Full Changelog: https://github.com/ollama/ollama/compare/v0.6.2...v0.6.3