Ollama

ollama/ollama last check 2026-06-18 22:01 UTC 191 releases recent

Notes

Release notes

v0.6.6 · 1y+

New models

IBM Granite 3.3: 2B and 8B models with 128K context length that have been fine-tuned for improved reasoning and instruction-following capabilities.
DeepCoder: a fully open-Source 14B coder model at O3-mini level, with a 1.5B version also available.

New, faster model downloading: OLLAMA_EXPERIMENT=client2 ollama serve will run Ollama using a new downloader with improved performance and reliability when running ollama pull. Please share feedback here!
Fixed memory leak issues when running Gemma 3, Mistral Small 3.1 and other models on Ollama
Improved performance of ollama create when importing models from Safetensors
Ollama will now allow tool function parameters with either a single type or an array of types by @rozgo
Fixed certain out of memory issues from not reserving enough memory at startup
Fix nondeterministic model unload order by @IreGaddr
Include the items and $defs fields to properly handle array types in the API by @sheffler
OpenAI-Beta headers are now included in the CORS safelist by @drifkin
Fixed issue where model tensor data would be corrupted when importing models from Safetensors

@drifkin made their first contribution in https://github.com/ollama/ollama/pull/10169
@rozgo made their first contribution in https://github.com/ollama/ollama/pull/9434
@qwerty108109 made their first contribution in https://github.com/ollama/ollama/pull/10168
@IreGaddr made their first contribution in https://github.com/ollama/ollama/pull/10185
@sheffler made their first contribution in https://github.com/ollama/ollama/pull/10091

Full Changelog: https://github.com/ollama/ollama/compare/v0.6.5...v0.6.6