Ollama

ollama/ollama last check 2026-06-18 21:01 UTC 191 releases recent

Notes

site

Release notes

v0.7.1 · 1y+

What's Changed

Improved model memory management to allocate sufficient memory to prevent crashes when running multimodal models in certain situations
Enhanced memory estimation for models to prevent unintended memory offloading
ollama show will now show ... when data is truncated
Fixed crash that would occur with qwen2.5vl
Fixed crash on Nvidia's CUDA for llama3.2-vision
Support for Alibaba's Qwen 3 and Qwen 2 architectures in Ollama's new multimodal engine

New Contributors

@ronxldwilson made their first contribution in https://github.com/ollama/ollama/pull/10763
@DarkCaster made their first contribution in https://github.com/ollama/ollama/pull/10779

Full Changelog: https://github.com/ollama/ollama/compare/v0.7.0...v0.7.1