Ollama

ollama/ollama last check 191 releases recent
Notes
Release notes
v0.12.8 · 6m+
view on github

<img width="512" height="512" alt="Ollama_halloween_background" src="https://github.com/user-attachments/assets/ac1f37c5-c81a-446f-8e99-97ef5ebd7d05" />

What's Changed

  • qwen3-vl performance improvements, including flash attention support by default
  • qwen3-vl will now output less leading whitespace in the response when thinking
  • Fixed issue where deepseek-v3.1 thinking could not be disabled in Ollama's new app
  • Fixed issue where qwen3-vl would fail to interpret images with transparent backgrounds
  • Ollama will now stop running a model before removing it via ollama rm
  • Fixed issue where prompt processing would be slower on Ollama's engine
  • Ignore unsupported iGPUs when doing device discovery on Windows

New Contributors

Full Changelog: https://github.com/ollama/ollama/compare/v0.12.7...v0.12.8