Ollama
Release notes
v0.17.5
What's Changed
- Fixed crash in Qwen 3.5 models when split over GPU & CPU
- Fixed issue where Qwen 3.5 models would repeat themselves due to no presence penalty (note: you may have to redownload the Qwen 3.5 models, for example with `ollama pull qwen3.5:35b`)
- `ollama run --verbose` will now show peak memory usage when using Ollama's MLX engine
- Fixed memory issues and crashes in MLX runner
- Fixed issue where Ollama would not be able to run models imported from Qwen3.5 GGUF files
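
As a rough sketch of the redownload and verbose steps mentioned above, something like the following may be enough. The tag `qwen3.5:35b` is the example from the note; the prompt text and the PATH check are illustrative assumptions, and peak memory usage is reported only when Ollama's MLX engine is in use.

```shell
# Model tag from the release note's example; substitute the tag you actually use.
MODEL="qwen3.5:35b"

if command -v ollama >/dev/null 2>&1; then
    # Re-pull so the local copy picks up the updated model
    # (the note says a redownload may be needed for the repetition fix).
    ollama pull "$MODEL"

    # --verbose prints run statistics after the response; the prompt is illustrative.
    ollama run --verbose "$MODEL" "Say hello in one sentence."
else
    # Assumption: ollama is expected on PATH; skip quietly otherwise.
    echo "ollama not found on PATH; skipping" >&2
fi
```

If responses no longer loop after the pull, the local copy was most likely the stale one.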
Full Changelog: https://github.com/ollama/ollama/compare/v0.17.4...v0.17.5