Ollama

ollama/ollama last check 2026-06-18 23:02 UTC 191 releases recent

Notes

site

Release notes

v0.3.14 · 1y+

IBM and Ollama are partnering to bring Granite models!

New Models

Granite 3 MoE: The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.
Granite 3 Dense: The IBM Granite 2B and 8B models are designed to support tool-based use cases and support for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

Thank you @gabe-l-hart for contributing Granite support to Ollama!

What's Changed

Fix crashes for AMD GPUs with small system memory
Fix error that would occur on macOS 11 Big Sur
Fixed issue creating models from bf16 file types
Improve CPU performance by improving default thread counts
IBM granite/granitemoe architecture support by @gabe-l-hart in https://github.com/ollama/ollama/pull/6760

New Contributors

@JHubi1 made their first contribution in https://github.com/ollama/ollama/pull/6465
@gabe-l-hart made their first contribution in https://github.com/ollama/ollama/pull/6760

Full Changelog: https://github.com/ollama/ollama/compare/v0.3.13...v0.3.14