Ollama

ollama/ollama last check 191 releases recent
Notes
Release notes
v0.3.14 · 1y+
view on github

IBM and Ollama are partnering to bring Granite models!

New Models

  • Granite 3 MoE: The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.
  • Granite 3 Dense: The IBM Granite 2B and 8B models are designed to support tool-based use cases and support for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.

Thank you @gabe-l-hart for contributing Granite support to Ollama!

What's Changed

  • Fix crashes for AMD GPUs with small system memory
  • Fix error that would occur on macOS 11 Big Sur
  • Fixed issue creating models from bf16 file types
  • Improve CPU performance by improving default thread counts
  • IBM granite/granitemoe architecture support by @gabe-l-hart in https://github.com/ollama/ollama/pull/6760

New Contributors

Full Changelog: https://github.com/ollama/ollama/compare/v0.3.13...v0.3.14