Ollama

ollama/ollama last check 2026-06-19 02:01 UTC 191 releases recent

Notes

Release notes

v0.1.18 · 1y+

New models

TinyLlama: a compact 1.1B Llama model on 3 trillion tokens
OpenHermes 2: A 7B model, fine-tuned on Mistral with strong multi-turn chat skills and system prompt capabilities.
WizardCoder 33B: a new 33B state of the art code generation model: ollama run wizardcoder:33b
Dolphin Phi: a 2.7B uncensored model, based on the Phi language model by Microsoft Research

Added /? shortcuts help command to ollama run to list keyboard shortcuts
Improved performance when sending follow up messages in ollama run or via the API.
Fixed issues where certain 7B models would error on GPUs with 4GB of memory or less
Fixed issue where Llava model prompts couldn't start with a file path
Fixed issue where model would not be correctly reloaded if options or parameters changed between requests
Ollama will now automatically pull new models when running older ggml format models. If using custom ggml format models in a Modelfile, please import GGUF models instead.

Full Changelog: https://github.com/jmorganca/ollama/compare/v0.1.17...v0.1.18