Ollama

ollama/ollama last check 191 releases recent
Notes
Release notes
v0.1.18 · 1y+
view on github

New models

  • TinyLlama: a compact 1.1B Llama model on 3 trillion tokens
  • OpenHermes 2: A 7B model, fine-tuned on Mistral with strong multi-turn chat skills and system prompt capabilities.
  • WizardCoder 33B: a new 33B state of the art code generation model: ollama run wizardcoder:33b
  • Dolphin Phi: a 2.7B uncensored model, based on the Phi language model by Microsoft Research

What's Changed

  • Added /? shortcuts help command to ollama run to list keyboard shortcuts
  • Improved performance when sending follow up messages in ollama run or via the API.
  • Fixed issues where certain 7B models would error on GPUs with 4GB of memory or less
  • Fixed issue where Llava model prompts couldn't start with a file path
  • Fixed issue where model would not be correctly reloaded if options or parameters changed between requests
  • Ollama will now automatically pull new models when running older ggml format models. If using custom ggml format models in a Modelfile, please import GGUF models instead.

New Contributors

Full Changelog: https://github.com/jmorganca/ollama/compare/v0.1.17...v0.1.18