Ollama

ollama/ollama last check 2026-06-19 01:01 UTC 191 releases recent

Notes

site

Release notes

v0.1.39 · 1y+

view on github

New models

Cohere Aya 23: A new state-of-the-art, multilingual LLM covering 23 different languages.
Mistral 7B 0.3: A new version of Mistral 7B with initial support for function calling.
Phi-3 Medium: a 14B parameters, lightweight, state-of-the-art open model by Microsoft.
Phi-3 Mini 128K and Phi-3 Medium 128K: versions of the Phi-3 models that support a context window size of 128K
Granite code: A family of open foundation models by IBM for Code Intelligence

Llama 3 import

It is now possible to import and quantize Llama 3 and its finetunes from Safetensors format to Ollama.

First, clone a Hugging Face repo with a Safetensors model:

git clone https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct
cd Meta-Llama-3-8B-Instruct

Next, create a Modelfile:

FROM .

TEMPLATE &quot;&quot;&quot;{{ if .System }}&lt;|start_header_id|&gt;system&lt;|end_header_id|&gt;

{{ .System }}&lt;|eot_id|&gt;{{ end }}{{ if .Prompt }}&lt;|start_header_id|&gt;user&lt;|end_header_id|&gt;

{{ .Prompt }}&lt;|eot_id|&gt;{{ end }}&lt;|start_header_id|&gt;assistant&lt;|end_header_id|&gt;

{{ .Response }}&lt;|eot_id|&gt;&quot;&quot;&quot;

PARAMETER stop &lt;|start_header_id|&gt;
PARAMETER stop &lt;|end_header_id|&gt;
PARAMETER stop &lt;|eot_id|&gt;

Then, create and quantize a model:

ollama create --quantize q4_0 -f Modelfile my-llama3 
ollama run my-llama3

What's Changed

Fixed issues where wide characters such as Chinese, Korean, Japanese and Russian languages.
Added new OLLAMA_NOHISTORY=1 environment variable that can be set to disable history when using ollama run
New experimental OLLAMA_FLASH_ATTENTION=1 flag for ollama serve that improves token generation speed on Apple Silicon Macs and NVIDIA graphics cards
Fixed error that would occur on Windows running ollama create -f Modelfile
ollama create can now create models from I-Quant GGUF files
Fixed EOF errors when resuming downloads via ollama pull
Added a Ctrl+W shortcut to ollama run

New Contributors

@rapmd73 made their first contribution in https://github.com/ollama/ollama/pull/4467
@sammcj made their first contribution in https://github.com/ollama/ollama/pull/4120
@likejazz made their first contribution in https://github.com/ollama/ollama/pull/4535

Full Changelog: https://github.com/ollama/ollama/compare/v0.1.38...v0.1.39