Ollama

Release notes: 0.12.10

ollama run now works with embedding models

ollama run can now run embedding models to generate vector embeddings from text:

ollama run embeddinggemma "Hello world"

Content can also be provided to ollama run via standard input:

echo "Hello world" | ollama run embeddinggemma
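The standard-input form lends itself to scripting. The following is a minimal sketch, not an official workflow: it assumes the ollama CLI is installed and the embeddinggemma model has already been pulled. The function name embed_lines, the MODEL variable, and the embedding_N.txt output files are hypothetical names chosen for illustration. The sketch makes no assumption about the format of the printed embedding and stores whatever ollama run writes to standard output.

```shell
#!/bin/sh
# Sketch: embed each non-empty line of a text file with `ollama run`.
# Assumption: the `ollama` CLI is on PATH and the model is already pulled.
MODEL="${MODEL:-embeddinggemma}"

# embed_lines INPUT_FILE OUTPUT_DIR   (hypothetical helper, not part of Ollama)
# Writes one file per non-empty input line and prints the number of lines embedded.
embed_lines() {
    input="$1"
    outdir="$2"
    mkdir -p "$outdir"
    n=0
    # `|| [ -n "$line" ]` also handles a final line without a trailing newline.
    while IFS= read -r line || [ -n "$line" ]; do
        # Skip blank lines.
        [ -z "$line" ] && continue
        n=$((n + 1))
        # Provide the text on standard input, as in the release note above.
        printf '%s\n' "$line" | ollama run "$MODEL" > "$outdir/embedding_$n.txt"
    done < "$input"
    echo "$n"
}
```

For example, embed_lines notes.txt embeddings would write embeddings/embedding_1.txt, embeddings/embedding_2.txt, and so on, one per non-empty line of notes.txt.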

What's Changed

  • Fixed errors when running qwen3-vl:235b and qwen3-vl:235b-instruct
  • Enabled flash attention for Vulkan (currently requires building from source)
  • Added Vulkan memory detection for Intel GPUs using DXGI+PDH
  • Ollama will now return tool call IDs from the /api/chat API
  • Fixed a hang caused by CPU discovery
  • Ollama will now show login instructions when switching to a cloud model in interactive mode
  • Fixed reading of stale VRAM data
  • ollama run now works with embedding models

New Contributors

  • @ryanycoleman made their first contribution in https://github.com/ollama/ollama/pull/11740
  • @Rajathbail made their first contribution in https://github.com/ollama/ollama/pull/12929
  • @virajwad made their first contribution in https://github.com/ollama/ollama/pull/12664
  • @AXYZdong made their first contribution in https://github.com/ollama/ollama/pull/8601

Full Changelog: https://github.com/ollama/ollama/compare/v0.12.9...v0.12.10
