Release Watcher - Ollama v0.13.2

Release list
0.15.4	2026-02-01 RECENT
0.15.3	2026-02-01 RECENT
0.15.2	2026-01-27
0.15.1	2026-01-24
0.15.0	2026-01-21
0.14.3	2026-01-16
0.14.2	2026-01-16
0.14.1	2026-01-14
0.14.0	2026-01-10
0.13.5	2025-12-18
0.13.4	2025-12-13
0.13.3	2025-12-09
0.13.2	2025-12-04
0.13.1	2025-11-27
0.13.0	2025-11-19
0.12.11	2025-11-12
0.12.10	2025-11-05
0.12.9	2025-10-31
0.12.8	2025-10-30
0.12.7	2025-10-29

Release list

0.15.4

2026-02-01

RECENT

0.15.3

2026-02-01

RECENT

0.15.2

2026-01-27

0.15.1

2026-01-24

0.15.0

2026-01-21

0.14.3

2026-01-16

0.14.2

2026-01-16

0.14.1

2026-01-14

0.14.0

2026-01-10

0.13.5

2025-12-18

0.13.4

2025-12-13

0.13.3

2025-12-09

0.13.2

2025-12-04

0.13.1

2025-11-27

0.13.0

2025-11-19

0.12.11

2025-11-12

0.12.10

2025-11-05

0.12.9

2025-10-31

0.12.8

2025-10-30

0.12.7

2025-10-29

What's Changed

Flash attention is now enabled by default for vision models such as mistral-3, gemma3, qwen3-vl and more. This improves memory utilization and performance when providing images as input.
Fixed GPU detection on multi-GPU CUDA machines
Fixed issue where deepseek-v3.1 would always think even with thinking is disabled in Ollama's app

New Contributors

@chengcheng84 made their first contribution in https://github.com/ollama/ollama/pull/13265
@nathan-hook made their first contribution in https://github.com/ollama/ollama/pull/13256

Full Changelog: https://github.com/ollama/ollama/compare/v0.13.1...v0.13.2-rc0