Ollama

Ollama: Model Integration and Windows System Improvements

Today's development focused on expanding model support with Gemma4-12b and DeepSeek OCR-2 integration, while addressing critical Windows system issues including sleep prevention during inference and improved cleanup processes.

Duration: PT2M21S

https://podlog.io/listen/ollama-3aed006f/episode/ollama-model-integration-and-windows-system-improvements-b740c92f

Transcript

Good morning. This is your Ollama developer briefing for June 4th, 2026.

The biggest theme today is expanding model capabilities. The team merged support for Gemma4-12b models in PR 16457, which required updates to the renderer resolution and create commands. A more substantial addition is the proposed DeepSeek OCR-2 integration in PR 16503, which introduces a complex three-stage vision encoder combining SAM ViT-B, Qwen2, and projector components with a DeepSeek MoE text decoder. This required a new GGUF converter and compatibility layer handlers.

Windows users will see significant system-level improvements. PR 16468 addresses a long-standing issue where Windows machines could sleep during inference, interrupting generation. The fix uses Windows execution state APIs to keep systems awake during processing. Additionally, PR 16458 improves cleanup by ensuring llama-server processes are properly terminated during Windows installer and app cleanup, preventing orphaned processes.

The development pipeline received important maintenance updates. The team bumped llama.cpp to version b9493 in PR 16463, refreshing compatibility patches. PR 16477 fixed a critical "unknown projector type" crash in gemma4 patch wiring, and PR 16473 enabled Metal GPU offload for projectors on macOS.

Several focused fixes improve the developer experience. PR 16469 resolved an issue where the "requires" command in Modelfiles was being ignored for MLX-based models. PR 16465 added support for tilde home directory paths in image file inputs, and PR 16509 cleaned up empty usage fields in completion stream chunks.

Looking ahead, these changes strengthen Ollama's model ecosystem while improving system reliability, particularly on Windows. The DeepSeek OCR integration, if merged, will significantly expand vision processing capabilities.

That's your update for today. Keep building.