Ollama: MLX Runner Revolution and Documentation Polish

Today we're diving into a massive infrastructure upgrade with Patrick Devine's new MLX runner implementation, bringing method-based bindings and GLM4-MoE-Lite model support in nearly 15,000 lines of new code. We also saw great community contributions improving documentation with integration overviews, FAQ fixes, and context length updates.

2026-02-11T11:02:50Z

Duration: PT4M2S

Episode overview

This episode is a short developer briefing from Ollama.

It explains recent repository work in plain language.

Show: Ollama

Transcript excerpt

This excerpt keeps the crawler page concise. Listen to the episode or use the RSS feed for the full update.

Hey there, fellow developers! Welcome back to another episode of the Ollama podcast. I'm your host, and wow, do we have an exciting day to unpack together. Grab your favorite beverage because we're diving into some seriously cool infrastructure work that's going to change how Ollama handles certain model types.

So picture this - you're building something amazing, and you realize you need a completely new way to run models efficiently. That's exactly what Patrick Devine tackled, and the result is absolutely stunning. We just saw the merge of not one, but two massive pull requests that introduce an entirely new MLX runner to…

Let's start with the foundation. Patrick's first PR laid the groundwork with safetensors quantization specifically for MLX. Now, I know "safetensors quantization" might sound intimidating, but think of it like this - imagine you're reorganizing your entire filing system to make everything faster and more efficient.…

But here's where it gets really exciting - the second PR built an entire MLX runner on top of that foundation. We're talking about method-based MLX bindings, a subprocess-based runner, KV cache with tree management, and a basic sampler. Patrick…

Nearby episodes from Ollama

Editor Integration Revolution 2026-02-18T11:05:24Z
MLX Display Bug Squashing Day 2026-02-17T11:02:23Z
MLX Runner Gets Major Model Upgrades 2026-02-16T11:05:02Z
MLX Performance Breakthrough and Anthropic Search 2026-02-14T11:01:55Z
Refactoring Rollercoaster and Developer Experience Wins 2026-02-06T11:07:41Z
Bug Squashing Bonanza 2026-02-05T11:07:27Z
Smooth Onboarding for New Users 2026-02-02T11:01:39Z
Polish and Perfectionism - The Art of Getting the Details Right 2026-01-30T11:01:45Z