Ollama: Tool Calling Gets Smarter

Four significant pull requests merged today focusing on tool calling improvements and system reliability. Jeffrey Morgan and Parth Sareen led major enhancements to Qwen3 and GLM parsers for better tool calling behavior, while Eva H fixed a critical Windows startup crash and Patrick Devine added peak memory usage visibility for MLX models.

2026-02-27T11:02:49Z

Duration: PT3M47S

Episode overview

This episode is a short developer briefing from Ollama.

It explains recent repository work in plain language.

Show: Ollama
Published: 2026-02-27T11:02:49Z
Audio duration: PT3M47S

Transcript excerpt

This excerpt keeps the crawler page concise. Listen to the episode or use the RSS feed for the full update.

Hey there, wonderful developers! Welcome back to another episode of the Ollama podcast. I'm so excited to share what's been happening in the codebase because today feels like one of those days where everything just clicks into place.

So grab your favorite beverage and let's dive into what the team accomplished yesterday. We had four fantastic pull requests merge, and honestly, there's a beautiful theme running through all of this work around making Ollama smarter and more reliable.

Let's start with the star of the show - Jeffrey Morgan's work on fixing Qwen3 tool calling in thinking mode. Now, if you've ever worked with language models that can use tools, you know how tricky the parsing can get, especially when the model is in that "thinking" state where it's reasoning through a problem.…

Building on that theme, Parth Sareen jumped in with some excellent work on stable tool call indexing for both GLM47 and Qwen3 parsers. This might sound like a small detail, but consistent indexing is absolutely crucial when you're dealing with multiple tool calls. Think about it - if your model wants to use three…

Now, switching gears to reliability, Eva H came through with a fix that Windows users…

A…

Nearby episodes from Ollama

Cloud Integration Drama and AI Model Expansion 2026-03-04T11:10:53Z
Smarter Sampling and Crash Prevention 2026-03-02T11:01:42Z
Building Bridges for Better Model Compatibility 2026-03-01T11:01:49Z
MLX Runner Gets Rock Solid 2026-02-28T11:03:09Z
Cleaner Shutdowns and Faster Startups 2026-02-26T11:03:23Z
Qwen 3.5 Architecture Lands with Safety Upgrades 2026-02-25T11:01:49Z
Memory Management Revolution 2026-02-24T11:06:08Z
Nemotron Architecture Lands with Unified Cache Vision 2026-02-23T11:03:10Z