Ollama: Thinking Streams and Local Tool Power-ups

The Ollama team delivered three solid improvements focusing on AI streaming capabilities and local model empowerment. ParthSareen tackled the complex challenge of properly splitting mixed thinking streams in OpenAI compatibility, while Eva H unlocked web search capabilities for local tool-enabled models, removing cloud-only restrictions.

2026-03-12T10:06:42Z

Duration: PT3M59S

Episode overview

This episode is a short developer briefing from Ollama.

It explains recent repository work in plain language.

Show: Ollama
Published: 2026-03-12T10:06:42Z
Audio duration: PT3M59S

Transcript excerpt

This excerpt keeps the crawler page concise. Listen to the episode or use the RSS feed for the full update.

Hey there, code crafters! Welcome back to another episode of the Ollama podcast. I'm your host, and wow, do we have some thoughtful improvements to dig into today from March 11th. Grab your favorite beverage because we're talking about some really clever problem-solving that happened in the codebase.

Let's jump right into our main story today, and it's all about making AI interactions smoother and more powerful. We had three merged pull requests that really show the team thinking deeply about user experience and capability expansion.

First up, ParthSareen tackled something that sounds simple but is actually quite nuanced - splitting mixed thinking stream chunks in the OpenAI compatibility layer. Now, if you've worked with streaming AI responses, you know that sometimes you get this mix of the AI's "thinking" process along with actual content or…

Next, Eva H delivered something I'm really excited about - enabling local tool models to perform web searches. Before this change, web search was locked behind a cloud-model-only guard in the Anthropic middleware. Eva simply removed those artificial barriers, and now your local models with tool support can search…

Our third merge came from…

Wha…

Nearby episodes from Ollama

The Caching Revolution 2026-03-19T10:04:52Z
Bug Squashing and Launch Improvements 2026-03-16T00:00:00Z
Launch Command Gets a Major Polish 2026-03-14T10:11:48Z
Spring Cleaning and Performance Gains 2026-03-13T10:04:50Z
Stability First - Error Handling and Performance Fixes 2026-03-11T10:02:32Z
MLX Gets a Major Upgrade and Web Search Goes Live 2026-03-10T10:05:52Z
Simplifying the Sampling Story 2026-03-08T10:03:36Z
Cloud Models Get Smarter & Build Performance Boost 2026-03-07T11:18:50Z