Ollama: Bug Squashing Bonanza

Today's episode covers six important fixes that landed in Ollama, including a crucial token counting bug that was shortchanging users by one token, improvements to error messaging for cloud models, and several deep fixes to the Qwen3 model implementation. Plus, we see a major revert of Claude integration improvements, showing how even the best teams sometimes need to step back and reassess.

2026-02-05T11:07:27Z

Duration: PT3M54S

Episode overview

This episode is a short developer briefing from Ollama.

It explains recent repository work in plain language.

Show: Ollama

Transcript excerpt

This excerpt keeps the crawler page concise. Listen to the episode or use the RSS feed for the full update.

Hey there, wonderful developers! Welcome back to another episode of the Ollama podcast. I'm your host, and wow, do we have a story of dedication and polish for you today. February 5th brought us six merged pull requests that really showcase the kind of meticulous attention to detail that makes software truly great.

Let's dive right into today's main event, because this is one of those days where the fixes might seem small on the surface, but they represent the kind of craftsmanship that users absolutely notice.

First up, Jesse Gross tackled what I'm calling the "one token heist", and honestly, this is a perfect example of why thorough testing matters. Users were getting shortchanged by exactly one token when they set a prediction limit. Imagine asking for 100 tokens and only getting 99! The bug was happening because…

Next, Bruce MacDonald improved the user experience in a really thoughtful way. You know how frustrating it is when you get a cryptic "401 Unauthorized" error? Well, Bruce made sure that when you're trying to use cloud models but aren't signed in, you get a helpful message that actually tells you what to do. It's one…
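As a rough illustration of the idea described above (turning a bare 401 into an actionable message), here is a minimal Go sketch. The function name `friendlyError`, the message wording, and the model name are all hypothetical and chosen for this example only; the excerpt does not show how Ollama actually implements the change.

```go
package main

import (
	"fmt"
	"net/http"
)

// friendlyError is a hypothetical helper, not Ollama's actual code.
// It maps an HTTP status code from a cloud model request to a message
// that tells the user what to do next.
func friendlyError(statusCode int, model string) string {
	if statusCode == http.StatusUnauthorized {
		// Assumed wording: the real message may differ.
		return fmt.Sprintf("model %q requires you to be signed in; sign in and try again", model)
	}
	// For other failures, fall back to the status code and its standard text.
	return fmt.Sprintf("request for model %q failed: %d %s", model, statusCode, http.StatusText(statusCode))
}

func main() {
	fmt.Println(friendlyError(http.StatusUnauthorized, "example-cloud-model"))
	fmt.Println(friendlyError(http.StatusInternalServerError, "example-cloud-model"))
}
```

The design point is simply that the check happens where the status code is known, so the user sees guidance instead of a raw "401 Unauthorized".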

Now, here's where today's story gets really…

But…

Nearby episodes from Ollama

MLX Runner Gets Major Model Upgrades 2026-02-16T11:05:02Z
MLX Performance Breakthrough and Anthropic Search 2026-02-14T11:01:55Z
MLX Runner Revolution and Documentation Polish 2026-02-11T11:02:50Z
Refactoring Rollercoaster and Developer Experience Wins 2026-02-06T11:07:41Z
Smooth Onboarding for New Users 2026-02-02T11:01:39Z
Polish and Perfectionism - The Art of Getting the Details Right 2026-01-30T11:01:45Z
Cleaning Up the Config Game 2026-01-28T11:01:39Z
Speed Boost and Model Magic 2026-01-25T11:04:03Z