Ollama: Precision Revolution - New Float Formats and Testing Powerhouse

The Ollama team delivered three improvements focused on precision and testing. Patrick Devine added support for three low-precision float formats (mxfp4, mxfp8, nvfp4) that store model weights in fewer bits, while Daniel Hiltgen extended the testing infrastructure with individual model testing and stress tests for vision and tool calling. A Windows CI fix rounds out the day's changes.

Published: 2026-03-25T10:04:04Z

Duration: PT4M3S

Episode overview

This episode is a short developer briefing from Ollama.

It explains recent repository work in plain language.

Show: Ollama

Transcript excerpt

Hey there, fellow code explorers! Welcome back to another episode of the Ollama podcast. I'm your host, and wow, do we have some exciting updates to dive into today. Grab your favorite beverage because we're talking about some seriously cool advances in model precision and testing infrastructure.

You know what I love about today's updates? They're the perfect example of how great software evolves - we've got cutting-edge research meeting rock-solid engineering practices. Let me paint you the picture of what happened.

First up, Patrick Devine just landed something that's going to make AI enthusiasts everywhere do a little happy dance. We're talking about support for three new floating-point formats: mxfp4, mxfp8, and nvfp4. Now, if those sound like alphabet soup to you, here's the beautiful story behind them.

Think of these formats as different ways to store numbers in your models, kind of like choosing between different sized containers for your ingredients. The magic here is that Patrick's work lets you import models in bf16 format - that's bfloat16, which is already pretty efficient - and then convert them to these…

What makes this even cooler is the direct fp8 to mxfp8 conversion…
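To make the "different sized containers" idea concrete, here is a minimal Python sketch of block-scaled 4-bit quantization in the style of mxfp4. It is an illustration only, not Ollama's implementation: the function names `quantize_block` and `dequantize_block` are hypothetical, the layout (32 elements sharing one power-of-two scale, each element stored as a 4-bit E2M1 value) follows the published Microscaling format as an assumption about what mxfp4 means here, and rounding is simplified to nearest-value with ties going to the smaller magnitude.

```python
import math

# Magnitudes a 4-bit E2M1 element can hold (1 sign, 2 exponent, 1 mantissa bit).
E2M1_VALUES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
E2M1_EMAX = 2    # exponent of the largest magnitude: 6.0 = 1.5 * 2**2
BLOCK_SIZE = 32  # number of elements that share one scale (assumed layout)


def quantize_block(block):
    """Return (shared_exponent, elements) for one block of floats.

    Hypothetical helper: elements are kept as Python floats holding
    E2M1-representable values rather than packed 4-bit codes.
    """
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return 0, [0.0] * len(block)
    # frexp gives max_abs = m * 2**e with 0.5 <= m < 1, so floor(log2) is e - 1.
    _, e = math.frexp(max_abs)
    shared_exp = (e - 1) - E2M1_EMAX
    scale = 2.0 ** shared_exp
    elements = []
    for x in block:
        # Scale into the element range and clamp to the largest magnitude.
        scaled = min(abs(x) / scale, E2M1_VALUES[-1])
        # Pick the nearest representable magnitude (ties go to the smaller one).
        nearest = min(E2M1_VALUES, key=lambda v: abs(v - scaled))
        elements.append(math.copysign(nearest, x))
    return shared_exp, elements


def dequantize_block(shared_exp, elements):
    """Rebuild approximate floats from the shared exponent and elements."""
    scale = 2.0 ** shared_exp
    return [q * scale for q in elements]
```

Under these assumptions, a block of 32 values costs 32 x 4 bits plus one 8-bit shared scale, or 4.25 bits per value, compared with 16 bits per value in bf16. The trade-off is that every value in a block is rounded to one of only eight magnitudes relative to the block's scale.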

Nearby episodes from Ollama

Smoothing the Launch Experience - 2026-03-29T10:00:54Z
Fixing the Inconsistencies That Matter - 2026-03-28T10:11:50Z
Smart Caching and Better User Experience - 2026-03-27T10:11:09Z
VS Code Integration Takes Center Stage - 2026-03-26T10:11:22Z
MLX Performance Breakthrough and Smarter Caching - 2026-03-24T10:04:16Z
Nvidia Partnership Takes Center Stage - 2026-03-21T10:02:43Z
Bug Squashing Bonanza - 2026-03-20T10:03:36Z
The Caching Revolution - 2026-03-19T10:04:52Z