New Model Support and Memory Management Wins

Today brought some exciting developments with GLM-4.7-Flash model support landing alongside crucial memory management fixes for image generation models. Jeffrey Morgan led the charge with major architecture additions, while the team also tackled integration improvements and API consistency issues that'll make life easier for downstream developers.

Published: 2026-01-20T11:03:12Z

Duration: PT4M7S

Episode overview

This episode is a short developer briefing from Ollama.

It explains recent repository work in plain language.

Show: Ollama

Transcript excerpt

This excerpt keeps the crawler page concise. Listen to the episode or use the RSS feed for the full update.

Hey there, fellow developers! Welcome back to another episode of the Ollama podcast. I'm your host, and wow, do we have some fascinating changes to dive into today from January 20th. Grab your favorite beverage because we're about to explore some really cool model architecture work and some clever performance…

Let's start with the big story of the day - Jeffrey Morgan just merged support for a brand new model architecture called GLM-4.7-Flash. This is huge! We're talking about over 2,500 lines of code changes across 17 files to add the `Glm4MoeLiteForCausalLM` architecture. Now, if you're wondering what makes this…

The implementation includes new converters, model definitions, and parsers - it's like watching a well-orchestrated symphony of code organization. And here's a fun detail: Jeffrey mentioned they should probably rename the renderer and parser to "glm47" to keep things consistent. These kinds of naming considerations…

Now, speaking of smart optimizations, Jeffrey also tackled a really clever performance issue around image generation models. Picture this scenario - you're trying to delete a model with `ollama rm`, but the system accidentally loads the entire model into…

Nearby episodes from Ollama

Memory Magic and Command Makeover 2026-01-24T11:07:21Z
Making Ollama Play Nice with Everyone 2026-01-23T11:05:03Z
The Great Cleanup - Manifests Get Their Own Home 2026-01-22T11:03:05Z
New Model Architecture and Image Generation Fixes 2026-01-21T11:05:07Z
FLUX.2 Image Generation Arrives 2026-01-19T11:25:28Z
Image Generation Goes Native and Parser Cleanup Magic 2026-01-18T11:01:44Z
Dynamic Loading and Experimental Models Take Center Stage 2026-01-17T11:03:17Z
Release Day Rescue Mission 2026-01-14T16:36:41Z