GLM-4.6V is GLM's newest open-source multimodal model with a 128k context window. It features native function calling, bridging visual perception with executable actions for complex agentic workflows like web search and coding.
Hi everyone!
GLM-4.6V is a significant iteration of the GLM multimodal series. It scales the training context window to 128k tokens and reaches SOTA visual understanding for its size.
The biggest update here is native Function Calling. For the first time in the GLM architecture, tool use is integrated directly into the visual model, which effectively bridges the gap from "visual perception" to "executable action."
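To make that concrete, here is a minimal sketch of multimodal function calling, assuming an OpenAI-compatible chat completions endpoint. The base URL, model id, and the `search_product` tool are placeholders for illustration, not the official API surface; check the docs for the real values.

```python
# Minimal sketch: vision input + tool definitions in one request.
# base_url, model id, and the tool below are illustrative assumptions.
from openai import OpenAI

client = OpenAI(base_url="https://example.com/v1", api_key="YOUR_API_KEY")

tools = [{
    "type": "function",
    "function": {
        "name": "search_product",  # hypothetical tool for the shopping example
        "description": "Search an online store for a product and return prices.",
        "parameters": {
            "type": "object",
            "properties": {
                "query": {"type": "string", "description": "Product keywords."}
            },
            "required": ["query"],
        },
    },
}]

response = client.chat.completions.create(
    model="glm-4.6v",  # assumed model id
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": "https://example.com/product.jpg"}},
            {"type": "text", "text": "Find this product online and compare prices."},
        ],
    }],
    tools=tools,
)

# When the model decides a tool is needed, it returns a structured call
# instead of (or alongside) plain text; your agent loop executes it and
# feeds the result back as a tool message.
for call in response.choices[0].message.tool_calls or []:
    print(call.function.name, call.function.arguments)
```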
The model can automatically generate high-quality interleaved image-text content and handle complete workflows on its own, like browsing products, comparing prices, and generating a shopping list. The frontend replication and visual interaction capabilities are also impressive, and they significantly shorten the path from design to code for developers.
Try it on Z.ai or find the open weights on HF.
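If you'd rather run the open weights locally, here is a rough sketch using transformers. The repo id, the auto classes, and the chat-template call are assumptions on my part; the model card on HF will have the exact, up-to-date snippet.

```python
# Rough local-inference sketch; repo id and loading classes are assumed,
# verify against the HF model card before using.
import torch
from transformers import AutoProcessor, AutoModelForImageTextToText

model_id = "zai-org/GLM-4.6V"  # assumed repo id
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForImageTextToText.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [{
    "role": "user",
    "content": [
        {"type": "image", "url": "https://example.com/screenshot.png"},
        {"type": "text", "text": "Describe the layout of this page."},
    ],
}]

# Build model inputs from the multimodal chat messages.
inputs = processor.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=True,
    return_dict=True, return_tensors="pt"
).to(model.device)

output = model.generate(**inputs, max_new_tokens=256)
# Decode only the newly generated tokens.
print(processor.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```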