Product upvotes vs the next 3

Waiting for data. Loading

Product comments vs the next 3

Waiting for data. Loading

Product upvote speed vs the next 3

Waiting for data. Loading

Product upvotes and comments

Waiting for data. Loading

Product vs the next 3

Loading

InternVL3

Open MLLMs excelling in vision, reasoning & long context

Open MLLM family (1B-78B) from OpenGVLab. Excels at vision, reasoning, long context & agents via native multimodal pre-training. Outperforms base LLMs on text tasks.

Top comment

Hi everyone!

Check out InternVL3 from OpenGVLab – a new family of open vision-language models.

They used a training approach mixing vision and text data from the start, which reportedly leads to strong performance in both understanding images/video and handling text tasks well.

These models show good reasoning abilities and can handle long inputs. The weights and code are openly available.

You can experience these model capabilities directly on their Chat Web and HF Space.