V-JEPA 2

Meta's world model for physical world understanding

V-JEPA 2 is Meta's new world model, trained on video to understand and predict the physical world. It enables zero-shot robot planning and achieves state-of-the-art results on visual understanding benchmarks. The model, code, and new benchmarks are now openly available.

Top comment

Hi everyone!

V-JEPA 2 is Meta's new world model, a serious take on building AI that understands the physical world with the kind of intuition humans have. It's a foundational step towards what Meta calls Advanced Machine Intelligence (AMI).

It learns from over a million hours of video, not just static images, to build a sense of how things move, interact, and follow basic physics. This allows it to understand and predict what might happen next in a scene.

And this isn't just theory. It's being used for zero-shot robot planning, letting a robot pick up and move objects it has never seen before. That’s a very impressive demonstration of where this technology is headed.

On top of the model and code, Meta has also released three new benchmarks for physical reasoning, a great contribution that should help push the entire research community forward.