Iván Hernández Dalas: Ai2 says its Molmo 2 multimodal AI model can do more with less data
Ai2 said Molmo 2 improves on its earlier models despite its compact size. | Source: Ai2 The Allen Institute for AI, also known as Ai2, last week released Molmo 2, its latest multimodel suite capable of precise spatial and temporal understanding of video, image, and multi-image sets. Building on the first Molmo platform, Molmo 2 has advanced capabilities in video pointing, multi-frame reasoning, and object tracking. Molmo 2 is an 8B-parameter model that surpasses last year’s 72B-parameter Molmo in accuracy, temporal understanding, and pixel-level grounding. Ai2 said it also bests proprietary models like Gemini 3 on key emerging skills like video tracking. When it comes to image and multi-image reasoning, Ai2 claimed the Molmo 2 4B variant outperforms open models such as Qwen 3-VL-8B while using fewer parameters. Skills like these help the model, and any application or system built on top of it, to understand what is happening, where it is happening, and what it means. Molmo 2 is ...