AllenAI: Molmo2 8B

Developed by allenai

API Pricing (per 1M tokens)

Input: $0.20
Output: $0.20
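
As a rough illustration of how the per-1M-token prices translate into a per-request cost, here is a minimal sketch. The helper name `estimate_cost_usd` and the constant names are hypothetical and chosen for this example; only the two $0.20 prices come from the listing. Actual billing may differ, for instance in how image and video inputs are counted as tokens.

```python
# Prices taken from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 0.20


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: estimate the cost of one request in USD.

    Assumes cost is linear in token count, with no minimum charge
    or per-request fee.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 30,000-token prompt with a 2,000-token reply.
    print(f"${estimate_cost_usd(30_000, 2_000):.4f}")
```

Because input and output are priced the same here, the estimate depends only on the total token count: 1M input tokens plus 1M output tokens come to about $0.40.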

Intelligence Benchmarks

MMLU Pro: N/A
GPQA: N/A
Intelligence Index: N/A
Livebench: N/A

Technical Specifications

Context Window: 37K Tokens
Vision Support: Yes
Tokens per Second: N/A
Time to First Token (s): N/A
Modalities: text, image, video -> text

About AllenAI: Molmo2 8B

Molmo2-8B is an open vision-language model developed by the Allen Institute for AI (Ai2) as part of the Molmo2 family. It supports image, video, and multi-image understanding and grounding. The model is based on Qwen3-8B and uses SigLIP 2 as its vision backbone. It outperforms other open-weight, open-data models on short videos, counting, and captioning, while remaining competitive on long-video tasks.