Inception: Mercury

Developed by Inception

API Pricing (per 1M tokens)

Input: $0.25
Output: $1.00
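
As a rough illustration of how the per-1M-token prices above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the constants are hypothetical helpers introduced for this example, not part of any Inception API; the prices are simply the listed values and may change.

```python
# Listed prices in USD per 1M tokens (copied from the pricing above; subject to change).
INPUT_PRICE_PER_M = 0.25
OUTPUT_PRICE_PER_M = 1.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper, not an official API)."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    # Prices are quoted per one million tokens, so scale down accordingly.
    return total / 1_000_000
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens would cost about $0.001, with input and output each contributing half.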

Intelligence Benchmarks

MMLU Pro: N/A
GPQA: N/A
Intelligence Index: N/A
Livebench: N/A

Technical Specifications

Context Window: 128K tokens
Vision Support: No
Tokens per Second: N/A
Time to First Token (s): N/A
Modalities: text->text

About Inception: Mercury

Mercury is the first diffusion large language model (dLLM). Applying a breakthrough discrete diffusion approach, the model runs 5-10x faster than even speed-optimized models like GPT-4.1 Nano and Claude 3.5 Haiku while matching their performance. Mercury's speed enables developers to provide responsive user experiences, including voice agents, search interfaces, and chatbots. Read more in the [blog post](https://www.inceptionlabs.ai/blog/introducing-mercury).