Sarvam AI Outshines Gemini and ChatGPT with 84.3% OCR Accuracy, Global Eyes on India

Sarvam AI Gains Global Backing as Vision Hits 93.28% Accuracy and Bulbul V3 Expands to 11 Indian Languages
Sarvam AI Outshines Gemini and ChatGPT
Written By:
Simran Mishra
Reviewed By:
Manisha Sharma
Published on

India takes a major step forward in artificial intelligence at a global scale. Sarvam AI, a Bengaluru-based startup, has surprised the tech community with AI models that perform better than Google Gemini and ChatGPT on specific India-focused tasks.

Sarvam AI has delivered a breakthrough that changes long-held perceptions. This startup has shown that India can build world-class artificial intelligence models from the ground up. This achievement has shifted attention toward India as a serious AI innovator. 

A Push Toward Sovereign AI

Sarvam AI focuses on sovereign AI. The company builds foundational models within the country, reducing reliance on foreign systems. It also solves local problems with sharper precision. Two recent releases have driven this momentum. Sarvam Vision and Bulbul V3 now set new benchmarks.

Indian languages often challenge global AI systems. Sarvam Vision targets optical character recognition. The model reads complex documents with high accuracy. The model scored 84.3% on the olmOCR Bench, surpassing Google Gemini 3 Pro and other popular OCR tools. ChatGPT ranked much lower on the same test.

The model also performed strongly on OmniDocBench v1.5. This benchmark measures real-world document understanding. Sarvam Vision achieved 93.28% accuracy. Enterprises and researchers have praised the consistency.

Global Attention and Industry Response

Sarvam AI co-founder Pratyush Kumar shared these results on X. The posts sparked strong global reactions. Tech commentators admitted previous doubts. Deedy Das publicly revised his stance. He highlighted the value of focused Indic language models. He also praised reasonable pricing and strong performance across OCR and speech tasks.

Bulbul V3 has added another layer of impact. This text-to-speech model focuses on natural voice generation. Bulbul V3 supports more than 35 voices across 11 Indian languages. Expansion plans target 22 languages. The model delivers stable and expressive output. It reduces errors common in long or complex scripts.

Bulbul V3 now competes with global leaders like ElevenLabs. Startups and developers have welcomed the shift. Pratik Desai from KissanAI called Bulbul the default choice for Indic use cases. He cited better cost efficiency and steady improvements.

What This Means for India’s AI Future

Sarvam AI’s rise holds wider importance. India has long served as an AI talent hub. Core innovation often stayed abroad. Sarvam AI changes that narrative. Sovereign AI now gains credibility. Banking, education, and government services stand to benefit. Multilingual accuracy remains critical in these sectors.

There are still a lot of challenges. Scaling compute infrastructure demands heavy investment. Global benchmarks still favor large English-focused models. Sarvam AI continues to target depth over breadth. The strategy aligns with India’s linguistic diversity and scale.

Sarvam AI now represents a shift in global AI development. India no longer follows trends. It shapes them with purpose-built innovation.

Also Read: X Likely to Get its Own AI Video Editor, Launch Expected in 3 Months

Join our WhatsApp Channel to get the latest news, exclusives and videos on WhatsApp

Related Stories

No stories found.
logo
Analytics Insight: Latest AI, Crypto, Tech News & Analysis
www.analyticsinsight.net