Oriserve open-sources India-focused AI speech model fine-tuned on Whisper

Oriserve has open-sourced Whisper–Hindi2Hinglish-Apex, a fine-tuned ASR model built for Hindi, Hinglish and Indian-accented English, improving accuracy on real-world, code-mixed and noisy audio.

The launch targets a critical gap in AI speech systems for India, where global ASR models typically show accuracy drops on code-mixed speech, strong regional accents and noisy telephonic audio

Oriserve has released Whisper–Hindi2Hinglish-Apex, an open-source automatic speech recognition (ASR) model fine-tuned on OpenAI’s Whisper and adapted for Hindi, Hinglish and Indian-accented English. The model is now available on Hugging Face.

The launch targets a critical gap in AI speech systems for India, where global ASR models typically show accuracy drops on code-mixed speech, strong regional accents and noisy telephonic audio. While Whisper remains a widely used multilingual ASR model, its performance declines on non-standardised and hybrid Indian datasets.

Whisper–Hindi2Hinglish-Apex retains Whisper’s architecture but is trained on more than 1,000 hours of conversational Indian audio, including call-centre recordings and mixed Hindi–English speech. The fine-tuning is intended to improve accuracy in enterprise conditions, where accent diversity and low-quality audio are common.

The model contains about 800 million parameters. Oriserve says it offers:

•faster inference than larger Whisper variants,

•a 42% improvement over Whisper’s baseline on internal benchmarks, and

•stronger handling of accented, hybrid and noisy audio.

“India needs speech models trained on its own linguistic data, not just adapted global datasets,” said Anurag Jain, co-founder, Oriserve. “Open-sourcing this model enables developers to build AI systems aligned with real Indian audio environments.”

Co-founder Maaz Ansari said the effort is aimed at reducing dependence on proprietary cloud ASR systems and enabling on-premise or hybrid deployments across sectors such as BFSI, telecom, healthcare and education.

This is Oriserve’s third release in its open-source AI series. The company plans to extend fine-tuned Whisper variants to Marathi, Gujarati, Tamil, Telugu, Kannada, Malayalam, Bengali and Punjabi as part of its larger multilingual AI roadmap.

Read the full article covered by Financial Express.

Related Post

ORISERVE’S GENERATIVE VOICE AI PLATFORM IS DRIVING STRATEGIC TRANSFORMATION IN BFSI REVENUE OPERATIONS

Oriserve (ORI), a bootstrapped startup with a team of over 100 professionals based in Mumbai and Delhi, is revolutionising enterprise communications as a next-generation voice-based Generative AI platform tailored for Banking, Financial Services and Insurance (BFSI). With over 1.2 billion conversations orchestrated globally, ORI is establishing a formidable presence in India and the Middle East […]

How Oriserve is Powering Business Automation: Maaz Ansari Speaks

An Exclusive Interview with Maaz Ansari, Co-Founder of Oriserve (Ori), a next-gen Generative AI platform Maaz Ansari, Co-Founder of Oriserve (Ori), is at the forefront of redefining enterprise communication with a next-gen Generative AI platform. In this interview, he shares Ori’s journey, its mission to transform customer engagement, and how AI-driven innovation is bridging the […]

Human+AI Collaboration in Debt Collections and Customer Retention

By Maaz Ansari, Co-Founder, Oriserve The way organisations approach debt collections and customer retention is undergoing a fundamental transformation. What was once a rigid, script-driven process handled largely through call centres is now giving way to more nuanced, technology-enabled interactions. Customers, even in sensitive financial situations, increasingly expect speed, empathy, and personalised engagement — a reality that traditional models struggle to […]