Oriserve open-sources India-focused AI speech model fine-tuned on Whisper

Oriserve has open-sourced Whisper–Hindi2Hinglish-Apex, a fine-tuned ASR model built for Hindi, Hinglish and Indian-accented English, improving accuracy on real-world, code-mixed and noisy audio.

The launch targets a critical gap in AI speech systems for India, where global ASR models typically show accuracy drops on code-mixed speech, strong regional accents and noisy telephonic audio

Oriserve has released Whisper–Hindi2Hinglish-Apex, an open-source automatic speech recognition (ASR) model fine-tuned on OpenAI’s Whisper and adapted for Hindi, Hinglish and Indian-accented English. The model is now available on Hugging Face.

The launch targets a critical gap in AI speech systems for India, where global ASR models typically show accuracy drops on code-mixed speech, strong regional accents and noisy telephonic audio. While Whisper remains a widely used multilingual ASR model, its performance declines on non-standardised and hybrid Indian datasets.

Whisper–Hindi2Hinglish-Apex retains Whisper’s architecture but is trained on more than 1,000 hours of conversational Indian audio, including call-centre recordings and mixed Hindi–English speech. The fine-tuning is intended to improve accuracy in enterprise conditions, where accent diversity and low-quality audio are common.

The model contains about 800 million parameters. Oriserve says it offers:

•faster inference than larger Whisper variants,

•a 42% improvement over Whisper’s baseline on internal benchmarks, and

•stronger handling of accented, hybrid and noisy audio.

“India needs speech models trained on its own linguistic data, not just adapted global datasets,” said Anurag Jain, co-founder, Oriserve. “Open-sourcing this model enables developers to build AI systems aligned with real Indian audio environments.”

Co-founder Maaz Ansari said the effort is aimed at reducing dependence on proprietary cloud ASR systems and enabling on-premise or hybrid deployments across sectors such as BFSI, telecom, healthcare and education.

This is Oriserve’s third release in its open-source AI series. The company plans to extend fine-tuned Whisper variants to Marathi, Gujarati, Tamil, Telugu, Kannada, Malayalam, Bengali and Punjabi as part of its larger multilingual AI roadmap.

Read the full article covered by Financial Express.

Related Post

Personalizing BFSI Customer Journeys with AI

Anurag Jain, Co-Founder Oriserve  Anurag is a tech strategist and entrepreneur passionate about turning cutting-edge ideas into impactful solutions. Driven by a mission to empower businesses through technology, over the past 12 years, he has worn many hats including product manager, AI advocate, and startup scaler. When I first started working with banks and insurers, personalization […]

AI Revolution: Oriserve’s Anurag Jain on How Enterprises are Redefining Trust in AI

How Trust Is Redefining Enterprise AI Adoption, Oriserve’s Anurag Jain Shares Expert Insights Artificial Intelligence is becoming a transformative force in the business world, with predictions suggesting it will add $15 trillion to global GDP by 2030. Many believe the country leading in AI will dominate the next generation. However, businesses face the challenge of […]

ORISERVE’S GENERATIVE VOICE AI PLATFORM IS DRIVING STRATEGIC TRANSFORMATION IN BFSI REVENUE OPERATIONS

Oriserve (ORI), a bootstrapped startup with a team of over 100 professionals based in Mumbai and Delhi, is revolutionising enterprise communications as a next-generation voice-based Generative AI platform tailored for Banking, Financial Services and Insurance (BFSI). With over 1.2 billion conversations orchestrated globally, ORI is establishing a formidable presence in India and the Middle East […]