
Meesho delivers real-time, multilingual customer support with Conversational AI
Skalierung unglaublicher Erlebnisse für Millionen von Nutzern in Hindi und Englisch
Black Friday
Building human-realistic mock interviews for millions of job seekers across India
Interview preparation in India has long been broken - generic, disconnected, and inaccessible to most job seekers.
Apna, India’s leading job search and careers platform, set out to change that by making every mock interview feel like a real one - personalized to each role, company, and candidate.
With over 60 million users and 10,000+ companies across 30,000+ roles, Apna’s vision required more than training modules. It demanded conversation - lifelike timing, empathy, and domain depth - at massive scale.
To achieve this, Apna engineered one of the most advanced AI interview ecosystems, powered by ElevenLabs Text to Speech and Blue Machines’ voice orchestration platform. Together, these systems have delivered over 1.5 million AI interviews, totaling 7.5 million voice minutes, with sub-300 ms latency.
For interview simulations to feel natural, voice quality and responsiveness are inseparable. Any audible delay or robotic tone breaks immersion and trust.
Apna selected ElevenLabs for three core reasons:
These qualities allow Apna to preserve the rhythm of real conversation while maintaining emotional credibility at scale.
Um diese lebensechten Interviews zu ermöglichen, musste Apna eine komplexe Orchestrierungsherausforderung lösen. Ein simuliertes Interview, das real wirkt, erfordert mehr als nur ein Skript; es benötigt synchronisierte Präzision in Bezug auf Stimme, Latenz, Empathie und Kontext – alles im Einklang mit Maschinengeschwindigkeit.
Jedes Unternehmen führt Interviews unterschiedlich. Die Rolle eines Produktmanagers könnte auf Metrikverständnis getestet werden; die Rolle eines Bankkreditsachbearbeiters auf Compliance-Logik; und eine Führungskraft einer E-Commerce-Plattform auf Routenoptimierung.
Hinter den Kulissen hat Apnas Orchestrierungsplattform, Blue Machines, einen Retrieval-Augmented Generation (RAG) Graphen für jede Rolle × Unternehmensschnittstelle erstellt:Blue Machines built a Retrieval-Augmented Generation (RAG) graph for each role × company intersection:
● 10 000 + companies × 50–100 roles = ~500 million micro-models.
● Each model anchored to company-specific rubrics, tone, and vocabulary.
They integrated ElevenLabs’ streaming TTS directly into its conversational loop. Each turn begins with candidate speech, processed by multilingual ASR and NLU models, followed by workflow logic that evaluates intent, emotional tone, and role-specific context. The system then retrieves relevant domain data, composes the next question, and plays it back through ElevenLabs — all within roughly 300 milliseconds end-to-end.
“Each synthesized response begins playback within ~150–180 ms, thanks to ElevenLabs’ low-latency APIs integrated directly into Apna and Blue Machines’ orchestration layer”, said Abhishek Ranjan, CTO, Apna
At 300 ms, the human brain perceives speech as continuous rather than delayed - the threshold where realism begins.
The result is a system that balances technical precision with emotional depth. Thousands of interviews run concurrently across Indian English, Hindi, and code-mixed speech, each maintaining the rhythm, empathy, and credibility of a real human exchange.
A 24-year-old candidate from Pune shared:
The AI interviewer knew my résumé, switched between Hindi and English, and challenged me like a real HDFC bank panel. I cracked the job on my next attempt.
For the first time, candidates can practice interviews that feel truly real – tailored to their résumé, company, and dream role.
Apna’s AI Interview Prep shows how voice technology can democratize opportunity - giving millions of job seekers the same level of preparation once reserved for a privileged few.
For many, practicing with a lifelike interviewer builds real confidence before their first human interview.
By combining real-time voice with adaptive context and empathy, Apna has turned preparation into participation - giving everyone, regardless of background or language, an equal chance to succeed.
Apna’s AI Interview Prep defines the next generation of AI-driven learning and interviewing.
Realistic, responsive voices powered by the ElevenLabs Text to Speech API let candidates experience personalized feedback, natural timing, and bilingual fluency that text-based practice could never offer.
Through this collaboration, Apna has redefined what scalable learning sounds like - proving that voice-based AI can extend human opportunity, not replace it.
Apna’s success demonstrates how high-fidelity voice can transform education, employability, and access to opportunity at national scale.
If you’re building conversational learning tools, AI interviewers, or any system where realism and empathy matter, discover what's possible with ElevenLabs Conversational Agents Platform.

Skalierung unglaublicher Erlebnisse für Millionen von Nutzern in Hindi und Englisch

20.000 Stunden mehrsprachiger Kundengespräche pro Monat in umsetzbare Erkenntnisse verwandeln
Bereitgestellt von ElevenLabs Agenten