Description : Senior Data Scientist Speech, Voice & Conversational AI
Experience : 12 to 15 Years
Location : Remote
Notice Period : Immediate to 30 Days
About The Role
We are looking for a Senior Data Scientist specializing in Speech, Voice, and Conversational AI to lead the design and deployment of scalable, real-time voice AI systems.
This role involves working on advanced AI solutions leveraging Generative AI and Large Language Models to transform contact center and BPO operations.
Key Responsibilities
- Design and oversee the end-to-end architecture for voice AI solutions, focusing on real-time speech-to-text, text-to-speech, and voice biometrics.
- Evaluate and integrate leading speech platforms and APIs, such as Google Cloud Speech, Amazon Transcribe, Azure Speech Services, and Whisper (OpenAI).
- Develop voice pipelines capable of handling noise cancellation, language identification, and real-time streaming at a production scale.
- Architect and implement GenAI-powered conversational agents using Large Language Models like GPT-4 and open-source alternatives.
- Design workflows using Agentic AI frameworks to build multi-step, tool-using voice agents, optimizing for speech-centric use cases.
- Lead solution design workshops to translate business requirements into scalable voice AI architectures, establishing best practices and accelerators.
- Mentor and develop a team of data scientists and ML engineers, fostering expertise in speech AI and generative AI technologies.
- Stay updated with the latest GenAI and speech AI research, applying insights to enhance the organization's AI Skills & Experience :
- 12 to 15 years of experience in Data Science, Machine Learning, or AI Engineering, with 5+ years focused on speech, voice, or audio ML.
- Proven track record of deploying production-grade speech/voice AI systems using major cloud platforms like Google, Azure, or AWS.
- Deep knowledge of Generative AI, including transformer architectures and inference optimization.
- Experience in building Agentic AI solutions with autonomous decision-making capabilities.
- Published research or contributions to open-source projects in speech or NLP is a strong advantage.
- Experience with BPO, contact center transformations using voice AI, and familiarity with speech analytics and agent assist technologies is preferred.
Preferred Qualifications
- Experience in BPO / Contact Center domain (Mandatory)
Education
- Masters or Ph.D. in Computer Science, Electrical Engineering, Computational Linguistics, or related field
(ref:hirist.tech)