Job Description

Description : Senior Data Scientist, Speech, Voice & Conversational AI

Experience : 12 to 15 Years

Location : Remote

Notice Period : Immediate to 30 Days

About The Role

We are looking for a Senior Data Scientist specializing in Speech, Voice, and Conversational AI to lead the design and deployment of scalable, real-time voice AI systems.

This role involves working on advanced AI solutions leveraging Generative AI and Large Language Models to transform contact center and BPO operations.

Key Responsibilities

Design and oversee the end-to-end architecture for voice AI solutions, focusing on real-time speech-to-text, text-to-speech, and voice biometrics.
Evaluate and integrate leading speech platforms and APIs, such as Google Cloud Speech, Amazon Transcribe, Azure Speech Services, and Whisper (OpenAI).
Develop voice pipelines capable of handling noise cancellation, language identification, and real-time streaming at production scale.
Architect and implement GenAI-powered conversational agents using Large Language Models like GPT-4 and open-source alternatives.
Design workflows using Agentic AI frameworks to build multi-step, tool-using voice agents, optimizing for speech-centric use cases.
Lead solution design workshops to translate business requirements into scalable voice AI architectures, establishing best practices and accelerators.
Mentor and develop a team of data scientists and ML engineers, fostering expertise in speech AI and generative AI technologies.
Stay updated with the latest GenAI and speech AI research, applying insights to enhance the organization's AI capabilities.

Skills & Experience

12 to 15 years of experience in Data Science, Machine Learning, or AI Engineering, with 5+ years focused on speech, voice, or audio ML.
Proven track record of deploying production-grade speech/voice AI systems on major cloud platforms such as Google Cloud, Azure, or AWS.
Deep knowledge of Generative AI, including transformer architectures and inference optimization.
Experience in building Agentic AI solutions with autonomous decision-making capabilities.
Published research or contributions to open-source projects in speech or NLP are a strong advantage.
Experience with BPO and contact center transformations using voice AI, along with familiarity with speech analytics and agent assist technologies, is preferred.

Preferred Qualifications