Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text and text-to-speech. The Research Engineer will partner with ...
Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text and text-to-speech. The Research Engineer will partner with ...
Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT) and text-to-speech (TTS). They are seeking a highly ...
Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT) and text-to-speech (TTS). They are seeking a highly ...
IVR developer
Austin, TX · On-site
Integrate speech recognition and text-to-speech technologies using Nuance. Monitor and maintain IVR systems to ensure high availability and performance. Troubleshoot and resolve issues related to IVR ...
IVR developer
Austin, TX · On-site
Integrate speech recognition and text-to-speech technologies using Nuance. Monitor and maintain IVR systems to ensure high availability and performance. Troubleshoot and resolve issues related to IVR ...
Whether text-to-text, text-to-speech, speech-to-text, or speech-to-speech, machine translation can help overlooked communities finally be understood in the world. HLT will bring critical educational ...
Whether text-to-text, text-to-speech, speech-to-text, or speech-to-speech, machine translation can help overlooked communities finally be understood in the world. HLT will bring critical educational ...
Lead AI/ML Engineer
$120K - $159K/yr
You will lead the design and delivery of end-to-end voice AI solutions, combining large language models with speech technologies such as speech-to-text, text-to-speech, and real-time streaming audio ...
Quick apply
Lead AI/ML Engineer
$120K - $159K/yr
You will lead the design and delivery of end-to-end voice AI solutions, combining large language models with speech technologies such as speech-to-text, text-to-speech, and real-time streaming audio ...
Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT) and text-to-speech (TTS). As a QA Engineering Manager ...
Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT) and text-to-speech (TTS). As a QA Engineering Manager ...
Software Engineer III, Speech Production, Infrastructure
Mountain View, CA · On-site
$67.75 - $91.25/hr
Knowledge of speech and language technologies, (e.g., Automatic Speech Recognition (ASR), Natural Language Understanding (NLU), Text to speech (TTS). About the job Google's software engineers develop ...
Software Engineer III, Speech Production, Infrastructure
Mountain View, CA · On-site
$67.75 - $91.25/hr
Knowledge of speech and language technologies, (e.g., Automatic Speech Recognition (ASR), Natural Language Understanding (NLU), Text to speech (TTS). About the job Google's software engineers develop ...
... text, text-to-speech, and conversational AI platforms to create natural, responsive, and emotionally aware voice experiences tailored to financial use cases. • Adapt & Fine-Tune Audio and ...
... text, text-to-speech, and conversational AI platforms to create natural, responsive, and emotionally aware voice experiences tailored to financial use cases. • Adapt & Fine-Tune Audio and ...
Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech-emotion-recognition, or other models * Experience in training audio ...
Quick apply
Expertise and intuition for training models in the audio domain, including text-to-speech, ASR, speech-to-speech, speech-emotion-recognition, or other models * Experience in training audio ...
... Text-to-Speech), and NLP/LLM pipelines. • Create frameworks for conversational flows, prompt engineering, retrieval-augmented generation (RAG), and context management. Solution Development • ...
... Text-to-Speech), and NLP/LLM pipelines. • Create frameworks for conversational flows, prompt engineering, retrieval-augmented generation (RAG), and context management. Solution Development • ...
... text, text-to-speech, and conversational AI platforms to create natural, responsive, and emotionally aware voice experiences tailored to financial use cases. • Adapt & Fine-Tune Audio and ...
... text, text-to-speech, and conversational AI platforms to create natural, responsive, and emotionally aware voice experiences tailored to financial use cases. • Adapt & Fine-Tune Audio and ...
Speech recognition & Text to Speech * Web & Cloud technologies Skills: * IVR, Genesys, Java, Telephony, SQL, UNIX, Windows, Oracle
Speech recognition & Text to Speech * Web & Cloud technologies Skills: * IVR, Genesys, Java, Telephony, SQL, UNIX, Windows, Oracle
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App. Google recently named Speechify the Chrome Extension of the Year and Apple named ...
Text To Speech information
What is a Text To Speech (TTS) job?
What are some common challenges faced by professionals working in Text to Speech (TTS) development roles?
What is the difference between Text To Speech vs Voice Actor?
| Aspect | Text To Speech | Voice Actor |
|---|---|---|
| Required Credentials | None or basic audio editing skills | Voice training, acting skills, often professional demos |
| Work Environment | Software, digital platforms, remote | Recording studios, on-location, remote |
| Industry Usage | Automation, AI, tech companies | Media, entertainment, advertising |
| Search & Comparison Intent | Automated voice solutions, TTS technology | Voice acting, narration, character voices |
Text To Speech involves using software to convert written text into spoken words, primarily for automation and digital applications. Voice Actors, on the other hand, provide human voice recordings for media, entertainment, and advertising. While TTS is tech-driven and often used in AI and accessibility tools, Voice Actors bring emotional nuance and personality to their performances. Both roles are essential in their respective industries, but they differ significantly in skills, environment, and purpose.
What are the key skills and qualifications needed to thrive as a Text to Speech Engineer, and why are they important?

Job description
Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text and text-to-speech. The Research Engineer will partner with research scientists to prototype and validate novel modeling ideas, focusing on scalable model training and tooling for speech technologies.
Responsibilities:
• Architect and manage horizontally scalable systems that dramatically accelerate the end-to-end training lifecycle for Speech-to-Text (STT) and Text-to-Speech (TTS) models.
• Design and implement internal UIs and tools that make ML systems and workflows accessible to non-technical stakeholders across the company.
• Oversee and manage training tooling, job orchestration, experiment tracking, and data storage.
Qualifications:
Required:
• Strong experience with the machine learning research pipeline, particularly in STT or related speech domains. This includes experimenting with and evaluating new architectures and modeling approaches, and implementing large-scale training systems.
• Proficiency with orchestration and infrastructure tools like Kubernetes, Docker, and Prefect.
• Familiarity with ML lifecycle tools such as MLflow.
• Experience building internal tools or dashboards for non-technical users.
• Hands-on experience with data engineering practices for unstructured audio and text data.
• Comfortable working in cross-functional teams that include researchers, engineers, and product stakeholders.
Company:
Deepgram provides a voice artificial intelligence platform for speech-to-text, text-to-speech, and voice applications. Founded in 2015, the company is headquartered in San Francisco, USA, with a team of 51-200 employees. The company is currently Growth Stage.
About Deepgram
Sourced by ZipRecruiter
Industry
Software development
Company size
1 - 10 Employees
Headquarters location
Mountain View, CA, US
Year founded
2015