1

Text To Speech Jobs (NOW HIRING)

Deepgram is the leading platform for Voice AI technology, providing real-time APIs for speech-to-text and text-to-speech applications. The role of Research Staff, Data Science involves solving ...

Lead AI/ML Engineer

Mountain View, CA

$120.90K - $159.30K/yr

You will lead the design and delivery of end-to-end voice AI solutions, combining large language models with speech technologies such as speech-to-text, text-to-speech, and real-time streaming audio ...

next page

Showing results 1-20

Text To Speech information

What are the key skills and qualifications needed to thrive as a Text to Speech Engineer, and why are they important?

To thrive as a Text to Speech Engineer, you need a strong background in computer science, linguistics, and digital signal processing, often supported by a relevant degree. Experience with machine learning frameworks, speech synthesis toolkits (like Tacotron or WaveNet), and programming languages such as Python or C++ is typically required. Creativity, analytical thinking, and cross-functional communication skills help you collaborate with diverse teams and innovate in voice technology. These skills ensure the development of accurate, natural-sounding speech systems that meet user and client needs.

What are some common challenges faced by professionals working in Text to Speech (TTS) development roles?

Professionals in Text to Speech development often encounter challenges such as fine-tuning synthetic voices to sound natural and expressive, handling diverse accents or languages, and optimizing algorithms for various platforms. Collaboration with linguists, UX designers, and software engineers is frequent, as ensuring accessibility and seamless integration across applications is a top priority. Staying updated on advances in AI and deep learning is essential, as the field evolves rapidly and demands continuous improvement in both technical and creative aspects.

What is a Text To Speech (TTS) job?

A Text To Speech (TTS) job typically involves converting written text into spoken audio using specialized software or AI technology. Professionals in this field may work on developing, fine-tuning, or implementing TTS systems for various applications, such as virtual assistants, accessibility tools, or audiobooks. The role can also include tasks like voice data collection, script editing, and quality assurance of generated speech. TTS jobs are important for making digital content more accessible to people with visual impairments or reading difficulties. The field combines elements of linguistics, software engineering, and artificial intelligence.

What is the difference between Text To Speech vs Voice Actor?

AspectText To SpeechVoice Actor
Required CredentialsNone or basic audio editing skillsVoice training, acting skills, often professional demos
Work EnvironmentSoftware, digital platforms, remoteRecording studios, on-location, remote
Industry UsageAutomation, AI, tech companiesMedia, entertainment, advertising
Search & Comparison IntentAutomated voice solutions, TTS technologyVoice acting, narration, character voices

Text To Speech involves using software to convert written text into spoken words, primarily for automation and digital applications. Voice Actors, on the other hand, provide human voice recordings for media, entertainment, and advertising. While TTS is tech-driven and often used in AI and accessibility tools, Voice Actors bring emotional nuance and personality to their performances. Both roles are essential in their respective industries, but they differ significantly in skills, environment, and purpose.

More about Text To Speech jobs
What cities are hiring for Text To Speech jobs? Cities with the most Text To Speech job openings:
What states have the most Text To Speech jobs? States with the most job openings for Text To Speech jobs include:
What job categories do people searching Text To Speech jobs look for? The top searched job categories for Text To Speech jobs are:
Infographic showing various Text To Speech job openings in the United States as of May 2026, with employment types broken down into 6% Internship, 55% Full Time, 33% Part Time, and 6% Contract. Highlights an 89% In-person, and 11% Remote job distribution.
Senior Product Marketing Manager, Text-to-Speech (TTS)

Senior Product Marketing Manager, Text-to-Speech (TTS)

Deepgram

Remote

$145K - $185K/yr

Full-time

Medical, Dental, Vision, Life, Retirement, PTO

This job post has expired today. Applications are no longer accepted.


Job description

Company Overview
Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text (STT), text-to-speech (TTS), and building production-grade voice agents at scale. More than 200,000 developers and 1,300+ organizations build voice offerings that are 'Powered by Deepgram', including Twilio, Cloudflare, Sierra, Decagon, Vapi, Daily, Cresta, Granola, and Jack in the Box. Deepgram's voice-native foundation models are accessed through cloud APIs or as self-hosted and on-premises software, with unmatched accuracy, low latency, and cost efficiency. Backed by a recent Series C led by leading global investors and strategic partners, Deepgram has processed over 50,000 years of audio and transcribed more than 1 trillion words. There is no organization in the world that understands voice better than Deepgram.
Company Operating Rhythm
At Deepgram, we expect an AI-first mindset-AI use and comfort aren't optional, they're core to how we operate, innovate, and measure performance.
Every team member who works at Deepgram is expected to actively use and experiment with advanced AI tools, and even build your own into your everyday work. We measure how effectively AI is applied to deliver results, and consistent, creative use of the latest AI capabilities is key to success here. Candidates should be comfortable adopting new models and modes quickly, integrating AI into their workflows, and continuously pushing the boundaries of what these technologies can do.
Additionally, we move at the pace of AI. Change is rapid, and you can expect your day-to-day work to evolve just as quickly. This may not be the right role if you're not excited to experiment, adapt, think on your feet, and learn constantly, or if you're seeking something highly prescriptive with a traditional 9-to-5.
Opportunity
Voice agents are becoming standard infrastructure for customer-facing businesses, and text-to-speech (TTS) is what separates a voice agent that feels natural from one that feels broken. We're looking for a Senior PMM to own our go-to-market motion in TTS and make Deepgram the default choice for teams building voice agents.
This is a market ownership role. You'll define how Deepgram shows up in the TTS space, build the narrative that sets us apart from general-purpose providers, drive adoption from developer trial through enterprise deployment, and own product launches as they come. You'll work across Product, Sales, Developer Relations, and Demand Gen to make sure the TTS story is consistent, compelling, and closing deals.
We're an AI-native team. We expect AI to be woven into how you work, not something you reach for occasionally.
What You'll Do
  • Own the TTS category narrative. Define what "TTS for voice agents" means in a way that plays to Deepgram's strengths and makes general-purpose providers look like the wrong tool for the job.
  • Build and execute GTM strategy across developer adoption, pipeline influence, and account expansion. TTS doesn't always sell alone; know how it fits into voice agent and speech-to-text (STT) deals and make sure it shows up at the right moments.
  • Lead product launches for TTS releases: new voices, model improvements, latency advances, deployment options. Clear messaging, sharp launch plans, tight cross-functional execution.
  • Own competitive positioning. Battlecards, objection-handling guides, and displacement plays against ElevenLabs, OpenAI, Amazon Polly, PlayHT, and others. Know where we win, where we lose, and why.
  • Enable sales and build proof points. Collateral, customer stories, and benchmarks that win TTS deals and surface TTS opportunities in broader conversations.
  • Collaborate with Product on market and competitive intelligence: what voice agent builders need, what competitors are doing, where the market is going.
  • Drive developer-facing content and awareness with Developer Relations: tutorials, documentation messaging, use case content, and SEO tied to how voice agent builders actually search.
  • Build AI-assisted PMM workflows. Use AI to accelerate research, competitive synthesis, and content production. Build repeatable systems, not one-off prompts. You set the strategy; AI handles the first drafts.

You'll Love This Role If You
  • Are energized by owning a market, not just supporting a product. You want to shape how buyers think about the problem.
  • Understand the voice agent stack well enough to tell a coherent story about where TTS fits and why it matters.
  • Move fluidly between a developer audience and a business buyer without losing either one.
  • Are comfortable marketing a product with both a technical integration story and an experiential one, where one buyer is reading API docs and another is deciding if a voice sounds right for their brand.
  • Default to AI-first: you use AI continuously, have good judgment about when it's wrong, and think in systems rather than one-off prompts.

It's Important To Us That You Have
  • 5+ years of product marketing experience, including at least 2 years on infrastructure, API, or developer-facing products.
  • Strong messaging skills: you can build a positioning architecture from scratch and know the difference between a message that's technically accurate and one that actually moves people.
  • A high bar for quality: you notice when copy is off, when a layout doesn't work, when creative doesn't match the brand, and you can give useful feedback on all of it.
  • Working knowledge of the AI landscape: frontier models, major LLM providers, how the ecosystem fits together. You need to talk about this credibly with technical buyers.
  • Technical depth: you can engage with engineering and product teams, read API docs, and translate product-level details into market-facing narratives without oversimplifying.
  • Fluency in both product-led and sales-led growth motions, and how they interact.
  • Demonstrated AI fluency: concrete examples of AI-assisted workflows you've built, iterated on, and measured, with specific outcomes to back it up.

It Would Be Great if You Had
  • Experience in the voice AI space and familiarity with how TTS, STT, and large language models come together in a production voice agent stack.
  • Hands-on experience with AI-native tools across the stack: agentic coding (Claude Code, Codex) and collaborative work tools (Cowork, etc.).
  • Experience with technical proof points, model benchmarks, or evaluation frameworks. In TTS, buying decisions often come down to a listening test.
  • Comfort operating where the playbook doesn't fully exist yet and you're building it as you go.

Benefits & Perks*
Holistic health
  • Medical, dental, vision benefits
  • Annual wellness stipend
  • Mental health support
  • Life, STD, LTD Income Insurance Plans

Work/life blend
  • Unlimited PTO
  • Generous paid parental leave
  • Flexible schedule
  • 12 Paid US company holidays
  • Quarterly personal productivity stipend
  • One-time stipend for home office upgrades
  • 401(k) plan with company match
  • Tax Savings Programs

Continuous learning
  • Learning / Education stipend
  • Participation in talks and conferences
  • Employee Resource Groups
  • AI enablement workshops / sessions

*For candidates outside of the US, we use an Employer of Record model in many countries, which means benefits are administered locally and governed by country-specific regulations. Because of this, benefits will differ by region - in some cases international employees receive benefits US employees do not, and vice versa. As we scale, we will continue to evaluate where we can create more alignment, but a 1:1 global benefits structure is not always legally or operationally possible.
Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $215M in total funding. If you're looking to work on cutting-edge technology and make a significant impact in the AI industry, we'd love to hear from you!
Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.
We are happy to provide accommodations for applicants who need them.