Deep Voice Jobs (NOW HIRING)

Senior Software Engineer Take2 AI is hiring a Senior Software Engineer with deep Voice AI expertise and strong backend engineering skills. Our real-time voice agent pipeline (VAD + STT → LLM → ...

About The Role Take2 AI is hiring a Senior Software Engineer with deep Voice AI expertise and strong backend engineering skills. Our real-time voice agent pipeline (VAD + STT → LLM → TTS) already ...

Perform deep-dive analysis of voice issues using traces (SIP ladders) and packet captures. * Support and maintain VoIP platforms, including IP PBXs, softswitches, and hosted voice ...

... Deep understanding of VoIP protocols, SIP, dial plans, voice gateways. • Experience with large-scale rollouts (1,000+ devices). • Certifications such as CCNP Collaboration, Microsoft Teams Voice ...

We always ground our innovation in our deep experience and strong financial foundation, so we're a partner you can trust. What You'll Do: As a Network Voice Engineer at Customers Bank, you will be a ...

Voice Engineer III

San Antonio, TX · On-site

$70K - $75K/yr

Deep understanding of SIP internetworking, signaling, and call-flow mechanics * Proven track record of advanced fault analysis across routing, switching, security, and large-scale voice deployments

Deep understanding of: * SIP, RTP, VoIP codecs, QoS * SBCs (AudioCodes, Ribbon, Oracle, or similar ... Background in hybrid voice environments (onprem + cloud) * Scripting or automation experience ...

Deep Voice information

Hourly pay scale: $5, $48, $76

How much do deep voice jobs pay per hour?

As of Jun 9, 2026, the average hourly pay for Deep Voice jobs in the United States is $48.17, according to ZipRecruiter salary data. Most workers in this role earn between $39.18 and $60.10 per hour, depending on experience, location, and employer.

What is a Deep Voice job?

A Deep Voice job typically involves voice-related work, such as voice acting, AI voice modeling, or speech synthesis. Professionals in this field may lend their deep vocal tones for narration, commercials, audiobooks, or virtual assistants. Some roles also focus on training AI to replicate human voices for digital applications. This job requires strong vocal control, clarity, and sometimes technical knowledge of sound recording or AI voice generation.

What types of projects or clients might a Deep Voice professional typically work with?

Deep Voice professionals are often sought after for projects such as commercial voiceovers, documentary narration, audiobook recordings, radio advertising, and animation or video game characters that require a strong vocal presence. Clients can include advertising agencies, audiobook publishers, media production companies, and sometimes corporate clients looking for authoritative narration. Project scope and requirements can vary greatly, and professionals may work independently or as part of a larger creative or production team. This diversity enables voice talent to showcase their range and potentially specialize in certain industries or styles over time.

What are the key skills and qualifications needed to thrive in the Deep Voice position, and why are they important?

To thrive as a Deep Voice talent, you need a naturally resonant, clear, and controlled vocal tone, often complemented by experience in voice acting, broadcasting, or audio production. Familiarity with professional recording equipment, audio editing software, and, in some cases, voiceover or broadcasting certifications is typically valued. Excellent reading skills, adaptability, and professionalism help you deliver scripts convincingly and work efficiently with clients and production teams. These skills are crucial for maintaining vocal health, delivering high-quality recordings, and building a reputation in the voiceover industry.

Infographic showing Deep Voice job openings in the United States as of June 2026, with employment types broken down into 75% Full Time and 25% Part Time. It highlights a 94% In-person and 6% Hybrid job distribution, with an average salary of $100,198 per year, or $48.2 per hour.

Senior Software Engineer, Voice AI

Take2

Manhattan, NY

$135K - $178K/yr

Other

Posted 10 days ago


Job description

Senior Software Engineer

Take2 AI is hiring a Senior Software Engineer with deep Voice AI expertise and strong backend engineering skills. Our real-time voice agent pipeline (VAD + STT → LLM → TTS) already powers thousands of candidate conversations every month—now we're scaling to millions. You'll tackle challenges in low latency, high reliability, and large-scale architecture, shaping how our platform evolves as we grow. This role is hands-on, highly technical, and perfect for someone who thrives in a startup environment, playing a key role from design through production with significant ownership over technical execution and direction.

Required:
  • 6+ years software engineering experience (backend or full stack)
  • Bachelor's in Computer Science or Computer Engineering
  • Direct, hands-on experience integrating and optimizing STT, TTS, VAD, and LLMs for real-time voice agents
  • Scaled and optimized voice pipelines for low latency, high availability, and real-time performance
  • Designed and implemented agent orchestration, evaluation frameworks, and related tooling
  • Shipped production AI applications with real-time inference, integrating ML models into live systems
  • Built and operated distributed backend systems with monitoring and observability in place
  • Strong proficiency in Python, JavaScript, Node.js, AWS, Kubernetes, Docker
  • 2+ years at an early to mid-stage startup (Series B or lower)
Preferred:
  • Experience self-hosting AI/ML models for use in voice AI pipelines, including tuning and optimizing them for performance and reliability
  • Experience owning and operating distributed systems in high-throughput, streaming, low-latency environments, ensuring scalability, reliability, and performance under real-time constraints

  • Own and scale the end-to-end voice agent pipeline that already powers thousands of monthly conversations, and help us grow it to millions
  • Select, integrate, and tune models for optimal latency, quality, and reliability
  • Build orchestration logic, evaluation systems, and supporting backend services
  • Ensure low latency, high availability, and scalability of voice agent infrastructure
  • Collaborate closely with product and engineering to rapidly prototype and deliver features, playing a key role in driving the vision and evolution of our platform and product with significant ownership over technical execution and direction

We're NYC-based and work hybrid (in-office Mon-Thu). We value in-person collaboration but also trust people to manage their time responsibly.

Competitive salary + meaningful equity. This is a chance to join at a stage where your work meaningfully shapes the product and your career trajectory.