Voice Research Task Jobs (NOW HIRING)

Work on cutting-edge AI projects alongside leading research labs and Fortune 500 companies * Fully ... Freelance autonomy with the structure of meaningful, task-based work * Make a direct, tangible ...

... research labs and Fortune 500 companies * Fully remote and flexible - record and review on your own schedule * Freelance autonomy with the structure of meaningful, task-based work * Your voice ...

Work on cutting-edge AI projects alongside leading research labs and Fortune 500 companies * Fully ... Freelance autonomy with meaningful, well-defined task-based work * Your voice directly contributes ...

... research labs and Fortune 500 partners * Fully remote and flexible - record on your own schedule, from your own studio * Freelance autonomy with clear, well-defined task structures * Your voice ...

Drive cross-functional teams--including research, product design, engineering, and product ... Experience working in a task or project management system such as Jira, Asana, Trello, etc

AI Voice Producer

Seattle, WA · On-site

$142K - $195K/yr

... research, product design, engineering, and product management--to ensure alignment and successful ... a task or project management system such as Jira, Asana, Trello, etc. • Familiarity with ...

Work on cutting-edge AI projects alongside leading research labs and Fortune 500 companies * Fully ... Freelance autonomy with meaningful, well-defined task-based work * Make a direct, tangible impact ...

Voice Research Task information

Hourly pay scale: $9 / $26 / $52

How much do voice research task jobs pay per hour?

As of May 30, 2026, the average hourly pay for a Voice Research Task job in the United States is $26.92, according to ZipRecruiter salary data. Most workers in this role earn between $18.03 and $39.66 per hour, depending on experience, location, and employer.

What is a Voice Research Task job?

A Voice Research Task job involves recording speech samples, transcribing audio, or analyzing voice data to improve speech recognition systems, AI assistants, or linguistic models. Participants may be asked to read specific phrases, engage in conversations, or provide feedback on synthesized speech. These tasks help develop and refine voice-based technologies for better accuracy and inclusivity. No specialized skills are usually required, but clear speech and adherence to guidelines are essential.

What are the key skills and qualifications needed to thrive in the Voice Research Task position, and why are they important?

To excel in a Voice Research role, you need a background in linguistics, phonetics, audio engineering, or a related field, with experience in collecting and analyzing voice data. Familiarity with audio editing software (like Audacity or Adobe Audition), speech processing tools, and sometimes programming languages such as Python is often required. Strong attention to detail, analytical thinking, and effective communication are valuable soft skills in this position. These abilities are crucial for producing high-quality research outputs and collaborating across multidisciplinary teams in voice technology development.

What typical projects or tasks can I expect to work on in a Voice Research position?

In a Voice Research position, you may work on projects such as designing and conducting voice data collection studies, analyzing speech and audio samples, and developing or refining speech recognition systems. Responsibilities often include transcribing or annotating audio data, evaluating the performance of voice-driven technologies, and collaborating with engineers or linguists to improve product functionality. You'll likely be involved in troubleshooting research challenges and finding solutions to ensure data quality. The work environment is often collaborative, requiring ongoing coordination with cross-functional teams. This role offers exposure to the latest advancements in speech technology and opportunities for professional growth.

Research Scientist - Speech Synthesis

Nuance Labs

Seattle, WA

Other

Posted 14 days ago


Job description

About Nuance Labs

Nuance Labs is an early-stage deep tech startup. We're building the first real-time human foundation model - unifying text, speech, and vision - to make AI socially and emotionally intelligent. Imagine an AI that can understand a quirked eyebrow, a shift in tone, or a hesitant pause, and respond in a way that feels truly human.

This is for you if you:
  • Have a PhD (or equivalent experience) in training speech synthesis models (text-to-speech, speech-to-speech, etc.), training audio generation models, or related fields, with a track record of pushing the research frontier

  • Know deep learning inside out and can run the whole ML pipeline, from data wrangling and rapid prototyping to large-scale training, benchmarking, and evaluation

  • Love blank-page problems, chart your own course, and make progress without waiting for someone to hand you a task list

  • Move quickly from research breakthroughs to practical, real-world applications

  • Write code that's clean enough that your future self will thank you for it

  • Play well with other brilliant minds from different domains

What you'll be building

The first human foundation model that operates across text, speech, facial expression, and body language in real time. This unified model:

  • Understands fine-grained human signals - from a quirked eyebrow to a subtle change in voice - and infers meaning in context

  • Generates lifelike, responsive avatars whose expressions, gestures, and tone evolve frame-by-frame to deliver genuine responses

The landscape is ripe for innovation. While voice AI systems have made great strides in capturing prosody, and avatar platforms can generate compelling visuals, existing solutions remain fragmented. Real-time, multimodal interaction - where voice, facial expression, and contextual perception converge - is still an unsolved problem. This role offers the rare opportunity to shape foundational technology in a space where the boundaries are still being defined.

Why this team

We're research scientists who've spent years advancing AI avatar and audio-visual generation - publishing at top conferences and shipping ultra-low-latency ML products to millions. We combine frontier research with the ruthless engineering needed for consumer-grade, real-time systems.