1

Speech Data Collection Jobs (NOW HIRING)

Data Collection

San Jose, CA · On-site

$150K - $250K/yr

One that is proactive, multimodal, and capable of interacting with the world through speech, text ... About the Role You'll own data collection at Hark - the programs, the vendors, and the pipelines ...

One that is proactive, multimodal, and capable of interacting with the world through speech, text ... About the Role You'll own data collection at Hark - the programs, the vendors, and the pipelines ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

We count features, take pictures, make videos, record speech, and scan areas for every detail you need to make more informed decisions. Our field data collection teams are spread across Europe and ...

next page

Showing results 1-20

Speech Data Collection information

See salary details

$5

$23

$28

How much do speech data collection jobs pay per hour?

As of Jun 13, 2026, the average hourly pay for speech data collection in the United States is $23.85, according to ZipRecruiter salary data. Most workers in this role earn between $20.91 and $26.20 per hour, depending on experience, location, and employer.

What is a Speech Data Collection job?

A Speech Data Collection job involves gathering and recording spoken language samples to improve speech recognition systems, virtual assistants, and AI applications. Participants may read predefined phrases, engage in conversations, or record spontaneous speech in various languages and accents. The collected data helps train and enhance machine learning models for better voice recognition and natural language processing. These jobs are often remote and may require specific demographic or linguistic criteria.

What kind of tasks can I expect to perform daily as a Speech Data Collection specialist?

As a Speech Data Collection specialist, your daily tasks may include recording spoken phrases, transcribing audio data, annotating speech segments, and ensuring that collected data meets quality and format standards. You might collaborate with linguists, researchers, or engineers to define project needs and troubleshoot technical issues. Attention to accuracy and adherence to project guidelines are regularly emphasized, and tasks may sometimes be repetitive but require consistent concentration. This role often involves both independent work and participation in small teams, providing opportunities to learn about cutting-edge speech and language technologies.

What are the key skills and qualifications needed to thrive in the Speech Data Collection position, and why are they important?

To thrive in Speech Data Collection, you need strong attention to detail, linguistic or phonetic knowledge, and proficiency in recording or transcribing speech data, often supported by a high school diploma or relevant coursework. Familiarity with audio recording equipment, data annotation tools, and platforms like Praat or ELAN is commonly required. Excellent communication skills, reliability, and adaptability help individuals excel in collaborating with team members and managing varied data collection tasks. These skills ensure high-quality, accurate speech datasets critical for use in fields like speech recognition and language technology development.

More about Speech Data Collection jobs
What cities are hiring for Speech Data Collection jobs? Cities with the most Speech Data Collection job openings:
What are the most commonly searched types of Speech Data Collection jobs? The most popular types of Speech Data Collection jobs are:
What states have the most Speech Data Collection jobs? States with the most job openings for Speech Data Collection jobs include:
Infographic showing various Speech Data Collection job openings in the United States as of June 2026, with employment types broken down into 53% Full Time, 41% Part Time, and 6% Contract. Highlights an 94% In-person, and 6% Remote job distribution, with an average salary of $49,609 per year, or $23.9 per hour.
Data Collection

Data Collection

Hark

San Jose, CA • On-site

$150K - $250K/yr

Full-time

Posted 27 days ago


Job description

About Hark
Hark is an artificial intelligence company building advanced, personalized intelligence. One that is proactive, multimodal, and capable of interacting with the world through speech, text, vision, and persistent memory.
We're pairing that intelligence with next-generation hardware to create a universal interface between humans and machines. While today's AI largely operates through chat boxes and decade-old devices, Hark is focused on what comes next: agentic systems that interact naturally with people and the real world.
To get there, we're developing multimodal models and next-generation AI hardware together - designed from the ground up as a single, unified interface for a new era of intelligent systems.
About the Role
You'll own data collection at Hark - the programs, the vendors, and the pipelines that turn raw signal into training data our models can actually learn from.
That means running end-to-end campaigns across human feedback, synthetic data, and product-embedded signals. The quality of what we collect shapes the quality of what we ship, and this role owns that loop.
This is a high-ownership role on a small team. You'll work directly with researchers, engineers, and external partners, and the data you deliver will directly influence how our models behave in the real world.
Responsibilities
  • Design and run data collection programs end-to-end - scoping requirements, writing instructions, defining success criteria, and driving execution with vendors and annotators.
  • Manage external vendor relationships. Be the primary interface between Hark and data partners, keeping quality high and timelines on track.
  • Assess collected data using internal tooling, identify quality issues, and feed clear, actionable feedback back to vendors and annotators.
  • Collaborate closely with model researchers and engineers to understand what data is needed, translate that into operational plans, and deliver.
  • Track program metrics, surface insights, and drive continuous improvements to quality, throughput, and process.
  • Identify gaps in tooling and workflows and propose concrete improvements.

Requirements
  • Operational excellence. You can manage multiple programs simultaneously, keep track of details under pressure, and bring structure to fast-moving situations.
  • Experience working with external vendors or contractors. You know how to set expectations, manage relationships, and hold partners accountable to quality.
  • A knack for data. You've gone beyond surface-level metrics - you dig in, find patterns, and use what you find to make things better.
  • Strong communication. You can translate between research requirements and operational reality, and you keep everyone aligned without letting things slip.
  • Comfort with ambiguity and fast iteration. You take a rough problem, build a process around it, get feedback, and tighten it quickly.
  • Genuine curiosity about AI. You don't need to be an ML researcher, but you care about how models learn and why data quality matters.
  • 2+ years of relevant experience in data operations, program management, or a related field.

Bonus Qualifications
  • Experience managing human feedback or preference data programs.
  • Familiarity with data annotation platforms or labeling pipelines.
  • Experience with synthetic data generation or evaluation dataset design.
  • Background working at a fast-moving AI or research-driven company.

Compensation
The US base salary range for this full-time position is between $150,000 - $250,000 annually.
The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.