1

Audio Data Collection Jobs (NOW HIRING)

next page

Showing results 1-20

Audio Data Collection information

See salary details

$16

$25

$31

How much do audio data collection jobs pay per hour?

As of Jun 7, 2026, the average hourly pay for audio data collection in the United States is $25.31, according to ZipRecruiter salary data. Most workers in this role earn between $23.32 and $25.96 per hour, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive in Audio Data Collection, and why are they important?

To thrive in Audio Data Collection, you need attention to detail, familiarity with audio recording techniques, and a basic understanding of data management principles, often supported by experience or training in linguistics, audio engineering, or related fields. Proficiency with digital audio workstations (DAWs), recording devices, annotation tools, and sometimes knowledge of speech recognition software is typically required. Strong organizational skills, clear communication, and the ability to follow detailed instructions are valuable soft skills. These skills ensure the collection of high-quality, accurate audio datasets essential for training and improving speech and language technologies.

What is audio data collection?

Audio data collection is the process of gathering audio recordings, such as speech, environmental sounds, or specific language samples, for use in research or to train machine learning models. This work is essential for developing technologies like speech recognition, voice assistants, and language translation tools. Professionals in this field may record, label, and organize audio samples according to project requirements, ensuring high-quality and diverse datasets. The collected data helps improve the accuracy and reliability of audio-based AI systems.

What is the difference between Audio Data Collection vs Audio Transcription Specialist?

AspectAudio Data CollectionAudio Transcription Specialist
CredentialsBasic audio recording skills, attention to detailTyping proficiency, language skills, transcription experience
Work EnvironmentField or remote recording sessions, data labeling platformsOffice or remote, transcription software
Industry UsageData annotation, AI training, speech recognitionContent creation, documentation, subtitles

Audio Data Collection involves gathering and recording audio samples for AI and machine learning purposes, focusing on capturing high-quality sound data. In contrast, Audio Transcription Specialists convert audio recordings into written text, emphasizing accuracy and language skills. While both roles require attention to detail, they serve different stages in audio data processing and are used in distinct workflows within the industry.

What are some common challenges faced in an Audio Data Collection role, and how can they be addressed?

One common challenge in Audio Data Collection is ensuring high-quality, diverse audio samples that meet the project's linguistic and acoustic requirements. Background noise, inconsistent recording environments, and variations in speaker accents can affect data quality. To address these challenges, professionals often use standardized recording protocols, provide clear instructions to participants, and utilize specialized software for quality control. Regular collaboration with linguists, data annotators, and project managers also helps maintain data integrity and project timelines.
What cities are hiring for Audio Data Collection jobs? Cities with the most Audio Data Collection job openings:
What states have the most Audio Data Collection jobs? States with the most job openings for Audio Data Collection jobs include:

Software Engineer, Data Infrastructure & Acquisition - Huntsville, AL, USA

Speechify

Huntsville, AL โ€ข On-site

$112K - $135K/yr

Other

Posted 7 days ago


Job description

The mission of Speechify is to make sure that reading is never a barrier to learning.

Over 50 million people use Speechify's text-to-speech products to turn whatever they're reading - PDFs, books, Google Docs, news articles, websites - into audio, so they can read faster, read more, and remember more. Speechify's text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App.ย Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity.ย ย 

Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting - Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.

Overview

We're looking to hire for our Data side of our AI team at Speechify. This role is responsible for all aspects of data collection to support our model training operations. We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. We are looking for a skilled Software Engineer to join us.

What You'll Do

  • Be scrappy to find new sources of audio data and bring it into our ingestion pipeline
  • Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
  • Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
  • Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team's dataset roadmap to power Speechify's next-generation consumer and enterprise products.

An Ideal Candidate Should Have

  • BS/MS/PhD in Computer Science or a related field.
  • 5+ years of industry experience in software development.
  • Proficiency with bash/Python scripting in Linux environments
  • Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
  • Experience with web crawlers, large-scale data processing workflows is a plus
  • Ability to handle multiple tasks and adapt to changing priorities.
  • Strong communication skills, both written and verbal.

What we offer

  • A fast-growing environment where you can help shape the company and product.
  • An entrepreneurial-minded team that supports risk, intuition, and hustle.
  • A hands-off management approach so you can focus and do your best work.
  • An opportunity to make a big impact in a transformative industry.
  • Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture.
  • Opportunity to work on a life-changing product that millions of people use.
  • Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more.
  • Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio.

Compensation:ย Theย United States base salaryย range for this full-time position is $140,000-$200,000 + bonus + equity depending on experience

Think you're a good fit for this job?ย 

Tell us more about yourself and why you're interested in the role when you apply.
And don't forget to include links to your portfolio and LinkedIn.

Not looking but know someone who would make a great fit?ย 

Refer them!ย 

Speechify is committed to a diverse and inclusive workplace.ย 

Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.