1

Audio Captioning Jobs (NOW HIRING)

... video/audio directing for town halls, commission meetings, and other major SEC events. - Manage live streaming/webcast support andfacilitatecoordination for captioning/sign-language services ...

Senior Applied Data Scientist

Boston, MA · On-site +1

$150K - $180K/yr

In short, you'll be a full-stack applied data scientist and software developer. 3Play Media has diverse business and technical challenges related to media transcription, captioning, audio description ...

Senior Applied Data Scientist

Boston, MA · On-site +1

$150K - $180K/yr

In short, you'll be a full-stack applied data scientist and software developer. 3Play Media has diverse business and technical challenges related to media transcription, captioning, audio description ...

Be Seen First

Videographer

Smyrna, GA · On-site

$20 - $25/hr

* Competent using video equipment--basic understanding of cameras, lighting, audio, and electrical ... Adding computer graphics, closed captioning, and special effects to footage * Preparing background ...

Edit video content to professional standards, incorporating motion graphics, animation, audio enhancement, captioning and platform specific formatting. * Write copy, captions, headlines and other ...

Edit video content to professional standards, incorporating motion graphics, animation, audio enhancement, captioning and platform specific formatting. * Write copy, captions, headlines and other ...

next page

Showing results 1-20

Audio Captioning information

See salary details

$29.5K

$84.5K

$171.5K

How much do audio captioning jobs pay per year?

As of Jun 21, 2026, the average yearly pay for audio captioning in the United States is $84,456.00, according to ZipRecruiter salary data. Most workers in this role earn between $50,000.00 and $113,000.00 per year, depending on experience, location, and employer.

What is an Audio Captioning job?

An Audio Captioning job involves listening to audio content and creating accurate text descriptions or captions. This may include transcribing speech, describing background sounds, and ensuring captions are synchronized with the audio. Professionals in this role work on media such as videos, podcasts, or live broadcasts to improve accessibility for individuals who are deaf or hard of hearing. Strong listening skills, attention to detail, and familiarity with captioning software are essential for success in this field.

What are some common challenges faced in Audio Captioning roles, and how can I prepare for them?

One of the most common challenges in audio captioning is dealing with poor audio quality, heavy accents, overlapping speech, or specialized terminology, which can make accurate transcription more difficult. To prepare for these challenges, it's important to have strong comprehension skills, practice with a variety of audio samples, and become comfortable using audio editing tools to enhance clarity. Many roles also require you to meet tight deadlines, so time management and the ability to maintain accuracy under pressure are crucial. Learning industry best practices and familiarizing yourself with the latest captioning software can also help you excel in this field.

What are the key skills and qualifications needed to thrive in the Audio Captioning position, and why are they important?

To thrive in Audio Captioning, you need excellent listening skills, strong command of grammar and spelling, and attention to detail, often supported by relevant training or experience in transcription or captioning. Familiarity with audio editing and captioning software such as Adobe Premiere Pro, Aegisub, or specialized captioning platforms is commonly required. Outstanding time management, focus, and the ability to work independently are key soft skills for this role. These skills ensure captions are accurate, delivered on deadline, and compliant with accessibility standards for various audiences.

More about Audio Captioning jobs
What states have the most Audio Captioning jobs? States with the most job openings for Audio Captioning jobs include:
Infographic showing various Audio Captioning job openings in the United States as of June 2026, with employment types broken down into 97% Full Time, 1% Temporary, and 2% Contract. Highlights an 91% Physical, 3% Hybrid, and 6% Remote job distribution, with an average salary of $84,456 per year, or $40.6 per hour.

English Language Specialist (Transcription)

ESRhealthcare and EXEC STAFF RECRUITERS

New York, NY • Remote

$20 - $30/hr

Full-time

Posted 23 hours ago


Job description

English Language Specialist (Transcription)

$20 - $30/hour

pay

Required Skills

English

Transcription

Captioning

Writting

Attention to Detail

Job Description

Job Title: English Language Specialist (Transcription)

Job Type: Contractor

Location: Remote

Job Summary: In this role, you'll apply your expertise to help train next-generation AI systems. Your work will shape how models learn, reason, and perform through high-quality, real-world input.

Key Responsibilities:

Transcribe English audio files with a high degree of accuracy and consistency.

Review and refine machine-generated transcripts, correcting errors in spelling, grammar, and context.

Add relevant metadata tags and annotations to transcripts to enhance data quality.

Ensure all transcriptions meet strict quality standards and client formatting guidelines.

Collaborate with the customer's team to clarify requirements and resolve ambiguities in source material.

Handle sensitive or confidential content with discretion and professionalism.

Maintain organized records of completed assignments and contribute to process improvements.

Required Skills and Qualifications:

Fluent proficiency in English, with exceptional grasp of spelling, grammar, and punctuation.

Extensive experience in transcription and captioning of English audio content.

Strong written and verbal communication skills, demonstrating care and precision in every task.

High attention to detail, especially when reviewing and correcting machine-generated transcripts.

Proficient in annotating and tagging metadata for language data sets.

Excellent time management skills and ability to meet tight deadlines in a remote work environment.

Comfort working independently while being responsive to feedback and updates from the customer's team.

Preferred Qualifications:

Prior experience working with AI, speech recognition, or machine learning projects.

Background in linguistics, English, or related language studies.

Familiarity with industry-leading transcription tools and platforms.