1

Captioning Jobs in Georgia (NOW HIRING)

... captioning, clean plates, rotoscoping and masking, noise reduction, resizing, asset search and concept exploration to streamline workflows and increase output quality and speed. • Use strong ...

Operations Manager (ATL)

Atlanta, GA · On-site

$70K - $82K/yr

... captioning or using a sign language interpreter, or using specialized equipment. We are committed to a transparent and secure hiring process. All communications related to this role will come ...

... captioning or using a sign language interpreter, or using specialized equipment. We are committed to a transparent and secure hiring process. All communications related to this role will come ...

... captioning or using a sign language interpreter, or using specialized equipment. We are committed to a transparent and secure hiring process. All communications related to this role will come ...

Leverage AI tools for tasks such as rough cuts, transcription, captioning, clean plates, rotoscoping and masking, noise reduction, resizing, asset search and concept exploration to streamline ...

next page

Showing results 1-20

Captioning information

See Georgia salary details

$11.4K

$55.2K

$95K

How much do captioning jobs pay per year?

As of Jun 30, 2026, the average yearly pay for captioning in Georgia is $55,214.00, according to ZipRecruiter salary data. Most workers in this role earn between $41,000.00 and $63,300.00 per year, depending on experience, location, and employer.

What is captioning and what does a captioner do?

Captioning is the process of converting spoken dialogue and sounds in videos, television programs, or live events into written text that appears on the screen. Captioners listen to audio and transcribe it accurately, often including non-verbal sounds and speaker identification to assist viewers who are deaf or hard of hearing. Their work ensures content is accessible to a wider audience and may involve real-time (live) or offline (pre-recorded) captioning. Captioners must have excellent listening, typing, and language skills.

What Are the Qualifications to Get a Job in Captioning?

The primary qualifications for a job in captioning are a high school diploma or GED certificate and excellent communication skills. Employers prefer applicants who have call center experience, but this is not necessary for most roles. Performing the duties of a captioning job requires excellent short-term memory, fast and accurate typing skills, and the ability to communicate effectively through speech and text. Most captioning jobs are largely independent, so the ability to work well with minimal supervision is essential to success.

Do captioning jobs still exist?

Yes, captioning jobs still exist and involve creating text for videos to improve accessibility. These roles often require skills in transcription, familiarity with captioning software, and attention to detail. Captioning can be done remotely and may involve live or pre-recorded content.

What is the difference between Captioning vs Transcription?

AspectCaptioningTranscription
Required CredentialsOften requires certification in captioning or related trainingMay require general transcription skills, sometimes certification
Work EnvironmentLive or pre-recorded media, TV, online videosAudio or video files, various industries
Industry UsageBroadcast, media, education, accessibility servicesLegal, medical, business, media

Captioning and transcription both involve converting audio to text, but captioning focuses on real-time or synchronized text for media accessibility, while transcription involves creating a written record of audio content for various purposes. Captioning typically requires specialized skills and certifications for media synchronization, whereas transcription emphasizes accuracy across different industries.

Can I get paid to caption videos?

Yes, captioning jobs are paid positions where individuals transcribe or synchronize text with video content. Payment varies based on factors such as experience, project complexity, and whether the work is freelance or employed by a company; some captioners work remotely using specialized software and may need to pass skills assessments.

How much does captioning pay?

Captioning jobs typically pay between $10 and $30 per hour, depending on experience, the type of content, and whether the work is freelance or employed. Professional captioners often earn higher rates with specialized skills or certifications, and some work on a per-project basis or through platforms that set their own rates.

How do I become a captioner?

To become a captioner, you typically need strong typing skills, proficiency in captioning software, and a good understanding of grammar and punctuation. Many employers require a high school diploma or equivalent, and some may prefer certification in captioning or related fields. Gaining experience through training programs or freelance work can also improve job prospects.

What are the key skills and qualifications needed to thrive as a Captioner, and why are they important?

To thrive as a Captioner, you need excellent listening skills, fast and accurate typing abilities, and a strong command of grammar and spelling, often supported by relevant training or coursework. Familiarity with captioning software, speech recognition tools, and transcription systems is commonly required. Attention to detail, time management, and the ability to concentrate for extended periods are crucial soft skills for this role. These skills ensure that captions are accurate, timely, and accessible, which is vital for effective communication and inclusivity.

What are some common challenges faced by captioners, and how can they be managed on the job?

Captioners often face challenges such as keeping up with fast-paced speech, distinguishing between overlapping voices, and ensuring accuracy under tight deadlines. To manage these, strong listening skills, attention to detail, and proficiency with transcription software are essential. Many captioners also develop shorthand techniques and use specialized tools to improve real-time typing speed. Regular practice and staying updated on industry tools can help overcome these hurdles and maintain high-quality captions.
What are the most commonly searched types of Captioning jobs in Georgia? The most popular types of Captioning jobs in Georgia are:
What are popular job titles related to Captioning jobs in Georgia? For Captioning jobs in Georgia, the most frequently searched job titles are:
What cities in Georgia are hiring for Captioning jobs? Cities in Georgia with the most Captioning job openings:
Infographic showing various Captioning job openings in Georgia as of June 2026, with employment types broken down into 69% Full Time, 26% Part Time, and 5% Contract. Highlights an 80% In-person, and 20% Remote job distribution, with an average salary of $55,214 per year, or $26.5 per hour.

Senior Multimodal AI Researcher, Audio

Dolby

Atlanta, GA • On-site

Full-time

Posted 19 days ago


Key responsibilities

  • Partner closely with other domain experts to refine and execute Dolby's technical strategy in artificial intelligence and machine learning.

  • Use deep learning to create new solutions and enhance existing applications.

  • Push the state-of-the-art and develop intellectual property.


Job description

Join the leader in entertainment innovation and help us design the future. At Dolby, science meets art, and high tech means more than computer code. As a member of the Dolby team, you'll see and hear the results of your work everywhere, from movie theaters to smartphones. We continue to revolutionize how people create, deliver, and enjoy entertainment worldwide. To do that, we need the absolute best talent. We're big enough to give you all the resources you need, and small enough so you can make a real difference and earn recognition for your work. We offer a collegial culture, challenging projects, and excellent compensation and benefits, not to mention a Flex Work approach that is truly flexible to support where, when, and how you do your best work.
The Advanced Technology Group (ATG) is the research division of the company. ATG's mission is to look ahead, deliver insights, and innovate technological solutions that will fuel Dolby's continued growth. Our researchers have a broad range of expertise related to computer science and electrical engineering, such as AI/ML, algorithms, digital signal processing, audio engineering, image processing, computer vision, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.
Dolby is looking for a talented Senior Multimodal AI Researcher, Audio to join Dolby's research efforts and drive innovation in multimodal AI for audio applications, multimodal representations, and generative modeling for audio, speech, and music. You will join the Machine Reasoning and Perception team to join a team of top-tier researchers working on challenging problems in multimodal AI for entertainment applications. You will focus on the creation and implementation of multimodal and audio AI technologies from the underlying theoretical concepts to the development of prototypes and demonstrations, with the goal to create new experiences.
You will drive key innovations for Dolby's core business which allow Dolby and its customers to build products that push the boundaries of sound and multimedia experiences.
Summary
You will push the boundaries of the state-of-the-art in audio and multimodal technologies. The ideal candidate would have a strong background in deep learning, both in terms of conceptual understanding, as well as practical experience, with previous exposure to audio applications. A core aspect of this role involves being able to keep up to date with the literature, implement, and innovate with the bleeding edge in generative models, self-supervised learning, and multi-modal learning.
With the explosion of large language models and natural language processing, you will partner closely with Dolby's worldwide AI research staff, which actively pursues the integration of such models into audio and media experiences. You will be able to hit the ground running, innovate, and contribute to such projects. Consequently, experience with language models, question answering, vision-language models, captioning, etc. would be highly beneficial.
We are looking for candidates with experience in any of the following:
  • Generative modeling for audio applications (diffusion models, autoregressive models, masked generative transformers).
  • Multimodal semantic understanding and multimodal reasoning.
  • Multimodal representations (audio-video, audio-text, audio-video-text).
  • Multimodal AI architectures, with a focus on generating audio, music, and speech (text-to-audio, video-to-audio, image-to-audio).
  • Self and semi-supervised learning.
  • AI driven audio enhancement, processing, and generation (for speech and music), such as speech enhancement and analysis, source separation, text-to-speech, text-to-music, music information retrieval, audio classification.
  • LLMs for audio applications.

What You Will Accomplish
  • Partner closely with other domain experts to refine and execute Dolby's technical strategy in artificial intelligence and machine learning.
  • Use deep learning to create new solutions (including foundation models) and enhance existing applications.
  • Push the state-of-the-art and develop intellectual property.
  • Transfer technology to product groups.
  • Establish research collaborations with external university partners.
  • Mentor interns on novel research problems.
  • Publish papers in top-tier conferences and journals.
  • Advise internal leaders on recent deep learning advancements in the industry and academia to further influence research direction and business decisions.

Key Requirements
  • Ph.D. in Computer Science or similar field.
  • A strong background in deep learning, both in terms of conceptual understanding, as well as practical experience.
  • Technical knowledge of audio fundamentals.
  • Deep passion for audio, music, and multimedia applications.
  • Deep knowledge on current machine learning literature.
  • Strong publication record, with publications in major machine learning conferences (e.g. NeurIPS, ICLR, ICML) or top domain-specific conferences is desirable (e.g., ACL, CVPR, ICASSP, Interspeech).
  • Highly skilled in Python and one or more popular deep learning frameworks (TensorFlow or PyTorch).
  • Ability to envision new technologies and turn them into innovative products.
  • Good communication and collaboration skills.

Learn more about our innovative research: https://www.dolby.com/about/innovation/empowering/
The Atlanta Area base salary range for this full-time position is $140,700-$170,000 , which can vary if outside this location,plus bonus, benefits, and some roles may also include equity. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, competencies, experience, market demands, internal parity, and relevant education or training. Your recruiter can share more about the specific salary range and perks and benefits for your location during the hiring process.
Dolby will consider qualified applicants with criminal histories in a manner consistent with the requirements of San Francisco Police Code, Article 49, and Administrative Code, Article 12
Equal Employment Opportunity:
Dolby is proud to be an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making employment decisions without regard to race, religious creed, color, age, sex, sexual orientation, gender identity, national origin, religion, marital status, family status, medical condition, disability, military service, pregnancy, childbirth and related medical conditions or any other classification protected by federal, state, and local laws and ordinances.