1

Audio Perception Research Jobs (NOW HIRING)

We are seeking a contract applied research scientist specializing in audio perception. This role is an integral part of our team, contributing to our research and development efforts. The ideal ...

Audio Systems Engineer

San Francisco, CA · On-site

$175K - $280K/yr

Research and implement advanced technologies to optimize audio system integration. * Continuously ... Familiarity with human perception of sound. * Experience with psycho-acoustic metrics and ...

The Advanced Technology Group (ATG) is the research division of the company. ATG's mission is to ... processing, audio engineering, image processing, computer vision, data science & analytics ...

next page

Showing results 1-20

Audio Perception Research information

See salary details

$37K

$106K

$142.5K

How much do audio perception research jobs pay per year?

As of Jun 7, 2026, the average yearly pay for audio perception research in the United States is $106,012.00, according to ZipRecruiter salary data. Most workers in this role earn between $104,000.00 and $104,000.00 per year, depending on experience, location, and employer.

What are some common challenges faced by professionals in audio perception research, and how can they be addressed?

Professionals in audio perception research often encounter challenges such as recruiting diverse participant groups, designing robust experimental protocols, and interpreting subjective data. Overcoming these challenges typically involves collaborating closely with multidisciplinary teams, including engineers, psychologists, and statisticians, to refine methodologies and ensure accurate data analysis. Staying updated on the latest research tools and ethical guidelines also helps researchers maintain high standards and adapt to evolving technologies in the field.

What is audio perception research?

Audio perception research is the scientific study of how humans interpret and understand sounds. Researchers in this field investigate topics such as how we recognize speech, distinguish different musical notes, and perceive spatial locations of sounds. The goal is to better understand the processes and mechanisms by which the brain processes auditory information, which can inform developments in audio technology, hearing aids, and treatments for hearing disorders. This research often combines psychology, neuroscience, engineering, and computer science.

What are the key skills and qualifications needed to thrive as an Audio Perception Researcher, and why are they important?

To thrive as an Audio Perception Researcher, you generally need a strong background in psychology, neuroscience, or acoustics, often supported by an advanced degree such as a Master's or Ph.D. Familiarity with audio analysis software (e.g., MATLAB, Praat), experimental design, and statistical tools is essential, as is experience with laboratory equipment for audio testing. Critical thinking, attention to detail, and effective communication help researchers design robust studies and present findings clearly. These skills are crucial for generating reliable insights into auditory perception, advancing scientific understanding, and informing technology or product development.

What is the difference between Audio Perception Research vs Audio Signal Processing Engineer?

AspectAudio Perception ResearchAudio Signal Processing Engineer
CredentialsTypically requires a degree in psychology, audiology, or neuroscienceUsually holds a degree in electrical engineering, computer science, or related fields
Work EnvironmentResearch labs, universities, or industry R&D departments focused on human perceptionTechnology companies, audio hardware/software development, and signal processing labs
Industry UsageUsed in developing better audio devices, hearing aids, and understanding human hearingDesigning audio algorithms, improving sound quality, and developing audio hardware

Audio Perception Research focuses on understanding how humans perceive sound, often involving behavioral studies and neuroscience. In contrast, Audio Signal Processing Engineers develop algorithms and hardware to manipulate audio signals. Both roles are essential in the audio industry but serve different purposes related to human perception versus technical signal manipulation.

Infographic showing various Audio Perception Research job openings in the United States as of May 2026, with employment types broken down into 11% Internship, 58% Full Time, 21% Part Time, 5% Temporary, and 5% Contract. Highlights an 89% In-person, and 11% Remote job distribution, with an average salary of $106,012 per year, or $51 per hour.

Applied Researcher, Audio Understanding

Cartesia, Inc.

San Francisco, CA • On-site

Full-time

Medical, Dental, Vision, Retirement, PTO

This job post has expired 1 day ago. Applications are no longer accepted.


Job description

About Cartesia Our mission is to architect AI that learns from and interacts with the world like humans do. We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models or SSMs, a new primitive for training efficient, large-scale foundation models. Our team combines deep expertise in model innovation and systems engineering paired with a design-minded product engineering team to build and ship cutting edge models and experiences. We're funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We're fortunate to have the support of many amazing advisors, and 90+ angels across many industries, including the world's foremost experts in AI. About the Role As a Senior Applied Researcher in Audio Understanding, you will be responsible for tackling the most challenging problems in audio perception. Your work will go beyond traditional speech recognition to encompass the full spectrum of audio perception, from identifying speakers and interpreting emotion to understanding complex acoustic environments. You will lead high-impact projects that are critical to our mission of building truly aware AI. Your Impact
  • Architect and develop novel, large-scale models for complex audio understanding tasks, including multi-speaker ASR, diarization, and non-speech audio classification and deploy them to production at scale.
  • Pioneer research in areas like self-supervised learning for audio, few-shot learning, and robust audio-visual perception.
  • Set new standards for how we evaluate and benchmark our audio understanding systems.
  • Build large scale pre-training and fine-tuning datasets for audio understanding capabilities.
What You Bring
  • Deep expertise in ASR, audio understanding, language modeling, or generative modeling more broadly.
  • Experience with large-scale training, GPU/TPU acceleration, and model optimization.
  • Strong applied mindset-able to balance scientific novelty with product impact.
More Details In-office policy: We're an in-person team based out of offices in S San Francisco, GB London and I Bangalore. We love being in the office, hanging out together, and learning from each other every day. Visa sponsorship: We provide visa sponsorship support and assess each circumstance on a case-by-case basis. However, visa sponsorship is dependent on many factors, including the role you are applying for, and the location you are going to be based, and so we can't always guarantee success. Your Recruiter will work with you to understand your visa sponsorship needs from the first call. We ship fast. All of our work is novel and cutting edge, and execution speed is paramount. We have a high bar, and we don't sacrifice quality or design along the way. We support each other. We have an open & inclusive culture that's focused on giving everyone the resources they need to succeed. Our Benefits (US Employees Only) Compensation. Competitive base salary alongside attractive equity package. Health Insurance. Fully covered medical insurance along with dental and vision for you and your family. 401(k) Commuter Allowance. A monthly stipend to help you get to and from the office. Flexible PTO. Take as much time as you need to recharge your batteries. Meals & Snacks. Lunch, dinner and plenty of snacks, provided daily. Your own personal Yoshi.