Failure Analysis Manager Jobs (NOW HIRING)

... management. We're committed to making clean, green energy the primary power source for homes ... We are looking for a Failure Analysis Engineer who loves digging into tough hardware problems ...

Failure Analysis Engineer Join TSMC Arizona, a world-class leader in semiconductor manufacturing ... Capable of working independently and managing tasks to meet deadlines effectively. Interpersonal ...

Failure Analysis Manager information

Salary scale: $29K, $94.9K (average), $155.5K

How much do failure analysis manager jobs pay per year?

As of Jun 28, 2026, the average yearly pay for a Failure Analysis Manager in the United States is $94,901, according to ZipRecruiter salary data. Most workers in this role earn between $66,000 and $119,000 per year, depending on experience, location, and employer.

Why does failure hurt so much?

Failure Analysis Managers often deal with setbacks and errors in identifying root causes, which can lead to emotional distress due to the importance of accuracy and accountability in their role. Understanding failure helps improve processes and prevent future issues, but the emotional impact can be significant when mistakes occur or when high stakes are involved. Developing resilience and analytical skills can help manage the emotional response to failure in this field.

What is another word for failure?

In the context of a Failure Analysis Manager, other words for failure include defect, malfunction, and breakdown. These terms describe the inability of a component or system to perform as intended, which the manager investigates to determine root causes and prevent recurrence. Using these terms precisely helps in documenting and analyzing issues during failure investigations.

What are some common challenges faced by Failure Analysis Managers, and how are they addressed in the role?

Failure Analysis Managers often encounter the challenge of diagnosing complex and intermittent failures that require a methodical, multidisciplinary approach. Balancing the need for thorough investigation with tight production timelines can also be demanding. To address these issues, managers lead teams of skilled engineers and technicians, utilize advanced analytical tools, and coordinate closely with design, quality, and manufacturing departments. By fostering open communication and continuous learning, they help develop faster, more effective root cause analyses that support robust product improvements.

What is the definition of failure?

In the context of a Failure Analysis Manager, failure refers to the inability of a product, component, or system to perform its intended function due to defects, material issues, or design flaws. Identifying and understanding failure modes is essential for improving quality and reliability through testing, inspection, and root cause analysis. This process often involves tools like failure mode and effects analysis (FMEA) and requires technical expertise to determine causes and prevent recurrence.

What is a Failure Analysis Manager job?

A Failure Analysis Manager oversees the investigation of product or system failures to determine root causes and implement corrective actions. They lead a team of engineers and analysts, collaborate with design and manufacturing teams, and use diagnostic tools to improve reliability. Their goal is to prevent recurring failures, enhance product quality, and support continuous improvement initiatives.

What are the key skills and qualifications needed to thrive in the Failure Analysis Manager position, and why are they important?

To thrive as a Failure Analysis Manager, you need a strong background in engineering or materials science, experience with root cause analysis methodologies, and a solid understanding of failure modes in manufacturing or electronics. Familiarity with technical tools such as scanning electron microscopes (SEM) and failure analysis software is highly valued, as are Six Sigma or ASQ certifications. Excellent problem-solving, leadership, and communication skills help you guide cross-functional teams and present findings effectively. These abilities are critical to ensuring product reliability, minimizing downtime, and driving continuous improvement within an organization.

How do you define failure?

In a Failure Analysis Manager role, failure is defined as the inability of a product, component, or system to perform its intended function due to defects, material issues, or design flaws. Identifying and understanding failure modes often involves techniques like root cause analysis and testing to prevent recurrence and improve reliability.
More about Failure Analysis Manager jobs
Infographic: Failure Analysis Manager job openings in the United States as of June 2026. Employment type: 100% full-time. Work setting: 100% in-person. Average salary: $94,901 per year, or $45.6 per hour.
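The annual and hourly figures above are consistent with a simple conversion. The sketch below assumes a full-time year of 2,080 hours (40 hours per week for 52 weeks); the listing does not state how the hourly figure was derived, so that divisor and the helper name `annual_to_hourly` are illustrative assumptions only.

```python
# Assumption (not stated in the listing): a full-time year is
# 40 hours/week * 52 weeks = 2,080 paid hours.
HOURS_PER_WEEK = 40
WEEKS_PER_YEAR = 52


def annual_to_hourly(annual_salary: float) -> float:
    """Convert an annual salary to an hourly rate under the assumption above."""
    return annual_salary / (HOURS_PER_WEEK * WEEKS_PER_YEAR)


hourly = annual_to_hourly(94_901)
print(f"${hourly:.2f} per hour")
```

Under that assumption the result rounds to the same $45.6 per hour shown in the infographic. A different hours-per-year basis would shift the hourly figure.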
Failure Analysis Engineer - Power & Design

Advanced Micro Devices, Inc

Secaucus, NJ • On-site

$114K/yr

Full-time

Posted 24 days ago


Advanced Micro Devices rating

Company rating: 8.4 out of 10

Based on 7 frontline employees who took The Breakroom Quiz

22nd of 139 rated electronics manufacturers


Job description

WHAT YOU DO AT AMD CHANGES EVERYTHING
At AMD, our mission is to build great products that accelerate next-generation computing experiences, from AI and data centers to PCs, gaming, and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges, striving for execution excellence while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
THE ROLE:
The Quality Engineering team is looking for an experienced Failure Analysis Engineer - Power & Design with strong expertise in board architecture, failure isolation, and rail bring-up. This individual will support customer and factory failure investigations for GPU accelerators, with primary ownership of PCB triage and board-level fault isolation. They will review schematics, layouts, and power architecture to develop targeted debug strategies, run diagnostics and functional test DOEs to reproduce and isolate failures, and work closely with design, validation, FW, and manufacturing teams to accelerate root cause analysis and corrective actions. Your contributions will directly impact product quality, reliability, and customer satisfaction.
THE PERSON:
The ideal candidate is a hands-on engineer with a strong electrical engineering foundation and deep experience in hardware design, board bring-up, and electrical debug. They bring a strong analytical mindset and are skilled at triaging complex PCB failures by narrowing issues to the board, component, rail, or system interaction level. They are comfortable running diagnostics and designing functional test DOEs to reproduce and isolate hard-to-find failures, while working effectively across design, validation, manufacturing, and repair teams. Their communication and documentation skills enable clear reporting and collaboration, and their curiosity and persistence help drive timely, high-quality root cause analysis and corrective actions.
KEY RESPONSIBILITIES:
  • Support internal and external requests to troubleshoot AMD GPU product failures with primary focus on PCB triage, power delivery debug, and board-level failure isolation for continuous yield, quality, and customer support improvements.

  • Develop and execute diagnostics and functional test DOEs to reproduce, characterize, and isolate difficult board- and power-related failures.

  • Develop automation and tools to run tests and analyze results and logs.

  • Perform structured PCB triage by narrowing failures to the board, component, power rail, layout interaction, or system integration level, and work with the contract manufacturer and internal AMD teams to reproduce failures, isolate root cause, and determine the most effective next steps for debug and corrective action.

  • Use board schematics, layout data, and power delivery design knowledge to understand circuit behavior, trace power and signal paths, form debug hypotheses, and build targeted validation plans that drive efficient fault isolation and high-quality failure analysis.

  • Document all findings in the FA database and create a complete failure analysis report for customers as needed.

  • Present findings to key stakeholders, including senior management.

  • Continuously improve failure analysis processes and techniques, and document the procedures to follow.

  • Oversee the set-up of new products and test stations for Failure Analysis operations.

PREFERRED EXPERIENCE:
  • Deep expertise in electrical engineering fundamentals, PCBA design, power delivery architecture, and hardware debug, including diagnostics and functional test development.

  • Skilled in using lab equipment (oscilloscopes, logic analyzers, power analyzers, and custom test tools) for board bring-up, rail validation, and hardware debug.

  • Strong background in PCB triage, board-level failure analysis, PCBA diagnostics, power delivery debug, and failure isolation techniques from NPI through production.

  • Proficient in Python, shell scripting, and working across Windows and Linux environments.

  • Solid understanding of firmware, drivers, and hardware interactions, with the ability to tune firmware as needed.

  • Extensive experience in hardware verification and system integration.

  • Hands-on experience assembling, installing, and configuring computer systems and servers.

  • Strong communication, documentation, collaboration, and presentation skills.

  • Able to read schematics, interpret datasheets, identify components, and perform soldering/rework to support efficient hardware debug and failure isolation.

  • Knowledge of high-speed digital design, power delivery networks, voltage regulator behavior, memory interfaces (HBM, GDDR), PCIe, and display outputs (DP, HDMI).

  • Experience with GPU data center infrastructure and AI/ML technologies is a plus.

ACADEMIC CREDENTIALS:
  • Bachelor's degree in Electrical Engineering, Computer Engineering, or a related field.

This role is not eligible for visa sponsorship.
Benefits offered are described in "AMD benefits at a glance."
AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or third-party affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
AMD may use Artificial Intelligence to help screen, assess or select applicants for this position. AMD's "Responsible AI Policy" is available here.
This posting is for an existing vacancy.