Fault Management Engineer Jobs (NOW HIRING)

... fault management logic and contingency responses. * Communicate concise technical updates on ... engineering teams, mission management, and stakeholders. What you bring to this role: * Hands-on ...

Staff Systems Engineer Autonomy

Palo Alto, CA · On-site

$206.50K - $258.10K/yr

... fault management, and communication management * Author and maintain platform service ... Collaborate with the hardware engineering team on compute architecture, resource allocation, and ...

Provide fault management for the network and support performance management functions. * Respond ... Network engineering and design experience specific to CAN/LAN. Experience should include ...

Software Systems Engineer

Westlake, TX

$166.10K - $196.90K/yr

Design and develop Fault Management Solutions using the Netexpert VSM application. Develop alarming ... Utilize programming skills to build (code) and test new alarm functionality per technical ...

Fault Management Engineer information

Salary range: $29.5K (low), $111.1K (average), $183.5K (high)

How much do fault management engineer jobs pay per year?

As of May 29, 2026, the average yearly pay for a fault management engineer in the United States is $111,144, according to ZipRecruiter salary data. Most workers in this role earn between $75,500 and $143,000 per year, depending on experience, location, and employer.

What are the key skills and qualifications needed to thrive as a Fault Management Engineer, and why are they important?

To thrive as a Fault Management Engineer, you need a solid understanding of networking principles and troubleshooting methodologies, plus a relevant degree in engineering or information technology. Employers typically require familiarity with network management systems (NMS), SNMP, and fault monitoring tools like Nagios or SolarWinds, along with certifications such as CCNA or CompTIA Network+. Analytical thinking, attention to detail, and effective communication are crucial soft skills for diagnosing issues and coordinating resolutions. Together, these skills support quick identification and resolution of network faults, which maintains system reliability and minimizes downtime.

How does a Fault Management Engineer typically collaborate with other IT teams during incident resolution?

A Fault Management Engineer works closely with network operations, system administrators, and support teams to swiftly identify and resolve system faults. During incidents, they coordinate troubleshooting efforts, communicate findings, and escalate issues to specialized teams when necessary. This collaboration ensures minimal downtime and helps maintain service reliability. Effective communication and teamwork are essential, as engineers often participate in cross-functional meetings and post-incident reviews to improve future response strategies.

What does a Fault Management Engineer do?

A Fault Management Engineer is responsible for monitoring, detecting, and resolving faults or issues within a network or system to ensure optimal performance and minimal downtime. They use specialized tools to identify problems, analyze incident reports, and coordinate with technical teams for quick resolution. Their duties often include implementing automated monitoring solutions, performing root cause analysis, and documenting incidents to prevent future occurrences. Overall, they play a crucial role in maintaining the reliability and efficiency of IT infrastructure.

What is the difference between a Fault Management Engineer and a Network Operations Center (NOC) Technician?

Aspect | Fault Management Engineer | Network Operations Center (NOC) Technician
Certifications | Network+ or CCNA, fault management certifications | Network+ or CCNA, basic troubleshooting certifications
Work Environment | Design, analyze, and resolve network faults, often in a technical or engineering setting | Monitor network performance, respond to alerts, and perform troubleshooting in a control room
Employer & Industry | Telecom, ISPs, large enterprise networks | Telecom, ISPs, data centers, enterprise IT

Fault Management Engineers focus on diagnosing and resolving complex network faults, often working on system design and analysis. NOC Technicians monitor network health and handle routine troubleshooting. Both roles are essential in maintaining network reliability but differ in scope and responsibilities.

More about Fault Management Engineer jobs

Flight Operations Engineer

E-Space

Saratoga, CA

Full-time

Posted 15 days ago


Job description

Ready to make connectivity from space universally accessible, secure, and actionable? Then you've come to the right place!
 
At E-Space, we're focused on bridging Earth and space with the world's most sustainable low Earth orbit (LEO) satellite network. We're a team of bold thinkers, ambitious leaders and dynamic doers, and we're disrupting NewSpace by fundamentally changing the design of legacy LEO space systems to deliver entirely new satellite capabilities at a fraction of the cost.

We're intentional, we're unapologetically curious and we're 100% committed to saving space, to protecting our planet and to turning connectivity into actionable intelligence.
What you will be doing:
  • Operate spacecraft using real-time and scheduled commands, monitor telemetry and subsystem performance, and ensure system safety and mission success through continuous situational awareness and anomaly detection.
  • Conduct deep-dive analysis of spacecraft telemetry to identify trends, diagnose off-nominal behavior, and support root-cause investigations of anomalies across all subsystems. 
  • Play a critical role in spacecraft integration and test campaigns including functional testing, Day-In-The-Life (DITL), and end-to-end mission rehearsals to ensure operational readiness. 
  • Develop and execute operational scripts and automation tools. Leverage Python and Bash scripting to build and maintain automation pipelines for telemetry analysis, command sequence validation, and alert generation.  
  • Contribute to the enhancement of ground software and data visualization dashboards. 
  • Collaborate with cross-disciplinary engineering teams to develop and refine CONOPS (Concept of Operations), design test cases, verify procedures, and implement fault mitigation strategies. 
  • Participate in Failure Modes and Effects Analysis (FMEA), simulate fault conditions, and conduct fault injection testing to validate spacecraft fault management logic and contingency responses. 
  • Communicate concise technical updates on spacecraft performance, testing progress, and anomalies to engineering teams, mission management, and stakeholders. 
What you bring to this role:
  • Hands-on experience in spacecraft operations, including functional testing, telemetry validation, and operational procedure development. 
  • Expertise in Failure Modes and Effects Analysis (FMEA), fault injection, and validation of fault management logic. 
  • 2-4 years of experience developing automation tools and data pipelines (telemetry analysis, command validation, system monitoring) using Python. 
  • Proficiency in scripting languages for process automation, data parsing, and mission operations support. 
  • Exposure to high volume data processing systems, telemetry decoding, or real-time monitoring frameworks.
  • Strong documentation and communication skills to write clear procedures, technical reports, and deliver concise updates to cross-disciplinary teams. 
  • Demonstrated ability to maintain situational awareness, make data-driven decisions under pressure, and contribute to mission success in dynamic operational environments. 
Bonus points:
  • Prior spacecraft or satellite operations experience (LEO, GEO, or deep-space missions). 
  • Experience working in Linux environments, with familiarity in telemetry processing pipelines, command and telemetry database management, and real-time monitoring tools. 
  • Familiarity with containerized environments (Docker, Kubernetes) and CI/CD integration for operational tool deployment. 
  • Experience querying and managing data from SQL/NoSQL databases and telemetry filestores. 
  • Experience developing or maintaining automation frameworks or real-time alerting systems. 
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.