Utilize monitoring and alerting frameworks to identify issues, escalate appropriately, and ensure ... Exposure to data center monitoring * Experience documenting operational processes and maintaining ...
We are seeking a Data Center Engineer for 24x7 support of 600+ multi-vendor, global server ... Responsible for proactive monitoring of physical and virtual server environment within our data ...
Data Center Support
Port Washington, NY · On-site
The individual must demonstrate the ability to monitor NPD's high availability infrastructure while ... data center environmental conditions (HVACs, PDUs, UPS, generators), and addressing system alarms ...
Monitor data center systems and perform routine maintenance activities * Handle incidents and service requests while adhering to defined SLAs * Coordinate with vendors for hardware support ...
Data Center Operator
Barker, NY · On-site
$20 - $36/hr
This position is responsible for monitoring, maintaining, and troubleshooting data center infrastructure, including servers, network equipment, power systems, and environmental controls. The operator ...
Data Center Technician
McAllen, TX · On-site
Join the MDC Team as a Data Center Technician! At MDC Data Centers, we're looking for a hands-on ... Monitor infrastructure, escalate outages, and respond to incidents efficiently. * Perform routine ...
Proven ability to analyze data and generate actionable insights. * Strong understanding of security ... Weekend, holiday, and evening work may be required. Offering excellent benefits: * Paid Vacation ...
Data Center
$175K/yr
Manage day-to-day data center operations and environment monitoring and support * Effectively manage the change control process * A proven track record of successfully negotiating Service Agreements ...
Monitor data center capacity, power usage, and cooling efficiency to optimize operations. * Track ... Support on-call, after-hours, weekend, and holiday shifts as required. Requirements ...
Data Center
Culver City, CA · On-site
... data center operations including monitoring, maintenance, and incident management • Ensures robust high availability, disaster recovery, and business continuity plans • Manages vendor ...
Data Center Technician Location: Colorado City, CO / Napa, CA (Onsite) Duration: W2 / C2C Contract ... Monitor, diagnose, and resolve complex network and computer system issues. Manage and administer ...
Collaborate with vendors and other data center stakeholders for efficient data center operations. Monitor, diagnose, and resolve complex network and computer system issues. Manage and administer ...
Data Center Administrator
Hawthorne, CA · On-site
... Monitor the environmental conditions and power/cooling systems of a data center to ensure they are optimum for servers, switches, storage arrays, and other devices. • Run hardware diagnostics and ...
Holiday Data Center Monitoring information
What is the difference between Holiday Data Center Monitoring vs Data Center Technician?
| Aspect | Holiday Data Center Monitoring | Data Center Technician |
|---|---|---|
| Certifications | Network+, CompTIA A+ | CompTIA A+, Cisco CCNA |
| Work Environment | Remote or on-site during holidays, monitoring systems | On-site, hardware and infrastructure maintenance |
| Job Focus | Monitoring systems, alert response, ensuring uptime during holidays | Installing, troubleshooting, repairing hardware and network issues |
Holiday Data Center Monitoring primarily involves overseeing data center systems remotely during holidays, focusing on alert management and system uptime. Data Center Technicians perform hands-on hardware and network repairs on-site. While both roles require certifications like CompTIA A+ and involve working in data center environments, their daily tasks and responsibilities differ significantly.
Full-time
Posted 26 days ago
Job description
Job Summary
The Data Center Operations Engineer is responsible for supporting, maintaining, and deploying critical data center infrastructure with a strong focus on Linux-based systems, GPU server deployments, and InfiniBand networking. This role requires hands-on expertise in data center operations, cluster bring-up, hardware installation, and troubleshooting across compute, network, and GPU environments. The engineer will collaborate closely with global infrastructure, development, and operations teams to ensure reliable, secure, and scalable service delivery.
Key Responsibilities
- Provide hands-on operational support for all data center projects, deployments, and repair activities.
- Participate in an on-call rotation and provide on-site or remote support during maintenance windows and incidents.
- Troubleshoot and resolve operational issues related to Linux servers, GPU platforms, networking, and storage infrastructure.
- Support customer and internal deployments, ensuring timely and successful bring-up of GPU servers and clusters.
- Perform InfiniBand fabric bring-up, switch configuration, subnet management, and troubleshooting.
- Conduct daily health checks of Linux systems and infrastructure components, proactively identifying and mitigating risks.
- Install, configure, test, and maintain server hardware (rack and stack, labeling, HDDs, memory, CPUs, RAID batteries, NICs, etc.).
- Install, configure, and troubleshoot networking equipment including routers, switches, and terminal servers for out-of-band management.
- Review and validate equipment deployments against approved design documentation and standards.
- Support data center builds, refreshes, migrations, and expansions while adhering to quality and safety standards.
- Coordinate with vendors and onsite staff for hardware delivery, diagnostics, replacement, and warranty services.
- Utilize monitoring and alerting frameworks to identify issues, escalate appropriately, and ensure timely service restoration.
- Maintain accurate documentation of operational procedures, system configurations, and runbooks.
- Follow established incident management, escalation procedures, and service-level agreements (SLAs).
- Collaborate with global teams across time zones to support operational initiatives and continuous improvement efforts.
- Contribute to process improvement initiatives and ensure adherence to documented policies, processes, and procedures.
Required Qualifications
- Bachelor's degree in Computer Science, Engineering, Information Technology, or equivalent practical experience.
- Strong hands-on experience in Linux environments, including system administration, troubleshooting, and performance validation.
- Proficiency with Linux command-line tools and shell scripting (Bash or equivalent).
- Experience with cluster bring-up, driver installation, and system-level configuration.
- Hands-on experience setting up and validating GPU servers in clustered environments.
- Experience with end-to-end GPU testing in InfiniBand-based clusters.
- Working knowledge of InfiniBand networking, including switch configuration and subnet management.
- Solid understanding of networking fundamentals, including the OSI model and TCP/IP protocol suite (IP, ARP, ICMP, TCP, UDP, SMTP, FTP, TFTP).
- Experience installing, configuring, and troubleshooting routers, switches, and terminal servers.
- Familiarity with fiber and copper cabling, including IP and SAN deployments.
- Experience managing incident tickets, maintaining acceptable ticket loads, and meeting SLAs.
- Strong organizational skills with meticulous attention to detail in data center environments.
- Ability to follow and enforce documented escalation procedures and operational policies.
- Strong verbal and written communication skills, with the ability to collaborate effectively with cross-functional and global teams.
Preferred Qualifications
- Experience supporting HPC, AI, or large-scale GPU environments.
- Exposure to data center monitoring.
- Experience documenting operational processes and maintaining technical runbooks.
- Familiarity with large-scale data center buildouts or refresh programs.
Physical Requirements
- Ability to perform the essential functions of the role, including lifting, moving, and installing equipment weighing 50 pounds or more, with or without reasonable accommodation.
- Ability to work in data center environments, including raised floors, equipment racks, and confined spaces.
- Willingness to work flexible hours, including nights, weekends, and on-call rotations as required.
Work Environment
- On-site data center environment with occasional remote coordination.
- Interaction with hardware vendors, service providers, and internal engineering teams.
- Fast-paced operational setting requiring attention to detail, adherence to safety standards, and rapid problem resolution.
About Cadence Design Systems
Sourced by ZipRecruiter
Industry
Software development
Company size
5,001 - 10,000 Employees
Headquarters location
San Jose, CA, US
Year founded
1988