Your role will bridge the gap between cutting-edge networking technologies (InfiniBand, RoCE, EVPN, VXLAN) and real-world HPC adoption at scale. This position offers the opportunity to shape the ...
Your role will bridge the gap between cutting-edge networking technologies (InfiniBand, RoCE, EVPN, VXLAN) and real-world HPC adoption at scale. This position offers the opportunity to shape the ...
Responsibilities : • Deploy, operate and maintain RDMA-based network architectures (RoCE/InfiniBand) for cluster with thousands of nodes • Optimize network performance for distributed collective ...
Responsibilities : • Deploy, operate and maintain RDMA-based network architectures (RoCE/InfiniBand) for cluster with thousands of nodes • Optimize network performance for distributed collective ...
Senior Manager, AI Cluster Deployment
Springfield, OH · Remote
$180K - $220K/yr
The role requires hands-on familiarity with modern AI infrastructure tooling and architectures, including Canonical MaaS, VAST Data storage platforms, and both InfiniBand and Ethernet-based GPU ...
Quick apply
Senior Manager, AI Cluster Deployment
Springfield, OH · Remote
$180K - $220K/yr
The role requires hands-on familiarity with modern AI infrastructure tooling and architectures, including Canonical MaaS, VAST Data storage platforms, and both InfiniBand and Ethernet-based GPU ...
... Infiniband or Omni-Path high speed fabrics, including subnet management, IPoIB and/or IPoOPA mechanisms, fabric topology and health monitoring and integration with MPI • Experience with Lustre, NFS ...
... Infiniband or Omni-Path high speed fabrics, including subnet management, IPoIB and/or IPoOPA mechanisms, fabric topology and health monitoring and integration with MPI • Experience with Lustre, NFS ...
L11 Diags Engineer
Santa Clara, CA · On-site
Work with systems connected by various networking protocols, including Ethernet, NV Link and InfiniBand * Operate in Windows and Linux operating systems for test and debug tasks. * Collaborate with ...
Quick apply
L11 Diags Engineer
Santa Clara, CA · On-site
Work with systems connected by various networking protocols, including Ethernet, NV Link and InfiniBand * Operate in Windows and Linux operating systems for test and debug tasks. * Collaborate with ...
Senior HPC Engineer
Mountain View, CA · On-site
$160K/yr
Design, deploy and maintain HPC clusters with over 2000+ nodes with InfiniBand, 100+ petabytes of data storage in production. * Shepherd and/or contribute to scalable feature designs through the ...
Senior HPC Engineer
Mountain View, CA · On-site
$160K/yr
Design, deploy and maintain HPC clusters with over 2000+ nodes with InfiniBand, 100+ petabytes of data storage in production. * Shepherd and/or contribute to scalable feature designs through the ...
Ensure InfiniBand high speed fabrics operate correctly with MAC labeling * Ensure MLS networking meets controls requirements Education/Requirements * Bachelor's degree in Computer Science or an AIS ...
Ensure InfiniBand high speed fabrics operate correctly with MAC labeling * Ensure MLS networking meets controls requirements Education/Requirements * Bachelor's degree in Computer Science or an AIS ...
Senior HPC Engineer
Mountain View, CA · On-site
$123K - $168K/yr
Design, deploy and maintain HPC clusters with over 2000+ nodes with InfiniBand, 100+ petabytes of data storage in production. * Shepherd and/or contribute to scalable feature designs through the ...
Senior HPC Engineer
Mountain View, CA · On-site
$123K - $168K/yr
Design, deploy and maintain HPC clusters with over 2000+ nodes with InfiniBand, 100+ petabytes of data storage in production. * Shepherd and/or contribute to scalable feature designs through the ...
Systems Engineer, Kernel
$165K - $242K/yr
Intel/AMD/ARM CPUs, Nvidia GPUs, DPUs, Infiniband and Ethernet NICs * Docker, kubernetes (k8s), KubeVirt, containerd, kubelet Focus Areas: * Kernel Debugging - Analyze kernel crashes, oopses, panics ...
Quick apply
Systems Engineer, Kernel
$165K - $242K/yr
Intel/AMD/ARM CPUs, Nvidia GPUs, DPUs, Infiniband and Ethernet NICs * Docker, kubernetes (k8s), KubeVirt, containerd, kubelet Focus Areas: * Kernel Debugging - Analyze kernel crashes, oopses, panics ...
... InfiniBand high speed fabrics operate correctly with MAC labeling • Ensure MLS networking meets controls requirements Must have active TS//SCI with CI Poly to start Basic Qualifications: • ...
... InfiniBand high speed fabrics operate correctly with MAC labeling • Ensure MLS networking meets controls requirements Must have active TS//SCI with CI Poly to start Basic Qualifications: • ...
LLM Pre-training & Distributed Engineer (AI Infrastructure)
Seattle, WA · On-site
$122K - $160K/yr
Optimize networking (InfiniBand/RDMA) and memory management to prevent out-of-memory errors. * Automate checkpointing and failure recovery during month-long training runs. Required Skills: * Deep ...
LLM Pre-training & Distributed Engineer (AI Infrastructure)
Seattle, WA · On-site
$122K - $160K/yr
Optimize networking (InfiniBand/RDMA) and memory management to prevent out-of-memory errors. * Automate checkpointing and failure recovery during month-long training runs. Required Skills: * Deep ...
Optimize networking (InfiniBand/RDMA) and memory management to prevent out-of-memory errors. * Automate checkpointing and failure recovery during month-long training runs. Required Skills: * Deep ...
Optimize networking (InfiniBand/RDMA) and memory management to prevent out-of-memory errors. * Automate checkpointing and failure recovery during month-long training runs. Required Skills: * Deep ...
DevOps Engineer TS/SCI CI Poly with Security Clearance
Herndon, VA · On-site
$54.25 - $74.25/hr
... InfiniBand and OmniPath) • Experience with InfiniBand or Omni-Path high speed fabrics, including subnet management, IPoIB and/or IPoOPA mechanisms, fabric topology and health monitoring and ...
DevOps Engineer TS/SCI CI Poly with Security Clearance
Herndon, VA · On-site
$54.25 - $74.25/hr
... InfiniBand and OmniPath) • Experience with InfiniBand or Omni-Path high speed fabrics, including subnet management, IPoIB and/or IPoOPA mechanisms, fabric topology and health monitoring and ...
Systems Engineer, Kernel
Livingston, NJ · On-site
$165K - $242K/yr
Intel/AMD/ARM CPUs, Nvidia GPUs, DPUs, Infiniband and Ethernet NICs * Docker, kubernetes (k8s), KubeVirt, containerd, kubelet Focus Areas: * Kernel Debugging - Analyze kernel crashes, oopses, panics ...
Systems Engineer, Kernel
Livingston, NJ · On-site
$165K - $242K/yr
Intel/AMD/ARM CPUs, Nvidia GPUs, DPUs, Infiniband and Ethernet NICs * Docker, kubernetes (k8s), KubeVirt, containerd, kubelet Focus Areas: * Kernel Debugging - Analyze kernel crashes, oopses, panics ...
LLM Pre-training & Distributed Engineer (AI Infrastructure)
San Francisco, CA · On-site
$126K - $166K/yr
Optimize networking (InfiniBand/RDMA) and memory management to prevent out-of-memory errors. * Automate checkpointing and failure recovery during month-long training runs. Required Skills: * Deep ...
LLM Pre-training & Distributed Engineer (AI Infrastructure)
San Francisco, CA · On-site
$126K - $166K/yr
Optimize networking (InfiniBand/RDMA) and memory management to prevent out-of-memory errors. * Automate checkpointing and failure recovery during month-long training runs. Required Skills: * Deep ...
Preferred : • Understanding of HPC systems: data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience. • Strong coding and ...
Preferred : • Understanding of HPC systems: data center design, high speed interconnect InfiniBand, Cluster Storage and Scheduling related design and/or management experience. • Strong coding and ...
This includes optimizing for heterogeneous interconnects such as NVLink, Spectrum-X (Ethernet), and Quantum-X (InfiniBand). • Application- Communication Library Co-Design: Partner with application ...
This includes optimizing for heterogeneous interconnects such as NVLink, Spectrum-X (Ethernet), and Quantum-X (InfiniBand). • Application- Communication Library Co-Design: Partner with application ...
This includes optimizing for heterogeneous interconnects such as NVLink, Spectrum-X (Ethernet), and Quantum-X (InfiniBand). • Application- Communication Library Co-Design: Partner with application ...
This includes optimizing for heterogeneous interconnects such as NVLink, Spectrum-X (Ethernet), and Quantum-X (InfiniBand). • Application- Communication Library Co-Design: Partner with application ...
Principal Software Engineer, GPU Compute
San Mateo, CA · On-site
$153K - $206K/yr
Evaluate and onboard new GPU and AI accelerator platforms, networking topologies (NVLink, InfiniBand, RoCE), and multi-node training and inference patterns. * Establish the standards, tooling, and ...
Principal Software Engineer, GPU Compute
San Mateo, CA · On-site
$153K - $206K/yr
Evaluate and onboard new GPU and AI accelerator platforms, networking topologies (NVLink, InfiniBand, RoCE), and multi-node training and inference patterns. * Establish the standards, tooling, and ...
LLM Pre-training & Distributed Engineer (AI Infrastructure)
Boston, MA · On-site
$116K - $153K/yr
Optimize networking (InfiniBand/RDMA) and memory management to prevent out-of-memory errors. * Automate checkpointing and failure recovery during month-long training runs. Required Skills: * Deep ...
LLM Pre-training & Distributed Engineer (AI Infrastructure)
Boston, MA · On-site
$116K - $153K/yr
Optimize networking (InfiniBand/RDMA) and memory management to prevent out-of-memory errors. * Automate checkpointing and failure recovery during month-long training runs. Required Skills: * Deep ...
Infiniband information
See salary details
$11K - $20.3K
0% of jobs
$20.3K - $29.5K
0% of jobs
$36.1K is the 25th percentile. Wages below this are outliers.
$29.5K - $38.8K
35% of jobs
$38.8K - $48.1K
0% of jobs
$48.1K - $57.4K
0% of jobs
$57.4K - $66.6K
0% of jobs
$66.6K - $75.9K
0% of jobs
$75.9K - $85.2K
0% of jobs
$85.2K - $94.5K
0% of jobs
$94.5K - $103.7K
11% of jobs
The median wage is $104.3K / yr.
$103.7K - $113K
54% of jobs
$11K
$83.3K
$113K
How much do infiniband jobs pay per year?
What jobs in the US pay 300,000 a year?
What is Infiniband?
What jobs pay $500,000 a year in the US?
What are the key skills and qualifications needed to thrive as an InfiniBand Network Engineer, and why are they important?
What jobs pay $10,000 a month without a degree?
Which 3 jobs will survive AI?
What are the typical responsibilities of an InfiniBand network engineer in a data center environment?
What is the difference between Infiniband vs Ethernet Network Engineer?
| Aspect | Infiniband | Ethernet Network Engineer |
|---|---|---|
| Required Credentials | Networking certifications, Cisco, Cisco CCNA, CCNP | Networking certifications, Cisco, CCNA, CCNP |
| Work Environment | Data centers, high-performance computing environments | Corporate networks, data centers, enterprise environments |
| Industry Usage | High-performance computing, research institutions | Business, telecommunications, enterprise IT |
| Common Search/Comparison | Yes | Yes |
Infiniband and Ethernet Network Engineers both work with network infrastructure, but Infiniband specializes in high-speed, low-latency connections used in data centers and HPC environments. Ethernet Network Engineers focus on standard Ethernet networks used across various industries. While their certifications and skills overlap, their work environments and applications differ significantly.
Full-time
Medical, Dental, Vision, Life, Retirement, PTO
Posted 24 days ago
Job description
NorthMark Compute & Cloud (NMC²) is backed by dedicated leadership and investment, with a clear mission as it operates at the bleeding edge of technology. Its goal is to scale and enhance the high-performance computing (HPC) and cloud infrastructure that supports its clients' research, production, and delivery, enabling breakthroughs that shape the industries of tomorrow. Its engineers build critical infrastructure to eliminate friction in scientific research, simulations, analysis, and decision-making, accelerating discovery and driving faster innovation.
The Position
As an HPC Network Solutions Architect, you will design, integrate, and optimize high-performance networking architectures that form the backbone of HPC, AI/ML, and data-intensive workloads. You will act as a trusted advisor to customers, guiding them across the entire solution lifecycle - from requirements gathering and design, through proof-of-concept and deployment, to optimization and long-term adoption.
This is a customer-facing, technically focused role. You will collaborate closely with customers to align low-latency, high-bandwidth networking designs with their workload requirements, while also working with internal engineering and product teams to influence roadmap priorities. Your role will bridge the gap between cutting-edge networking technologies (InfiniBand, RoCE, EVPN, VXLAN) and real-world HPC adoption at scale.
This position offers the opportunity to shape the future of HPC networking, deliver measurable impact for customers, and influence vendor ecosystems by incorporating emerging innovations into enterprise-ready solutions.
Responsibilities
- Act as the primary networking SME for customers adopting or scaling HPC environments.
- Partner with customers to capture network performance goals, scalability requirements, and integration constraints.
- Design and document end-to-end HPC network architectures, including Ethernet, InfiniBand, RoCE, EVPN, and VXLAN fabrics.
- Lead proof-of-concept and benchmarking engagements, validating low-latency and high-throughput designs against workload requirements.
- Optimize multi-vendor, multi-protocol data center and HPC interconnects, addressing scaling challenges such as data gravity and throughput bottlenecks.
- Define integration strategies across compute, storage, orchestration, and security layers to deliver resilient, workload-aware solutions.
- Conduct network performance assessments and tuning, identifying bottlenecks and recommending enhancements.
- Build observability frameworks for HPC networks at scale using tools like Prometheus, Grafana, and vendor telemetry.
- Collaborate with engineering, product, and operations teams to refine architecture blueprints and ensure consistent delivery.
- Partner with ecosystem vendors (e.g., NVIDIA, Mellanox, Cisco, Arista) to integrate cutting-edge features and influence roadmap evolution.
- Stay current with emerging HPC networking technologies and protocols, providing future insight to customers on adoption strategies.
- Represent the organization at customer design sessions, workshops, and industry events, building strong technical relationships.
Requirements
- Demonstrated experience in HPC networking solution architecture, systems design, or data center network engineering.
- Strong expertise in InfiniBand and RoCE protocols, including deployment and tuning at scale.
- Hands-on experience designing and implementing large-scale Ethernet networks, including BGP, OSPF, EVPN, and VXLAN.
- Deep understanding of GPU communication frameworks such as MPI and NCCL, and their integration with HPC interconnects.
- Proficiency with Linux-based environments and scripting (e.g., Python, Bash, PowerShell) for automation.
- Experience supporting multi-vendor environments and evaluating new networking platforms.
- Ability to translate complex networking requirements into clear solution architectures and present them effectively to customers.
- Strong customer-facing communication skills, including the ability to engage executives and technical stakeholders alike.
Preferred Experience
- Experience delivering HPC or AI/ML workloads across large-scale, low-latency network infrastructures.
- Familiarity with CNI plugins (Multus, Cilium, NVIDIA CNI) for HPC/Kubernetes environments.
- Exposure to automation and infrastructure-as-code practices for network provisioning (Terraform, Ansible).
- Experience in vendor collaboration, including influencing feature roadmaps and participating in joint evaluations.
- Contributions to open-source HPC networking or infrastructure projects.
- Bachelor's or Master's degree in Computer Science, Networking, Engineering, or a related technical field.
- Relevant Networking and systems certifications such as Cisco CCNP/CCIE, Juniper JNCIP, AWS Advanced Networking Specialty, or Red Hat RHCE.
It is impossible to list every requirement for, or responsibility of, any position. Similarly, we cannot identify all the skills a position may require since job responsibilities and the Company's needs may change over time. Therefore, the above job description is not comprehensive or exhaustive. The Company reserves the right to adjust, add to or eliminate any aspect of the above description. The Company also retains the right to require all employees to undertake additional or different job responsibilities when necessary to meet business needs.
Must be legally authorized to work in the United States without the need for employer sponsorship, now or at any time in the future.
Benefits & Perks:
- Company-Paid Lunch Stipend: Lunch is provided via GrubHub
- Company-Paid Benefits: 100% Employer-Paid Medical in our High Deductible Health Plan, Dental and Vision benefits for employees and their families, 16 weeks of Paid Parental Leave, Employee Assistance Program, Life insurance, Short-Term Disability and Long-Term Disability
- 401(k): Company will match 100% of your contributions up to 6%
- Optional Employee-Paid Benefits: Medical insurance in our PPO plan and a variety of other benefits such as Health Savings Accounts (with Company Contribution!), Flexible Spending Accounts, Supplemental Life Insurance, Wellhub and more.
- Time Off: 25 days of Paid Time Off plus 12 company holidays
EQUAL OPPORTUNITY EMPLOYER
NORTHMARK STRATEGIES LLC IS AN EQUAL EMPLOYMENT OPPORTUNITY EMPLOYER. THE COMPANY'S POLICY IS NOT TO DISCRIMINATE AGAINST ANY APPLICANT OR EMPLOYEE BASED ON RACE, COLOR, RELIGION, NATIONAL ORIGIN, GENDER, AGE, SEXUAL ORIENTATION, GENDER IDENTITY OR EXPRESSION, MARITAL STATUS, MENTAL OR PHYSICAL DISABILITY, AND GENETIC INFORMATION, OR ANY OTHER BASIS PROTECTED BY APPLICABLE LAW. THE FIRM ALSO PROHIBITS HARASSMENT OF APPLICANTS OR EMPLOYEES BASED ON ANY OF THESE PROTECTED CATEGORIES.