Remote HPC System Engineer Jobs in Colorado (NOW HIRING)

Role Overview VDURA is seeking a Senior System Engineer to lead the specification, selection, and ... Experience working with HPC, AI, or large-scale storage systems * Proven ability to collaborate ...

Ground System Engineer

Aurora, CO · On-site +1

$99K - $225K/yr

If this sounds like you, come join Booz Allen's new Remote Sensing Systems Engineering and Integration (SE&I) team to work on the military's space programs. As a Ground Systems Engineer on our team ...

Systems Engineer Belong. Connect. Grow. with KBR! KBR's National Security Solutions team provides ... Colorado Springs, CO (or remote) * Travel Requirements: Minimum: up to 10% of time within the ...

IT Systems Engineer

Colorado Springs, CO · On-site +1

$120K - $160K/yr

TS/SCI Potential for Remote Work: ORA_ON_SITE Description SAIC is seeking a Mid-Level IT Systems Engineer to join our team of diverse IT professionals in delivering innovative solutions to modernize ...

Remote HPC System Engineer information

What are the key skills and qualifications needed to thrive as a Remote HPC System Engineer, and why are they important?

To thrive as a Remote HPC System Engineer, you need expertise in Linux system administration, parallel computing, and networking, along with a degree in computer science or a related field. Familiarity with job schedulers (like Slurm), cluster management tools, and scripting languages (such as Python or Bash) is highly valuable, as are certifications like CompTIA Linux+ or Red Hat Certified Engineer. Strong problem-solving abilities, effective communication, and self-motivation are essential soft skills for remote collaboration and troubleshooting. Together, these skills support the reliable operation, optimization, and scalability of HPC systems in distributed environments.

What are some common challenges faced by Remote HPC System Engineers, and how can they be managed effectively?

Remote HPC System Engineers often encounter challenges such as troubleshooting complex hardware or software issues without physical access, ensuring seamless system performance, and coordinating with geographically dispersed teams. These can be managed by leveraging strong remote monitoring tools, maintaining clear documentation, and establishing effective communication channels with on-site staff. Proactively scheduling regular system health checks and participating in virtual team meetings can also help address problems quickly and maintain high system reliability.

What is the difference between a Remote HPC System Engineer and a Remote Cloud Infrastructure Engineer?

Aspect | Remote HPC System Engineer | Remote Cloud Infrastructure Engineer
Credentials | Typically requires Linux certifications, HPC-specific training | Often requires cloud platform certifications (AWS, Azure, GCP)
Work Environment | High-performance computing clusters, research labs | Cloud platforms, data centers, virtualized environments
Industry Usage | Research, scientific computing, academia | Tech, finance, enterprise IT
Search/Comparison Intent | Understanding HPC-specific roles vs cloud roles | Comparing on-premise HPC vs cloud infrastructure

The Remote HPC System Engineer focuses on managing and optimizing high-performance computing clusters, often in research or scientific environments. In contrast, the Remote Cloud Infrastructure Engineer specializes in designing and maintaining cloud-based infrastructure across various industries. Both roles require technical expertise in system management, but their environments and certifications differ.

What are Remote HPC System Engineers?

Remote HPC (High Performance Computing) System Engineers are IT professionals who design, implement, manage, and troubleshoot HPC systems and clusters from a remote location. They work with advanced computing infrastructure that supports scientific research, complex simulations, and large-scale data processing. Their responsibilities include configuring hardware and software, monitoring system performance, ensuring security, and providing technical support to users, all while working off-site. This role requires strong expertise in HPC technologies, operating systems like Linux, networking, and scripting, as well as effective communication skills for collaborating with distributed teams.

Systems Engineer, Platform

VDURA

Niwot, CO • On-site, Remote

Full-time

Posted 27 days ago


Job description

VDURA is redefining high-performance data infrastructure for AI, HPC, and data-intensive workloads. Building on our heritage as the creators of PanFS, VDURA is delivering next-generation parallel file system solutions designed for extreme scale, performance, and reliability across modern compute environments. Our platforms integrate cutting-edge server, storage, and networking technologies to power some of the most demanding workloads in the world.
 
Role Overview
VDURA is seeking a Senior System Engineer to lead the specification, selection, and qualification of server, storage, and networking platforms used in VDURA parallel file system solutions. This role is critical to ensuring our hardware platforms meet the performance, scalability, reliability, and cost objectives required for AI and HPC workloads.
 
You will work closely with software engineering, QA, product management, and external partners to define reference architectures, evaluate emerging technologies, and qualify platforms for both internal development and customer deployment.
 
Key Responsibilities
  • Define and own system architectures for VDURA parallel file system solutions, including compute, storage, and networking components
  • Specify, evaluate, and select server, storage, and networking platforms from OEMs and technology partners for current and next-generation products
  • Lead hardware bring-up, qualification, and validation efforts in collaboration with software, QA, and lab teams
  • Recommend continuous improvement changes to the platform definition based on vendor roadmaps and customer feedback
  • Develop build and test instructions for integration and manufacturing partners
  • Drive performance characterization of platforms, including throughput, latency, IOPS, failover behavior, and scalability under real-world workloads
  • Partner with software teams to ensure optimal alignment between hardware capabilities and PanFS datapath and control planes
  • Work directly with OEMs, IHVs, and component vendors on roadmap alignment, issue resolution, and joint qualification efforts
  • Support customer engagements by providing platform guidance, configuration recommendations, and technical deep dives as needed
  • Contribute to lab infrastructure planning and ensure test environments reflect future customer-facing configurations
 
Required Qualifications
  • Bachelor’s degree in Electrical Engineering, Computer Engineering, Computer Science, or a related field (Master’s preferred)
  • 10+ years of experience in hardware engineering, systems engineering, or platform architecture roles
  • Strong hands-on experience specifying, benchmarking and qualifying servers, storage systems, and high-performance networking platforms
  • Engineering experience with:
    • x86 server architectures
    • Storage – SAS, SATA, NVMe
    • Networking – Ethernet, InfiniBand, RDMA, TCP, UDP
    • Multicore – NUMA, memory management, caching
    • PCIe Gen5/6
    • Hypervisor technologies – particularly KVM
  • Experience working with HPC, AI, or large-scale storage systems
  • Proven ability to collaborate cross-functionally with software, QA, and product teams
  • Comfortable working with vendors and partners at both technical and roadmap levels
  • Strong analytical, documentation, and communication skills
 
Preferred Qualifications
  • Experience with parallel file systems, distributed storage, or scale-out data platforms
  • Familiarity with GPU-accelerated systems and AI infrastructure requirements
  • Experience with storage benchmarking tools and workload characterization
  • Prior exposure to customer-facing technical roles or field engineering support
  • Experience building or managing lab environments for system qualification
 
Location: We strongly prefer candidates in Pittsburgh, PA or Denver, CO. However, we are open to remote candidates who meet the qualifications and can work effectively from a remote location.
 
VDURA Culture & Values
At VDURA, we’re committed to transforming data into the catalyst for groundbreaking human advancements. We’re looking for collaborative and innovative minds to join our team and help us drive this mission forward. If you’re passionate about mastering complex challenges, driving innovation, and making a tangible impact on the world, VDURA is the place for you. Join us and be part of a vibrant, “can do” culture where every individual has the potential to make a real impact on our business success.
 
At VDURA, we value diversity and are proud to be an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.