2

Remote Hpc System Engineer Jobs in Texas (NOW HIRING)

AI System Architect

Houston, TX ยท On-site +1

$232K/yr

HPC & AI Business Unit - AI System Architect Distinguished Technologist. - AI & Machine Learning ... Requires a broad knowledge and application of engineering disciplines, methodologies and tools ...

Senior Power Systems Engineer

Austin, TX ยท On-site +1

$103K - $141K/yr

As a Senior Power System Engineer at EPE you will manage and lead power system projects, including ... This position is open to Remote US or Canada Travel : Occasional travel may be needed (10% or less ...

Senior Power Systems Engineer

Austin, TX ยท On-site +1

$103K - $141K/yr

As a Senior Power System Engineer at EPE you will manage and lead power system projects, including ... This position is open to Remote US or Canada Travel : Occasional travel may be needed (10% or less ...

Senior Power Systems Engineer

Austin, TX ยท On-site +1

$103K - $141K/yr

As a Senior Power System Engineer at EPE you will manage and lead power system projects, including ... This position is open to Remote US or Canada Travel : Occasional travel may be needed (10% or less ...

Senior Kubernetes Platform Engineer

Austin, TX ยท On-site +1

$54.75 - $70.50/hr

... and HPC datacenters. Our differentiated architecture seamlessly integrates hardware, software and system level technologies to maximize the efficiency of GPU, CPU and accelerator-based compute ...

Building Systems Engineer - BMS/EPMS The Building Systems Engineer specializing in Building ... System Design and Implementation: Develop and implement BMS and EPMS solutions, ensuring seamless ...

next page

Showing results 1-20

Remote Hpc System Engineer information

What are the key skills and qualifications needed to thrive as a Remote HPC System Engineer, and why are they important?

To thrive as a Remote HPC System Engineer, you need expertise in Linux system administration, parallel computing, networking, and a degree in computer science or related field. Familiarity with job schedulers (like Slurm), cluster management tools, scripting languages (such as Python or Bash), and certifications like CompTIA Linux+ or Red Hat Certified Engineer are highly valuable. Strong problem-solving abilities, effective communication, and self-motivation are essential soft skills for remote collaboration and troubleshooting. These skills ensure the reliable operation, optimization, and scalability of HPC systems in distributed environments.

What are some common challenges faced by Remote HPC System Engineers, and how can they be managed effectively?

Remote HPC System Engineers often encounter challenges such as troubleshooting complex hardware or software issues without physical access, ensuring seamless system performance, and coordinating with geographically dispersed teams. These can be managed by leveraging strong remote monitoring tools, maintaining clear documentation, and establishing effective communication channels with on-site staff. Proactively scheduling regular system health checks and participating in virtual team meetings can also help address problems quickly and maintain high system reliability.

What is the difference between Remote Hpc System Engineer vs Remote Cloud Infrastructure Engineer?

AspectRemote Hpc System EngineerRemote Cloud Infrastructure Engineer
CredentialsTypically requires Linux certifications, HPC-specific trainingOften requires cloud platform certifications (AWS, Azure, GCP)
Work EnvironmentHigh-performance computing clusters, research labsCloud platforms, data centers, virtualized environments
Industry UsageResearch, scientific computing, academiaTech, finance, enterprise IT
Search/Comparison IntentUnderstanding HPC-specific roles vs cloud rolesComparing on-premise HPC vs cloud infrastructure

The Remote Hpc System Engineer focuses on managing and optimizing high-performance computing clusters, often in research or scientific environments. In contrast, the Remote Cloud Infrastructure Engineer specializes in designing and maintaining cloud-based infrastructure across various industries. While both roles require technical expertise in system management, their environments and certifications differ, catering to distinct operational needs.

What are Remote HPC System Engineers?

Remote HPC (High Performance Computing) System Engineers are IT professionals who design, implement, manage, and troubleshoot HPC systems and clusters from a remote location. They work with advanced computing infrastructure that supports scientific research, complex simulations, and large-scale data processing. Their responsibilities include configuring hardware and software, monitoring system performance, ensuring security, and providing technical support to users, all while working off-site. This role requires strong expertise in HPC technologies, operating systems like Linux, networking, and scripting, as well as effective communication skills for collaborating with distributed teams.
What are the most commonly searched types of Hpc System Engineer jobs in Texas? The most popular types of Hpc System Engineer jobs in Texas are:
What cities in Texas are hiring for Remote Hpc System Engineer jobs? Cities in Texas with the most Remote Hpc System Engineer job openings:

Azure DevOps Senior Systems Engineer

Sparc Technology Services Inc

Flower Mound, TX โ€ข Remote

$60 - $70/hr

Full-time

Posted 10 days ago


Job description

Role: Azure DevOps Senior Systems Engineer
Location: Remote
Type of Hire: Contract
Duration: 6+ Months
Must be willing to use your own laptop
Must be willing to work in PST Hours
Experience: 13+ Years of experience

3 levels of interviews
1st interview-Introduction on azure skills
2nd interview- 1 hour coding interview
3rd interview with manager

Job Description:
Cloud Engineering/Platform Engineering. Cloud/DevOps Engineer responsible for designing and deploying secure Azure infrastructure using Terraform and Bicep. This project is on modernizing CIAM infrastructure in Azure by implementing secure, scalable, and compliant cloud-native solutions. The initiative includes provisioning and deploying infrastructure using Terraform and Bicep. The objective is to enhance encryption, strengthen security controls, and ensure high availability and compliance with enterprise security standards.

Skills | No. of years of Experience | Detailed Write Up
Total No. of Years of Experience | |
Total No. of Years of Experience as an Azure Dev Ops System Engineer | |
Must be a hands-on Microsoft Azure System Engineer | |
Must have Azure App Service | |
Must have Azure native deployment | |
Must have Landing Zones (Very important. Experience standing up and managing Azure landing zones | |
Must have Azure Key Vault (Premium) | |
Must have Terraform Landing Zones | |
Must have Bicep | |
Must have worked on Azure DevOps (ADO) | |
Must have Azure infra using IaC. | |
Must have Implementing CI/CD pipelines in Azure DevOps for infrastructure deployments. | |
Must have Azure Private End points | |
Must have Azure Networking Git | |
Must have Service Principals | |
Must have RBAC | |
Build pipelines, deploying to Azure resources and Saas products | |
Experience with set/troubleshoot security mechanisms like mTLS | |
Must have AppInsights experience | |
Experience with certificate management, manual and implementing automation | |
Experience with PowerShell | |
Familiarity with IAM and user authentication (such as oauth and SAML) is preferred | |
Experience with ServiceNow preferred | |
Strong troubleshooting skills required | |
Proactive ownership of the system engineer space is necessary, including identifying and prioritizing work as a recommended roadmap | |
Managing role-based access control (RBAC) and service principal configurations. | |
Collaborating with architects and security teams to ensure compliance with encryption and networking standards. | |
Troubleshooting pipeline failures and network access issues (e.g., firewall and private endpoint configurations) | |
Supporting sandbox, dev, and environment-specific infrastructure deployments. | |
Have you worked on modernizing CIAM infrastructure in Azure by implementing secure, scalable, and compliant cloud-native solutions. The initiative includes provisioning and deploying infrastructure using Terraform and Bicep. The objective is to enhance encryption, strengthen security controls, and ensure high availability and compliance with enterprise security standards. | |
Willing to work on On Call rotation

This is a remote position.