Job Summary:
TensorWave is dedicated to delivering seamless and reliable AI compute at scale through its cloud platform. The company is seeking a DevOps Software Engineer to design and build integrations between internal platform services and infrastructure systems, with a focus on automation and orchestration across platforms.
Responsibilities:
• Design and build integrations between internal platform services (e.g., messaging/pub-sub systems), infrastructure systems (compute, storage, networking), and third-party vendor platforms
• Develop services and tools that enable automation workflows, system coordination and orchestration, and event-driven infrastructure operations
• Write production-quality code in Go, Python, and Rust (where applicable)
• Build APIs, services, and background workers that interact with infrastructure platforms, CI/CD systems, and automation frameworks
• Ensure code is reliable, observable, and maintainable
• Integrate software with automation systems such as Ansible, Terraform, and CI/CD pipelines (GitHub Actions, ArgoCD)
• Enable infrastructure workflows through APIs, event-driven systems, and automation hooks
• Work closely with DevOps engineers (infrastructure and automation) and development teams (application requirements)
• Translate infrastructure capabilities into usable APIs and services
• Help teams integrate their systems into platform workflows
• Build logging, metrics, and tracing into services
• Debug and resolve issues across distributed systems
• Ensure integrations are resilient and handle failure scenarios gracefully
• Identify gaps in platform integration and automation
• Build tooling that reduces manual work and improves system cohesion
• Contribute to standards for internal platform development
Qualifications:
Required:
• 5+ years of experience in software engineering, DevOps development, or platform engineering
• Strong programming experience in Go and/or Python
• Rust is a strong plus
• Experience building APIs, services, and system integrations
• Strong understanding of distributed systems concepts and event-driven architectures
• Experience working with Linux systems and infrastructure platforms
Preferred:
• Experience integrating with infrastructure platforms (compute, storage, networking) and Kubernetes environments
• Familiarity with message queues or pub/sub systems and CI/CD systems (GitHub Actions, ArgoCD)
• Experience with automation frameworks such as Ansible
• Experience working with third-party APIs and vendor platforms
• Exposure to infrastructure at scale or CSP environments
Company:
TensorWave is an AMD-exclusive cloud platform that leverages AMD Instinct GPUs and ROCm for high-performance AI workloads. Founded in 2023, the company is headquartered in Las Vegas, USA, with a team of 51-200 employees. The company is currently in the growth stage.