Job Summary:
Armada is the hyperscaler for the edge, delivering modular AI infrastructure from first deployment to AI factory with speed, scale, and sovereignty. The Senior Software Engineer will play a key role in the design and implementation of on-premise Compute as a Service (CaaS) and GPU as a Service (GPUaaS) offerings, contributing to a robust infrastructure platform and a cloud-integrated marketplace.
Responsibilities:
• Contribute to the architectural design and implementation of a scalable and efficient CaaS and GPUaaS platform on our on-premise bare metal clusters.
• Implement and integrate key technologies for our stack, including hypervisors (e.g., KVM), container runtimes (Docker, containerd), orchestration platforms (Kubernetes), and GPU virtualization technologies (Kubevirt, vGPU).
• Develop and implement robust monitoring, logging, and alerting systems for the CaaS and GPUaaS platform.
• Optimize performance and resource utilization of the bare metal infrastructure.
• Contribute to the design and development of a cloud-integrated marketplace for deploying applications onto the on-premise infrastructure.
• Implement backend services and API integrations for the marketplace.
• Develop secure and efficient application deployment workflows.
• Integrate the marketplace with existing cloud services and infrastructure.
• Develop and maintain comprehensive documentation for the marketplace.
• Write clean, well-tested, and maintainable code.
• Participate in technical design and code reviews.
• Stay up-to-date with the latest industry trends and technologies.
• Collaborate with cross-functional teams, including engineering, operations, and product management.
Qualifications:
Required:
• Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
• 6+ years of experience in infrastructure engineering or software development.
• Proven experience in designing and implementing CaaS and/or GPUaaS platforms.
• Strong understanding of containerization technologies (Docker, Kubernetes).
• Experience with bare metal infrastructure and hardware management.
• Experience with cloud platforms (AWS, Azure, GCP).
• Experience designing and implementing marketplaces or similar platforms.
• Strong programming skills in languages such as Python, Go, or Java.
• Excellent problem-solving and troubleshooting skills.
• Strong communication and collaboration skills.
• Experience with infrastructure as code tools (Terraform, Ansible).
Preferred:
• Experience with high-performance computing (HPC) environments.
• Experience with GPU virtualization and management.
• Experience with distributed storage systems.
• Experience with security best practices in cloud and on premise environments.
• Experience with building CI/CD pipelines.
Company:
Armada develops computing infrastructure that supports AI processing, connectivity, and data management in remote environments. Founded in 2022, the company is headquartered in San Francisco, USA, with a team of 201-500 employees. The company is currently Growth Stage.