Job Summary:
Quarterhill Inc. is a forward-thinking company focused on building the next generation of Intelligent Transportation Systems. They are seeking a seasoned DevOps Engineer to work with a modern tech stack to build and maintain infrastructure that supports real-time data processing across cloud and on-premise environments.
Responsibilities:
• Implement and manage production-grade Kubernetes clusters using Argo CD for GitOps deployments and Terraform for Infrastructure as Code
• Build and maintain scalable infrastructure solutions for both cloud and on-premise environments
• Implement robust CI/CD pipelines with a focus on containerized applications
• Manage infrastructure performance, ensuring high availability for mission-critical systems
• Maintain our S3-based Data Lake infrastructure integrated with Dagster for scalable data orchestration
• Manage and optimize our NATS messaging system for real-time event streaming and communication
• Manage infrastructure to performantly run our Numaflow pipelines for real-time stream processing and reliable data flow
• Scale data infrastructure to support growing SaaS platform and expanding customer base
• Deploy and manage on-premise/edge computing infrastructure
• Implement hybrid cloud solutions that seamlessly integrate edge deployments with centralized cloud infrastructure
• Ensure reliable connectivity and data synchronization between edge nodes and central systems
• Optimize compute resource allocation across distributed computing environments
• Work closely with our development teams to optimize containerized application deployment of Go and Python code
• Manage and enhance our Bazel build system and evaluate complementary build tools
• Collaborate with our small team to rapidly prototype, test, and deploy new features
• Integrate security practices into the CI/CD pipeline to ensure that all software releases meet stringent security standards.
• Maintain compliance with industry regulations (e.g., PCI DSS, GDPR) and internal security policies to ensure that sensitive data is protected.
• Stay current with security trends and emerging threats, implementing updates and patches to mitigate vulnerabilities.
• Implement comprehensive monitoring solutions for distributed systems, data pipelines, and message brokers
• Proactively identify and resolve performance bottleneck in data processing
• Ensure system reliability and disaster recovery capabilities across cloud and edge environments
Qualifications:
Required:
• Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent work experience).
• Proven experience as a DevOps Engineer or in a similar role, with at least 5+ years of experience in DevOps, IT infrastructure, or software engineering.
• Hands-on, production experience with Kubernetes for at least 3+ years of time.
• Proven experience managing Data Lakes/Data Warehouses (e.g. Hadoop, Spark, Snowflake, BigQuery, etc.).
• Experience working in a high-availability, high-performance environment, such as transportation, data analytics, or financial systems.
• Expert-level Kubernetes knowledge including cluster management, networking, storage, and security.
• Terraform proficiency for Infrastructure as Code across multiple cloud environments.
• Bazel build system experience or familiarity with large-scale build systems used in monorepo settings (Buck, NX, Turbo, Pants, etc.).
• ArgoCD or other GitOps experience for Kubernetes deployments.
• Strong experience in identifying, diagnosing, and triaging various system issues related to performance and reliability.
• Experience with CI/CD tools, such as Github Actions and GitOps methodologies.
• Knowledge of monitoring and observability tools (Grafana, Prometheus, OTEL, etc.).
Preferred:
• AWS Certified Solutions Architect, Kubernetes Certified Administrator, or similar certifications (preferred but not required).
• Experience with on-premise/edge deployments and hybrid cloud architectures.
• Background in transportation or other related industries deploying and debugging software.
• Knowledge of microservices architecture and container orchestration tools.
• Familiarity with infrastructure-as-code (e.g., Terraform, AWS CloudFormation).
• Strong understanding of networking, load balancing, and security protocols.
Company:
Driving the future of transportation further, faster, smarter. Founded in , the company is headquartered in Frisco, Texas, US, , with a team of 201-500 employees. The company is currently Growth Stage.