Job summary
The Platform Ops team within CloudOps is responsible for the reliability, scalability, and modernization of DigiCert's cloud infrastructure. As a Principle SRE, you will own the intersection of software engineering and operations-driving automation-first practices, reducing toil, and accelerating our cloud transformation across AWS, Azure, and GCP environments.
You will be a technical force multiplier: raising reliability standards across the organization, defining SLOs that matter, and building the internal platforms and tooling that enable product teams to ship with confidence.
What you will do
Reliability Engineering
- Define, implement, and own SLIs, SLOs, and error budgets for critical platform services
- Lead blameless post-mortems and drive systemic reliability improvements across the platform
- Design and implement observability pipelines (metrics, logs, traces) using tools such as Splunk, Prometheus, Grafana, or OpenTelemetry
- Participate in on-call rotation and serve as an incident commander for P0/P1 events
Cloud Modernization
- Architect and execute migration strategies from legacy infrastructure to cloud-native patterns (containers, serverless, managed services)
- Champion adoption of Kubernetes, service mesh, and managed cloud services (EKS, GKE, AKS)
- Evaluate and introduce emerging cloud technologies that improve availability, cost efficiency, and developer experience
- Partner with architecture and security teams to embed reliability and compliance into platform design
Automation & Platform Development
- Build and maintain infrastructure-as-code using Terraform across multi-cloud environments
- Develop internal tooling, self-service platforms, and golden-path templates that reduce operational burden for development teams
- Automate operational workflows including provisioning, scaling, patching, and secret rotation
- Contribute to and maintain CI/CD pipelines (GitHub Actions) to enable safe, frequent deployments
Engineering Leadership
- Mentor mid-level engineers on SRE principles, distributed systems, and infrastructure best practices
- Collaborate cross-functionally with product, security, and compliance teams to deliver on platform roadmap commitments
- Document architectural decisions, runbooks, and platform standards; raise the engineering bar through code and design reviews
What you will have
- 5+ years of experience in SRE, platform engineering, or infrastructure engineering roles
- Deep proficiency in at least one major cloud provider (AWS, GCP, or Azure) with working knowledge of multi-cloud environments
- Strong software engineering skills in Python, Go, or Bash; comfortable writing production-grade automation and tooling
- Hands-on Kubernetes experience: cluster operations, workload management, networking (CNI/service mesh), and security (RBAC, pod security)
- Infrastructure-as-code expertise with Terraform or equivalent; experience with GitOps workflows
- Proven experience designing and operating observability systems and responding to production incidents at scale
- Strong understanding of networking fundamentals: DNS, TLS/PKI, load balancing, and zero-trust networking concepts
Nice to have
- Experience in PKI, certificate lifecycle management, or security-adjacent infrastructure
- Familiarity with compliance frameworks such as SOC 2, FedRAMP, or ISO 27001 in cloud environments
- Prior experience driving cloud migration or modernization programs at scale
- Contributions to open-source infrastructure or platform projects
- AWS/GCP/Azure professional-level certifications (e.g., AWS Solutions Architect Professional, CKA/CKS)
What success looks likeย
In your first 90 days, you'll have a deep understanding of our platform's reliability posture, contributed to at least one automation or modernization initiative, and be a trusted voice in incident response. Within a year, you'll have measurably reduced toil, improved SLO attainment across key services, and delivered at least one major platform capability that enables product teams to move faster.
Working at DigiCert CloudOpsย
- Greenfield modernization: we are actively migrating workloads and building new platform capabilities-you'll shape the architecture, not just maintain it
- Engineering-first culture with a strong bias toward automation, GitOps, and platform thinking
- Cross-functional visibility: PlatformOps partners directly with product, security, and compliance-your work has organization-wide impact
- Competitive compensation, equity, and comprehensive benefits including flexible PTO and remote-first flexibility
Benefits
- Competitive compensation and comprehensive health, dental, and vision coverageย
- Retirement savings programs with company matching (401(k) or RRSP)ย
- Generous paid time off, including holidays, and vacationย
- Paid parental leave and family support benefitsย
- Life and disabilityย coverageย
- Flexible spending and health savings options (where applicable)ย
- Health and wellness support, including gym reimbursement and wellness programsย
- Employee Assistance Program withย 24/7confidential support for employees and familiesย
- Educationย assistanceย and professional development opportunitiesย
- Access to LinkedIn Learning and continuous learning resourcesย
- Employee referral bonus program andย additionalย companyย perksย and discountsย
- Internal rewards and recognition platform (Motivosity) to celebrate and acknowledge project wins, milestone achievements, and the outstanding contributions of our colleagues
- Business travel insurance and global employee support programsย
To protect candidate information and maintain a secure hiring process, all applications must be submitted through our careers portal. Resumes or CVs sent directly via email will not be reviewed or considered.
DigiCert is an Equal Opportunity employer and is committed to diversity in its workforce. In compliance with applicable federal and state laws, DigiCert prohibits discrimination on the basis of race or ethnicity, religion, color, national origin, sex, age, sexual orientation, gender identity/expression, veteran's status, status as a qualified person with a disability, or genetic information. Individuals from historically underrepresented groups, such as minorities, women, qualified person with disabilities, and protected veterans are strongly encouraged to apply.
#LI-RR1