Xometry is seeking a Site Reliability Engineer II to join our Site Reliability Engineering (SRE) organization. As an individual contributor in this role, you will guide the reliability and performance of our infrastructure and software systems across several engineering teams and influence decisions across our technology organization. You will use your technical expertise to help us build reliable, flexible infrastructure solutions that empower our technology organization to deploy new features for our customers quickly and safely.
Responsibilities
- Take ownership of assigned problem statements and drive them to completion with guidance from senior engineers.
- Write clean, efficient, and well-documented code while improving existing systems and features.
- Accurately estimate timelines for features and tasks, learning to balance effort, risk, and impact.
- Collaborate effectively across teams, communicating clearly on progress, blockers, and outcomes.
- Seek and apply feedback from peers and managers to improve code quality, technical skills, and delivery consistency.
- Support team members and contribute to a positive, learning-oriented team culture.
- Take ownership of personal development goals, showing steady progress in technical and problem-solving skills.
- Demonstrate accountability, curiosity, and continuous improvement in all aspects of your work.
- Develop, configure, and maintain underlying platforms for deployed software (AWS accounts and networking, Kubernetes clusters, and similar systems).
- Develop, configure, and maintain observability and monitoring tools (Coralogix, Sentry, etc.).
- Develop, configure, and maintain software development (CI/CD) tools (GitHub Actions runners, ArgoCD, etc.).
Qualifications Required:
- 3+ years of professional experience in infrastructure management or backend software development in a fast-paced, product-driven environment.
- Demonstrated technical expertise in one or more of the following languages: Python, JavaScript, or Unix Shell.
- Experience with AWS, including deploying, monitoring, and scaling production workloads.
- General experience with Terraform, Kubernetes, CI/CD pipelines, and Docker.
- Comfortable working in an operational environment, including participation in an on-call schedule.
- Excellent communication and collaboration skills, comfortable engaging with both technical and non-technical stakeholders.
The estimated base salary range for new hires into this role is $135,000 - $165,000 annually + commission, depending on factors such as job-related skills, relevant experience, and location. We also offer a competitive benefits package, including 401(k) match; medical, dental, and vision insurance; life and disability insurance; generous paid time off, including vacation, sick leave, floating and fixed holidays, and maternity and bonding leave; an EAP and other wellbeing resources; and much more.
#LI-Hybrid