Job Summary:
Tesla is a leading company in the automotive industry, known for its innovative products. The company is seeking a seasoned Site Reliability Engineer to support the development of next-generation diagnostics software and services, with a focus on infrastructure management, production issue mitigation, and code reviews.
Responsibilities:
• Capacity planning and analysis, and infrastructure change management (including tuning, reshaping, resizing, and migrating infrastructure), for services and their immediate downstreams
• Work with SWE counterparts to identify and mitigate production issues; validate, document, and exercise failover/disaster recovery plans, graceful degradation mechanisms, and the associated policies and standard methodologies
• Actively participate and contribute to code reviews and technical design documents, with an eye toward identifying performance and reliability bottlenecks
• Productionize new services and features, and improve the production landscape for existing services, by providing SRE expertise and implementing standard methodologies in CI/CD and dashboard integrity improvements, and by identifying and evaluating, on an ongoing basis, the right set of alerts, SLOs, and error budgets for services
• Attend team meetings, standups, and on-call handoffs
• Participate in team on-call rotation
Qualifications:
Required:
• Degree in Computer Science, Electrical Engineering, Automotive Engineering, or related field, or equivalent experience
• Expert knowledge of Linux operating system internals, filesystems, disk/storage technologies, storage protocols, and the networking stack
• Experience with troubleshooting and full-cycle incident response (mitigation, correction, prevention)
• Experience handling services in a large-scale distributed systems environment
• Experience with container technologies such as Docker and Kubernetes
• Expert knowledge of systems programming (bash and shell tools) and practical, validated knowledge of at least one higher-level language (e.g., Python or Go)
• Strong bias for action over endless planning; willing to get hands dirty and occasionally make mistakes
• Strong belief in sharing and acquiring knowledge through mentorship, and in acting like an owner
Preferred:
• Experience working in an on-premises environment
Company:
Tesla is an electric vehicle and clean energy company that provides electric cars, solar, and renewable energy solutions. Founded in 2003, the company is headquartered in Austin, Texas, USA, and has more than 10,000 employees. It is currently a late-stage company.