Job Summary
We are seeking an experienced Senior DevOps / Site Reliability Engineer (SRE) with strong application and infrastructure knowledge. The role requires hands-on expertise in AWS, Kubernetes, CI/CD, monitoring, and .NET-based applications to ensure high availability, reliability, and performance of production systems.
Key Responsibilities
- Ensure application availability and reliability as per defined SLOs
- Deploy applications using CI/CD pipelines
- Handle production issues, including reading application code to troubleshoot and debug
- Troubleshoot new errors logged after releases and provide root cause analysis
- Set up new alerts and synthetic monitoring prior to releases
- Automate repetitive operational tasks
- Forecast potential issues and bottlenecks to prevent major application impact
- Set up lower environments and support infrastructure activities
- Collaborate with development and operations teams to resolve issues efficiently
- Support Agile development and release methodologies
Required Skills & Experience
- Strong experience in DevOps and Site Reliability Engineering (SRE)
- Hands-on experience with AWS, Kubernetes, and Docker (containerization is mandatory)
- Experience with CI/CD tools such as Octopus Deploy and Azure DevOps
- Strong experience with monitoring tools like Dynatrace, Splunk, and Mouseflow
- Good understanding of operating systems and container orchestration concepts
- Experience with multi-threaded, performance-intensive applications
- Strong troubleshooting and debugging skills
- Experience with source control systems such as Git and TFS, and with continuous build pipelines
- Strong experience in RDBMS including Oracle and MS SQL Server
- DevOps automation experience using Ansible or other scripting tools
Competencies
- Strong problem-solving and analytical skills
- Excellent communication and collaboration skills
- Ability to work across application and infrastructure layers
- Proactive mindset toward automation and reliability
- Ability to work in fast-paced production environments
Preferred Skills
- Application development background (preferably .NET; other web environments acceptable)
- Experience working in Agile methodologies
- Exposure to enterprise-scale monitoring and observability practices