Job Summary:
Barclays is seeking a Senior Site Reliability Engineer - AVP - Credit Trade Floor to enhance the reliability, stability, and availability of their trading platforms. The role involves collaborating with technology and infrastructure teams to provide technical support, conduct root cause analysis, and implement monitoring frameworks.
Responsibilities:
โข Provision of technical support for the service management function to resolve more complex issues for a specific client of group of clients. Develop the support model and service offering to improve the service to customers and stakeholders.
โข Execution of preventative maintenance tasks on hardware and software and utilisation of monitoring tools/metrics to identify, prevent and address potential issues and ensure optimal performance.
โข Maintenance of a knowledge base containing detailed documentation of resolved cases for future reference, self-service opportunities and knowledge sharing.
โข Analysis of system logs, error messages and user reports to identify the root causes of hardware, software and network issues, and providing a resolution to these issues by fixing or replacing faulty hardware components, reinstalling software, or applying configuration changes.
โข Automation, monitoring enhancements, capacity management, resiliency, business continuity management, front office specific support and stakeholder management.
โข Identification and remediation or raising, through appropriate process, of potential service impacting risks and issues.
โข Proactively assess support activities implementing automations where appropriate to maintain stability and drive efficiency. Actively tune monitoring tools, thresholds, and alerting to ensure issues are known when they occur.
Qualifications:
Required:
โข Demonstrated ability to manage incidents effectively, including rapid troubleshooting, solution-oriented stakeholder communication, and conducting root cause analysis (RCA) to prevent recurrence
โข Considerable expertise in systems engineering, with deep knowledge of Operating systems like Windows, Linux, Networking fundamentals Container orchestration Kubernetes, Cloud platforms experience AWS and Azure. Automation using Python for scaling system reliability and operational efficiency
โข Confirmed capability to approach complex problems methodically, analyze issues, and implement effective solutions to maintain high system reliability and performance
โข Practical experience in implementing monitoring, alerting, and observability frameworks for critical platforms, tools such as ITRS Geneos. Automation of manual operational processes to improve efficiency and reduce risk
โข Solid experience working with databases and SQL, supporting troubleshooting, data analysis, and system performance optimization
Preferred:
โข Prior experience supporting Credit or other Investment Banking (IB) asset classes, including Rates, Equities, or FX, with an understanding of trading workflows and time-sensitive environments
โข Experience working with IaaS and/or PaaS solutions, along with practical exposure to Virtualization technologies, containerization tools Docker, Orchestration platforms Kubernetes, Compute, network, and storage infrastructure management
โข Familiarity with or working knowledge of Generative AI tools, with the ability to leverage them for automation, analysis, or operational efficiency
โข Good communication and stakeholder management skills, with the ability to effectively engage and collaborate with senior business and technology stakeholders
Company:
Barclays is a transatlantic consumer and wholesale bank with global reach, offering products, and services. Founded in 1690, the company is headquartered in London, GBR, with a team of 10001+ employees. The company is currently Late Stage.