Why this job matters
The Site Reliability Engineering Professional is responsible for supporting BT to be in the best position to deliver the service performance, reliability and availability that internal and external customers expect.
What You'll Be Doing
1 - Supports the implementation of new software development life cycle automation tools, frameworks, and code pipelines (continuous integration/continuous delivery pipelines), helps to elevate the organisation using best practices with a focus on the re-use of application code, demonstrates consistent software delivery practices and produces continuous integration/continuous delivery platform solutions using Amazon Web Services Cloud, infrastructure as code (IaC), GitOps, and container technologies
2 - Supports teammates and engineering teams to identify and implement requirements for building a high-end developer experience enabling quick, autonomous, and secure delivery of production changes
3 - Supports the maintenance of monitoring tooling used to optimise systems for uptime, performance, and reliability
- Executes tests to investigate how the infrastructure handles failure and scaling
- Supports the execution of approaches that scale systems sustainably through automation mechanisms and evolves systems by pushing for changes that improve reliability and velocity
- Supports the delivery of infrastructure as code software to improve the availability, scalability, latency, and efficiency of services
- Executes quality control/quality assurance on new clusters and software deployments
- Supports the operation and management of distributed storage architecture
- Monitors queue and support processing to support in the identification of early warning of support issues
- Supports in the implementation of ways to improve working processes within the area of site reliability engineering responsibility, such as contributing to the design of continuous integration/continuous delivery systems
The Skills You'll Need
Troubleshooting
Infrastructure Configuration
Debugging
Continuous Improvement
Application Performance Monitoring & Alerting
Release Management
Programming/Scripting
Operating Systems
IT Security
Cloud Computing
Data Analysis
Agile Methodologies
Software Testing
Continuous Integration/Continuous Deployment Automation & Orchestration
Incident Management
Decision Making
Growth Mindset
Inclusive Leadership
Our leadership standards
Looking in:
Leading inclusively and Safely
I inspire and build trust through self-awareness, honesty and integrity.
Owning outcomes
I take the right decisions that benefit the broader organisation.
Looking out:
Delivering for the customer
I execute brilliantly on clear priorities that add value to our customers and the wider business.
Commercially savvy
I demonstrate strong commercial focus, bringing an external perspective to decision-making.
Looking to the future:
Growth mindset
I experiment and identify opportunities for growth for both myself and the organisation.
Building for the future
I build diverse future-ready teams where all individuals can be at their best.