FMCG
Lead and mentor a team of Site Reliability Engineers responsible for maintaining and improving the reliability of our systems on Azure.
Collaborate with cross-functional teams, including DevOps, Development, and Infrastructure, to implement and enhance best practices for reliability and performance.
Develop and implement strategies for monitoring, alerting, and incident response to ensure proactive identification and resolution of issues.
Work closely with Azure services, making informed decisions on resource optimization, scalability, and cost efficiency.
Design and implement automation processes for deployment, configuration, and scaling of Azure resources.
Stay current with industry trends and best practices, and apply this knowledge to continually improve our SRE processes and methodologies.
Collaborate with security teams to ensure the integrity and security of our infrastructure on Azure.
Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
Proven experience in a leadership role within Site Reliability Engineering, with a focus on Microsoft Azure.
Strong understanding of cloud computing principles, especially in the context of Azure services.
In-depth knowledge of infrastructure as code (IaC) tools, such as Terraform or Azure Resource Manager.
Experience with containerization technologies, such as Docker and Kubernetes.
Proficiency in scripting languages, such as PowerShell or Python.
Excellent problem-solving and troubleshooting skills, with a proactive and collaborative approach to resolving issues.
Strong communication skills and the ability to collaborate effectively with cross-functional teams.
Relevant certifications in Microsoft Azure (e.g., Azure Solutions Architect Expert, Azure DevOps Engineer Expert) are a plus.
Product Development
Date Posted: 14/07/2024
Job ID: 84723933