- As we modernise the product further, we anticipate a growing landscape of microservices. Work closely with our architects and DevOps team to bring a resilience and availability lens to these conversations as we shape the future state of the product.
- Clearly reporting any defects found, capturing logs and scenarios, and reproducing issues as required to support software investigations.
- Supporting the product team and the development leadership team in understanding the current state of the product, and informing prioritisation decisions for product improvement (e.g. resilience improvement work vs. feature delivery, tech debt resolution).
Basic Qualifications
Strong experience as an SRE for a complex cloud-based microservices product, able to articulate specific improvements you have driven, the approach taken and the benefit delivered.
Understanding of high availability systems in a cloud landscape.
Broad technical understanding covering software, data, DevOps and infrastructure.
Experience with escalated incident management under pressure, and understanding of incident management approach, incident communications, evidence-based decision making and lessons learned.
Experience working with an Agile (preferably Scrum), iterative development approach.
Strong written and verbal communication skills in English.
Enthusiasm and ability to collaborate well with others, including remote teams.
Professional pride, drive and curiosity: a diligent self-starter who keeps up to date with best practice and keeps their skill set sharp.
Strong problem-resolution skills.