Provide strong leadership to the SRE team, fostering a culture of collaboration, innovation, and accountability. Set clear goals, expectations, and priorities for the team to ensure the delivery of reliable and scalable services.
Google Cloud Platform Expertise: Demonstrate deep expertise in GCP services and tools, utilizing them to optimize infrastructure, automate processes, and improve system reliability.
Infrastructure Automation: Lead efforts to automate the deployment, scaling, and management of our infrastructure using tools such as Terraform, Ansible, and GCP Deployment Manager.
Monitoring and Incident Response: Design and implement effective monitoring and alerting systems to identify and respond to incidents promptly.
Capacity Planning and Performance Optimization: Conduct capacity planning to ensure our infrastructure meets current and future demands.