Job Title: SRE Lead
Experience: 10+ years
Mandatory Skills:
- Modern observability stack: Splunk, Elasticsearch, Prometheus, Grafana
- Cloud-based SRE practices and experience with platforms such as AWS, Azure, or Google Cloud
- Containerization technologies (e.g., Kubernetes, Docker) and microservices architecture
- DevOps practices
- Programming Skills: Java/Golang/JavaScript
Other Skills: Excellent communication, prior experience in a client-facing role, team lead experience, prior experience in large MNCs
Responsibilities:
- SRE Strategy and Leadership: Lead a team of 20 SRE professionals (3 POD teams) within the client portfolio to drive the reliability, performance, and scalability of GRC technology solutions.
- Client Interaction Role: Interact with the client on day-to-day tasks, risks, and mitigation plans.
- Observability and Monitoring: Establish observability practices to ensure real-time insights into system performance, availability, and customer experience. Implement monitoring tools, metrics, and dashboards to proactively identify and address potential issues.
- Production Support Optimization: Lead all aspects of the end-to-end production support process, including incident management, problem resolution, and service-level agreement (SLA) compliance. Drive continuous improvement initiatives to enhance operational effectiveness and reduce mean time to resolution (MTTR).
- GRC Customer Journeys: Collaborate with cross-functional teams to enhance customer journeys through seamless and reliable technology experiences.
- Reliability Engineering Best Practices: Promote and implement standard methodologies, including error budgeting, chaos engineering, and disaster recovery planning. Cultivate a culture of resilience and reliability within technology.
- Automation and Efficiency: Champion automation initiatives to streamline operational workflows, deployment processes, and incident response tasks. Leverage automation tools and orchestration to improve reliability and reduce manual intervention.
Qualifications:
- 10+ years of experience and a degree or equivalent experience in Computer Science, Information Technology, or a related field. Advanced certifications in SRE or related areas are a plus.
- Deep understanding of observability tools and methodologies, including experience with logging, monitoring, tracing, and performance analysis platforms
- Strong leadership, people management, and client interaction skills, with the ability to inspire and empower successful SRE teams