Monitoring and Alerting: Design and implement monitoring and alerting systems that detect issues promptly so that SLAs are met.
Drive the implementation of SLOs (Service Level Objectives) and SLIs (Service Level Indicators) to ensure the availability and performance of critical systems.
Experience with PowerShell and Python/C#.
Strong experience with container technologies, including Kubernetes and Docker.
Working knowledge of deploying and monitoring SaaS applications in the cloud, especially Azure.
Collaborate with DevOps, engineering, and IT teams to identify opportunities for automation and to implement Infrastructure as Code (IaC) practices using tools such as Terraform, Ansible, or CloudFormation.
Implement automation tools and processes to enhance governance, compliance monitoring, and risk management in cloud environments.
Experience with cloud monitoring tools such as Grafana, Azure Application Insights, and Prometheus.
Good verbal and written communication skills.
Freshers are not eligible to apply for this role.