Search by job, company or skills

Reflections Info Systems

Site Reliability Engineer

Early Applicant
  • 19 days ago
  • Be among the first 50 applicants

Job Description

We are looking for an experienced and proactive Site Reliability Engineer (SRE) to join our team. This role is focused on improving the reliability, scalability, and performance of our applications and infrastructure. The ideal candidate will possess strong troubleshooting skills, a holistic approach to problem-solving, and the ability to engineer solutions that enhance system resilience and reliability.

Experience: 3+ years

Location : Trivandrum, Chennai( Hybrid), Remote

Main duties/responsibilities

Work closely with the application support team.

Monitor critical applications and services to minimize downtime and ensure their availability.

Collaborate with DevOps teams to maintain and monitor CI/CD pipelines.

Deploy new versions to production environments.

Work with project teams to ensure the reliability and maintainability of new and modified releases.

Provide input to risk management practices that will anticipate reliability-related incidents that could adversely impact operations.

Document processes and monitor application performance metrics.

Continuously improve proactive monitoring alert configuration and incident response processes to increase reliability and reduce Mean Time to Recovery (MTTR ).

Optimize performance and cost efficiency through continuous monitoring, trend analysis, and fine-tuning.

Monitor any abnormal usage that can impact the cost or performance and take corrective actions.

Proactively implement preventive measures to improve system reliability.

Maintain runbooks, Standard Operating Procedures (SOPs), diagrams, and documentation for swift incident response.

Conduct post-incident reviews to improve reliability and contribute to the development of resilience strategies.

Achieve Service Level Indicators (SLIs) that are set to meet reliability objectives.

Experience

Experience in SRE/DevOps with a focus on Ops.

2+ years of experience in AWS Cloud Infrastructure.

Familiarity with CI/CD pipelines and version control systems.

Experience in Project Management and issue tracking tools such as JIRA/SysAid.

Fluent in AWS key services (EBS, S3, AWS Compute, Storage, RDS etc).

Expertise in Kubernetes or any Container Orchestration System.

Knowledge of Infrastructure as a Code.

Linux system administration knowledge.

Knowledge of RDBMS and Document databases.

Knowledge of Monitoring tools including AWS CloudWatch and NewRelic.

Additional certification in Microsoft, Linux, Cisco, AWS or similar technologies is a plus.

Aspirants May Share their updated resume to [Confidential Information]

More Info

Industry:Other

Job Type:Permanent Job

Date Posted: 12/11/2024

Job ID: 99955075

Report Job

About Company

Hi , want to stand out? Get your resume crafted by experts.

Similar Jobs

Site Reliability Engineer Datacenter Cloud Ops

Onemind Cloud ServicesCompany Name Confidential

Site Reliability Engineer APAC

CanonicalCompany Name Confidential
Last Updated: 25-11-2024 09:18:57 PM
Home Jobs in Thiruvananthapuram Site Reliability Engineer