Siemens

Site Reliability Engineer (SRE) - Manager

Job Description

Organization Overview

As part of the Siemens DISW SRE organization, this position makes significant contributions towards the delivery of automated solutions that support best-in-class cloud-based applications. Our team is looking for a leader who is passionate about automation. SREs discover ways to help promote the availability of services and applications, improve processes through remediation of manual and/or repetitive tasks, and address complex technical problems in a fast-paced, collaborative, inclusive, and iterative environment.

Job Profile/Position Overview

The candidate will be responsible for supporting the Siemens Xcelerator platform by identifying, managing, and enhancing efficiencies in availability, resiliency, reliability, and stability. This role demands strong technical leadership to drive innovative solutions and refine processes to achieve operational excellence. Establishing and maintaining strong relationships with product teams across the Xcelerator platform is essential for aligning with core objectives. The success of this role will be determined by the ability of DISW business unit product teams to consistently meet or exceed their SLAs.

In this role, you will lead a team of versatile Site Reliability Engineers, overseeing the design, deployment, and maintenance of our production systems. You will be pivotal in ensuring the reliability, scalability, and performance of our infrastructure while spearheading continuous improvement efforts. Your deep expertise in SRE practices and proficiency with the specified technologies will empower you to guide the team towards achieving operational excellence.

Responsibilities/Tasks

  • Lead the design, deployment, automation, and integration of scripting solutions to enhance capabilities, visibility, and efficiency.
  • Collaborate with leaders across technical platforms and partners to engineer automated, integrated solutions that improve tool, service, and team interactions, increasing availability, reliability, and performance.
  • Oversee and ensure that both internal and external SLAs consistently meet or exceed expectations.
  • Continuously review and refine SRE standards, processes, and standard practices, particularly in incident response and toil reduction.
  • Manage a team of engineers participating in a 24/7 on-call rotation to support our production infrastructure.
  • Join incident calls that exceed an acceptable duration.
  • Ensure comprehensive post-mortem analysis of production incidents, driving continuous improvement initiatives.

Required Knowledge/Skills, Education, and Experience

  • 7+ years of professional experience in SRE or DevOps, with 3+ years of experience in a leadership role.
  • Proven experience with automation via scripting and API development
  • 2+ years of experience with observability tools (Datadog, CloudWatch, CloudTrail, Elastic Stack, Grafana, or equivalent tools)
  • 2+ years of experience with containerization, specifically Kubernetes
  • 2+ years of experience with Amazon Web Services (AWS) services
  • 2+ years of experience with Terraform, CloudFormation, Ansible, or equivalent tools
  • 2+ years of experience with issue/incident tracking tools

Preferred Knowledge/Skills, Education, and Experience

  • Familiarity with agile methodologies and experience working in an Agile/Scrum environment.
  • Desired certifications: Datadog, Kubernetes, AWS, or Azure
  • 2+ years of experience as a Site Reliability Engineer or equivalent role (ServiceNow, ServiceDesk, Jira, or equivalent tools)
  • 2+ years of experience with log management tools (e.g., ELK Stack)
  • 2+ years of experience in an enterprise IT environment with distributed environments
  • Senior-level system administration experience, including troubleshooting, support, mentorship/training, and oversight

We are Siemens

A collection of over 377,000 minds building the future, one day at a time in over 200 countries. We're dedicated to equality, and we encourage applications that reflect the diversity of the communities we work in. All employment decisions at Siemens are based on qualifications, merit, and business need. Bring your curiosity and creativity and help us shape tomorrow!

We offer a comprehensive reward package that includes a competitive basic salary, bonus scheme, generous holiday allowance, pension, and private healthcare, and we actively support working from home.

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, colour, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status.

Transform the everyday and Accelerate transformation

#li-plm

#LI-Hybrid

#SWSaaS


More Info

Function: Technology

Job Type: Permanent Job

Date Posted: 02/10/2024

Job ID: 94645541

Last Updated: 15-11-2024 00:14:55 AM