IBM

SRE

Exp: 0-2 Years

IT/Computers - Hardware & Networking

Job Description

Introduction
At IBM, work is more than a job - it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, let's talk.

Your Role and Responsibilities
  • Monitoring the health of the IKS control plane and ensuring reliable operations
  • Responding promptly to production issues and alerts
  • Executing changes in the production environment through advanced automation
  • Partnering with other SRE teams and program managers to deliver mission-critical services
  • Supporting the development and enhancement of Platform-as-a-Service services
  • Implementing and automating solutions that support IBM Cloud products
  • Ensuring compliance and security integrity of the environment
  • Collaborating with Engineering to troubleshoot and resolve production issues
  • Providing technical escalation support for other Infrastructure Operations teams


Required Technical and Professional Expertise
  • Expertise in Kubernetes architecture, including the latest features and security aspects
  • Strong debugging skills in Kubernetes environments.
  • Strong experience in programming with Python or Go, with demonstrated ability to develop and maintain complex codebases.
  • Proficiency in network configuration and advanced monitoring solutions such as Prometheus, Sysdig, and Grafana
  • Experience in hands-on administration of cloud infrastructure, particularly Kubernetes-based platforms.
  • Skills in performance tuning and optimization of Kubernetes clusters, including resource quota management, scaling, and efficient use of underlying infrastructure.
  • Understanding of network protocols (TCP/IP, HTTP, etc.) and network configuration tools (e.g., CNI) specific to Kubernetes environments.
  • Deep understanding of Kubernetes security practices, including network policies, security contexts, role-based access control (RBAC), and the secure handling of secrets.
  • Knowledge of automation and configuration management tools: Ansible, Salt, Chef, Terraform
  • Strong Linux skills for managing services across a microservices platform
  • Ability to implement robust incident management strategies and frameworks
  • Experience in performance optimization of Kubernetes clusters
  • Understanding of disaster recovery planning and high availability setups in Kubernetes environments
  • Excellent written and verbal communication skills, with a willingness to take on call-out responsibilities
  • Experience establishing and improving procedures within a mission-critical environment


Preferred Technical and Professional Expertise
  • Hands-on experience with at least one cloud platform (IKS, AWS, Azure, GCP), including integrating cloud services for storage, security, and databases
  • Knowledge of Slack bot automations for infra/cloud maintenance and SRE-based automations
  • Active participation in Kubernetes communities and forums
  • Vendor management skills to ensure optimal service levels and cost control
  • Ability to mentor and train teams on Kubernetes best practices and operational strategies

Date Posted: 10/10/2024

Job ID: 95831933

About Company

IBM

About IBM
IBM's greatest invention is the IBMer. We believe that through the application of intelligence, reason and science, we can improve business, society and the human condition, bringing the power of an open hybrid cloud and AI strategy to life for our clients and partners around the world. Restlessly reinventing since 1911, we are not only one of the largest corporate organizations in the world, we're also one of the biggest technology and consulting employers, with many of the Fortune 50 companies relying on the IBM Cloud to run their business. At IBM, we pride ourselves on being an early adopter of artificial intelligence, quantum computing and blockchain. Now it's time for you to join us on our journey to being a responsible technology innovator and a force for good in the world.

Being You @ IBM
IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.

Last Updated: 26-11-2024 06:53:50 PM