InfraCloud Technologies

Site Reliability Engineer

Job Description

What we are looking for:

  • Hands-on experience deploying and managing Kubernetes clusters in air-gapped environments is required, preferably within financial institutions or similarly regulated industries. Familiarity with a cloud-native CI/CD tool used for Kubernetes deployments is necessary.
  • Knowledge of GKE (Google Kubernetes Engine, on Google Cloud Platform) and RKE (Rancher Kubernetes Engine) is preferred.
  • Knowledge of Rook-Ceph distributed storage is preferred.
  • You should be able to manage multi-region clusters for disaster recovery.
  • You should have experience working with programmable infrastructure, such as building a CI/CD pipeline.
  • Familiarity with monitoring and observability tools such as Prometheus, Grafana, and the ELK stack is expected.
  • You should possess good knowledge of Linux operating systems and be proficient in troubleshooting OS issues.
  • You should not use terms like high availability or resilient systems without a thorough understanding of their fundamentals, because building such systems in practice requires significant effort.
  • Knowledge of security and compliance frameworks and best practices relevant to financial institutions is necessary.
  • Preferred certifications include CKA (Certified Kubernetes Administrator) and CKAD (Certified Kubernetes Application Developer).

What you will be learning and doing

  • You will be working with customers who are transforming their applications and adopting cloud-native technologies. The technologies involved include Kubernetes, Prometheus, service meshes, and public cloud or on-premises infrastructure.
  • The problems and solutions in this space are continuously evolving, but fundamentally you will be solving problems with the simplest and most scalable automation.
  • You will be hacking on open source projects, understanding their capabilities and limitations, and applying the right tool for the right job.
  • You will be educating customers, from their operations engineers to their developers, on scalable ways to build and operate applications on modern cloud-native infrastructure.

More Info

Industry: Other

Function: Technology

Job Type: Permanent Job

Date Posted: 11/07/2024

Job ID: 84118819

Last Updated: 18-11-2024 06:06:57 PM