Role: Senior Devops Engineer
Experience: 5 to 8 years
Job Location: Bangalore
Working mode: Hybrid, 3 days working from office
Responsibilities:
Analyze and improve the efficiency, scalability, and reliability of our backend systems
Build and mature automation tools for robust continuous integration and deployment pipelines
Build scalable, secure, and measurable infrastructure with code
Facilitate capacity planning
Champion code health, rigorous testing, and maintainability standards
Create automation of engineering deployments
Create scalable and reliable monitoring and alerting that works
Create actionable documentation and playbooks, and when possible automation, to resolve recurring issues and proactively address issues before impact is felt
Design, build, and upkeep tools, systems, and self-service options to elevate engineering team productivity and reduce toil
Maintain a stable, scalable, and secure development environment while keeping abreast of the latest DevOps innovations
Own and maintain Apixio services and data infrastructure in production
Support disaster recovery design, implementation, and testing
Support engineering teams in implementing system reliability
When things go bad, perform advanced troubleshooting of our systems
And you will have knowledge of many of the following:
Amazon Web Services (AWS) and Application Programming Interface design and best practices
Experience with CI/CD tools such as Jenkins, GitLab CI, CircleCI, GitHub Actions) and version control systems such as Git
Experience with deployment and config management systems like Salt Stack, Ansible, and HashiCorp
Experience with monitoring and logging applications like FluentD, Graylog, and Datadog
Familiarity with containerization and orchestration technologies
Knowledge of cloud services (e.g., AWS, GCP, Azure) and infrastructure as code (e.g., Terraform, CloudFormation)
Proficiency in version control systems like Git and CI/CD tools like Octopus Deploy or ArgoCD
Strong communication and collaboration skills to work effectively with cross-functional teams
Strong knowledge of best-in-class security practices and testing methods
Strong knowledge of internet service architecture (TCP/IP, HTTP, DNS, routing, load balancing)
Strong knowledge of the configuration and maintenance of common big infrastructure components such as Cassandra, Redis, FluentD, Apache/Django/Flask, Kafka, Redis, Elasticsearch & Hadoop
Strong scripting skills like Python, Ruby, or Bash
Strong understanding of Unix and system administration
A strong candidate will have:
A BS, MS in Computer Science / Engineering or equivalent
A passion for improving the developer experience and empowering engineers.
A passion for killer SLA's, Secure Infrastructure and Automation of Everything
Past experience and success stories managing a significant infrastructure in AWS, and have maintained a 24x7 commercial SLA