Job Purpose:
- Design, implement, configure, install and support various Linux distributions
- Support existing Linux (RHEL, Ubuntu, CentOS) and virtualization server environments
- Work closely with infrastructure and application teams to find the right trade-off between business-specific needs and DevOps practices
Skills, Technical Competences & Behaviors:
- Certifications: Linux, AWS, Azure, DevOps
- 5+ years of professional experience working exclusively with AWS and Linux.
- Minimum of 5 years' experience designing and implementing Red Hat Linux server systems to suit varying application and availability requirements
- Hands-on experience with EC2, ELB, EBS, S3, Direct Connect, Lambda, VPC, IAM, CloudFormation, Auto Scaling and RDS is a plus
- Deployment, automation, management, and maintenance of AWS cloud-based systems.
- Ensuring availability, performance, security, and scalability of AWS production systems.
- Knowledge of monitoring tools, preferably Zabbix, CloudWatch, Prometheus, Grafana, etc.
- Knowledge of containerization and orchestration tools.
- Experience with configuration management tools (Ansible).
- Familiarity with CI/CD tools (Jenkins, GitLab).
- Knowledge of Apache and Nginx
- Knowledge and application of ITIL and best-practice methods in a medium-to-large organisation
- Experience working together with application owners, business units and 3rd parties to deliver shared goals
- Hands-on solution design skills and the ability to objectively quality-assure third-party solution designs to ensure they meet business expectations.
- Strong understanding of different cultural working patterns.
- Fluent in English.
- Excellent communication, presentation and interpersonal skills.
- Bear personal responsibility and demonstrate quality awareness.
- Behave loyally and comply with rules, regulations and legal requirements.
Main Tasks & Responsibilities:
- Developing and implementing configurations for 24x7x365 server availability, security and performance, and delivering system updates.
- Providing L2 and L3 support for Linux servers as requested by various constituencies.
- Understanding the fundamentals of large-scale, mission-critical on-premises and cloud systems: networking, security, redundancy, scalability, monitoring and performance.
- Collaborate with other teams and team members to develop automation strategies and deployment processes.
- Experience configuring Apache, Tomcat, Postfix, LDAP and SMTP, and using the system and configuration management tool Ansible.
- Proven experience with risk-based systems running on UNIX/Linux platforms, including OS clustering, partitioning, and virtualization.
- Maintain best practices on managing systems and services across all environments
- Fault finding and analysis of logging information for reporting of performance exceptions
- Provide input on ways to improve the stability, efficiency and scalability of the environment.
- Expert in Shell, Perl, and/or Python scripting
- Maintain and monitor all system frameworks, provide on-call support for all systems, and keep Linux knowledge up to date.
- Hands-on knowledge of monitoring tools, preferably Zabbix, CloudWatch, Prometheus, Grafana, etc.
- Implement CI/CD pipelines using tools like Jenkins and GitLab.
- Maintain detailed documentation of infrastructure configurations, processes, and procedures.
- Prepare and present reports on infrastructure patching, vulnerabilities and compliance status.
- Keep abreast of industry trends and emerging technologies to recommend improvements.
- Experience with information security and vulnerability management principles.
- Experience applying Windows patches, remediating Windows vulnerabilities and troubleshooting updates is a plus.
KPIs:
- Minimize unplanned downtime and ensure quick recovery from incidents.
- Knowledge-sharing sessions and documentation.
- Increase the share of infrastructure and processes that are automated.
- Reduce manual intervention and errors in deployment processes.
- SLA achievement
- On-time delivery of projects