Upgrade the OCP / Patching, onboard new application to OCP platform, Portworx software defined storage , Prometheus monitoring, Hwkular metrics , Ansible . Add Remove resources on need basis.
Capacity planning and Customer capacity provisioning Implement self healing for known issues.
Infrastructure Automation and Provisioning
Activities:
Design and build hardware solutions
Build and manage bootstrapping automation Design and automate VM and host provisioning Datacenter design and implementation Linux system administration Scripting and automation Knowledge of storage Knowledge of network design and implementation Security Installation and Management of the OpenShift Platform
Activities:
Perform cluster install
Manage infrastructure services
Manage platform scale
Platform authentication and authorization Linux system administration
Knowledge of networking
Scripting and automation (Ansible)
Knowledge of storage
Knowledge of containers and container architectures
Knowledge of Kubernetes and OpenShift architecture
Platform security
Monitoring integration
Managing Tenant Provisioning, Isolation, and Capacity
Activities:
Adding users and teams to the platform
Design and manage quotas
Design and implement RBAC
Knowledge of Kubernetes and OpenShift architecture
Knowledge of containers and container architecture
Scripting and automation
Deep knowledge of projects, quotas, limits, roles, role bindings, and scheduling
Building and Maintaining Base Images
Activities:
Develop image change workflow
Develop standard base images
Linux system administration
Scripting and automation
Application and middleware runtime configuration Knowledge of container architectures Application build frameworks Deep knowledge of images, imagestreams, templates Building and Maintaining Base Images
Activities:
5+ years of working experience in OVM , KVM, RHEV, docker container/ openshift
Experience in deploying; maintaining and supporting Linux Server infrastructures and storage systems in an enterprise environment
Maintain security and OS patch management for linux based systems
Responsible for resolving all technical incidents escalated by the L-2 team
Participate in Problem Management review meetings and provide root cause analysis
Ability to handle and drive Incident Review bridges by identifying potential Problems; recommend solutions to be implemented
Responsible for SLA compliance
Should possess work experience on migration/upgradation projects
Basic knowledge on scripting and automation
Communicate effectively (verbal and written) and clearly within the team and with all the stakeholders Good knowledge on ITIL process
Technical Certification a must Redhat Linux
Possess ITIL certification
On call schedule and week end change schedule participation