Run Engineering functions, including managing people and a team across multiple location
Building high-performing teams by developing and nurturing Engineering teams through cultural change,
Supporting, challenging and building consensus on design directions/decisions to ensure they are viable from a Cloud perspective.
Ability to work in a constantly evolving and dynamic environment.
Assess the IT infrastructure and operations organization (people, process, and technology) to determine best course of journey on On prem and Cloud (AWS)
Assess the current deployment process to identify bottlenecks and implement solutions towards a continuous deployment and continuous integration (CI/CD). Implement, and manage a robust Sre function.
Implement and manage processes and controls that assure maximum uptime and quick service to the user community for Cloud workloads
Management and continual improvement of Cloud Operations, DevOps, and SREs.
Partner with InfoSec to deliver key information security and IT risk related initiatives. Furthermore, ensure compliance to patching and vulnerability policies established within the organization.
Develop and implement a robust Disaster Recovery strategy for critical systems and infrastructure.
Assess single points of failure in infrastructure and recommend actions as appropriate.
Oversee the monitoring, maintenance, upgrade, and administration of all IT systems, to include applications, servers, storage, databases, containers, and Cloud related services
Participate in the 24x7 support coverage as needed .
Requirements
Demonstrated experience transforming IT infrastructure and moving an organization toward a mature cloud-based (AWS) service delivery model.
Proven experience in AWS managing SRE and DevOps teams.
Experience with managing on-prem workloads as well as migrating workloads to Cloud, specifically AWS while optimizing the cost of Cloud based workloads.
Possess a deep experience in SRE/DevOps with proven results enabling continuous deployment and continuous integration (CI/CD)
Experience and background in infrastructure architecture, systems administration, network administration and storage administration.
Extensive experience with instrumentation, enterprise monitoring and reporting tools.
Demonstrates the business and financial acumen necessary to develop and present data- based ideas and solutions in a clear, concise, organized manner.
Working experience and understanding of Infrastructure as a code tool such as Terraform, Chef, Ansible, CloudFormation etc.
Experience building and maintaining cloud-native applications Experience using DevOps tools in a cloud environment, such as Ansible, Docker, GitHub, Jenkins, and Kubernetes.
Skills
Experience leading, directing, and mentoring SRE and DevOps teams.
Solid leadership skills in leading an IT infrastructure and operations organizations towards Cloud- first operations and DevOps mindset
Good organization change management skills and the ability to manage multiple priorities concurrently.
Strong analytical capability and excellent verbal and written communication skills.