Senior SME - Cloud must have at least 6+ years of strong work experience in Google Cloud Platform and complementary technologies and should have relevant experience in the following technology areas:
GCP Iaas and PaaS technologies and services
Delivery Skills required:
- Compute Engine, GKE
- Web App Engine, Web Servers
- Storage - Cloud Storage, Filestore, Persistent Disks
- Networking - VPC, Cloud Load Balancing, Cloud DNS, Cloud VPN, Hybrid Connectivity
- Database - Cloud SQL, Datastore, Bigtable
- Security & Management - IAM, MFA, Automation, Deployment Manager, Audit Logs
- Hybrid - Cloud Identity, GCP Native Backup
- Portals - GCP Console, Cloud Monitoring
- Knowledge of Cloud SDK and Cloud CLI commands & scripts
Preferred Technical Skills:
- Google Cloud Platform Administration
- Unix Administration
- Windows Server Administration
- Bash Unix Shell
- Ansible
Networking and Hardware:
- Knowledge of scale up and scale down of VM sizes, auto scale, configure availability sets.
- Manage the subscriptions, cloud account, IAM s, Network configurations, delegation.
- Strong practical Linux systems administration skills in a Cloud or Virtualized environment
- Good knowledge of Windows OS administration and troubleshooting
- Good Knowledge in managing & resolving issues related to Networking component (Routing, VPN, Subnetting, IPSEC, Load Balancers)
- Able to handle Security related Tasks & Issues related to (SSL, TLS, Certificates, Firewalls/Web Application Firewalls, Authentication & Authorization, Active Directory, MFA, IPS/IDS)
- Perform Security related Complex Tasks related to Certificate Management, IP & Port Access, Access Control list.
Troubleshooting Skills:
- Collaborate with vendors for troubleshooting issues & implementation of changes
- Perform Root cause analysis for P1/ Major Incidents
- Comfortable in Troubleshooting and fix complex Production issues
Security:
- Assist team on security related tasks & issues, such as Certificate Management, Network & Port Access, Access Control list, User on-boarding & off-boarding, Password policy, etc.
- Able to handle Server less function tasks on monitor, manage & configuration permissions Serverless functions.
Management Tools:
- Proficiency in at least one configuration management tool like Ansible
- Perform Application Performance Management & Log analytics Management.
- Drive Continuous Improvement & Best Practices.
Co-ordination:
- Work closely with other Subject Matter Experts on cost optimization and Continuous Service Improvements
- Coordinate with other resolver groups for quick resolution of incidents and service requests.
- Vendor Co-ordination for troubleshooting issues & change implementation
- Handle escalations from L2/L3 team members.
Scripting languages:
- Manage GCP through CLI or Shell
- Good with scripting and automating tasks through Shell/Powershell scripting.
Reports:
- Manage Resource Tags, define and build cost & usage Reports.
- Candidate will lead analysis, recommendations, and presentation of approaches to customer s complex problems based on prior experience.
- Candidate will be involved in conducing GCP readiness assessments and evaluations and delivery of assessment reports, architecture/ design/ deployment documents & presentations
Process Compliance:
- Prepare RFCs and represent RFC in the change advisory board for approval of changes. Assign approved changes to L2 group to implement them.
- Conduct repeat incident analysis to identify proactive problem tickets and work on the permanent fix for the issues.
- Gather information from vendor, customer, analysis from the troubleshooting/investigation, resolution steps implemented etc. and update the tickets regularly, as per the defined process.
- Prepare a detailed report on the plan of action for the existing tickets for the next shift & engineer (Shift Handover Report)
- Follow through and execute the various Escalation Matrix (Vendor , Internal and Customer defined).
- Ensure tickets are closed post user/customer communication.
- Constantly track tickets for SLA adherence and escalate on deviations as defined in the process.
Behavioral Skills:
- Strong communication, presentation, and documentation skills
- Awareness of ITIL and/or CMM practices and processes.
- Understanding with IT compliance and risk management requirements (e.g. security, privacy, SOX, HIPPA etc. is a plus
- Strong technical writing skills to document knowledge articles and sample programs for knowledge base
- Ability to effectively communicate with customers and stakeholders on technical and business issues.
- Ability to be concise, articulate, and assertive in all forms of communication.
- Excellent analytical and troubleshooting skills.
- Must be highly motivated, results focused and act with a high sense of urgency.
- Ability to lead and work collaboratively with a group of senior technical / management teams.
- Strong knowledge on incident, capacity, change and problem management processes.
- Willingness to work in flexible hours as needed by the business.