Provide hands-on support for the runtime operation of our applications, ensuring high availability and performance, drive Mean Time to Restore service level targets.
Participate in incident response and root cause analysis efforts to ensure the stability and resilience of the applications.
Collaborate with software engineering and infrastructure teams to troubleshoot and resolve runtime issues, including performance bottlenecks, scalability challenges, and system failures.
Contribute to the design and implementation of monitoring, alerting, and logging solutions to proactively identify and address potential runtime issues.
Work closely with cross-functional teams to understand the application architecture and provide input on runtime and operational requirements early in the software development lifecycle.
Support and monitoring of software across test, integration, and production environments using Continuous Integration and Continuous Delivery tools, e.g., Jenkins, etc.
Automate deployments in test or production environments using standard
Explores new automation techniques to refine the agility, speed and quality of engineering initiatives and efforts
Contribute to the development and maintenance of runtime automation and tooling to streamline operational processes, improve efficiency, and reduce toil.
Drive automation and ensure automated test scripts are completed for new features.
Manage time effectively and work independently and as part of a team.
Mentor peers and demonstrate a passion for continuous learning environment for the team.
Good attitude, communication, willingness to learn and collaborate.
Bring a culture of innovation, ideas, and continuous improvement.
Challenging status quo, demonstrate risk taking, and implement creative ideas.
Qualifications
BS degree in Computer Science, Computer Engineering, other technical discipline.
Minimum 3 years of work experience in software engineering, app support or infrastructure operations or runtime engineering.
A working understanding of cloud infrastructure, distributed systems, and containerization technologies, with experience in supporting critical business applications being a plus.
Familiarity with monitoring and logging tools, and incident management best practices, to ensure reliability and performance of applications in a production environment.
Solid programming and scripting skills, with hands on experience to automate operational tasks using tools such as Python, Bash, Selenium, Linux and Ansible.
Excellent problem-solving and troubleshooting abilities, focused on understanding, and resolving runtime issues quickly.
Practical experience with implementation of system integration solutions, such as, APIs and Data integration layers (Batch Real-Time).
Experience with Continuous Integration and Continuous Delivery environment and platforms, such as GIT, Maven, and Jenkins
Effective communication skills with ability to explain concepts to product managers and business partners and translate technical terms to meet business needs. Excellent collaboration skills, with the ability to work effectively in a cross-functional environment.
Hands on experience with relational and NoSQL databases such as DB2, Redis, Postgres, Couchbase etc.
Benefits include:
Competitive base salaries
Bonus incentives
Support for financial-well-being and retirement
Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location)
Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need
Generous paid parental leave policies (depending on your location)
Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)
Free and confidential counseling support through our Healthy Minds program