Provide hands-on support for the runtime operation of our applications, ensuring high availability and performance.
Collaborate with software engineering and infrastructure teams to troubleshoot and resolve runtime issues, including performance bottlenecks, scalability challenges, and system failures.
Contribute to the design and implementation of monitoring, alerting, and logging solutions to proactively identify and address potential runtime issues.
Participate in incident response and root cause analysis efforts to ensure the stability and resilience of the applications.
Work closely with cross-functional teams to understand application requirements and provide input on runtime and operational considerations during the software development lifecycle.
Contribute to the development and maintenance of runtime automation and tooling to streamline operational processes and improve efficiency.
Manage your own time and work well both independently and as part of a team.
Mentoring your peers and demonstrate a passion for continuous learning environment for the team.
Drive automation and ensure automated test scripts are completed for new features.
Good attitude, communication, willingness to learn and collaborate.
Bring a culture of innovation, ideas, and continuous improvement.
Challenging status quo, demonstrate risk taking, and implement creative ideas.
Qualifications
BS degree in Computer Science, Computer Engineering, other technical discipline.
Minimum 5 years of work experience in software engineering, infrastructure operations, or a related field. Experience in SRE, runtime engineering and application support will be highly desirable.
Strong understanding of cloud infrastructure, distributed systems, and containerization technologies, with experience in supporting critical business applications being a plus.
Familiarity with monitoring and logging tools, as well as incident management best practices, to ensure the reliability and performance of applications in a production environment.
Practical experience on implementation various system integration solutions, such as, APIs and Data integration layers (Batch & Real-Time).
Experience in design and developing mission critical highly available enterprise applications and Java applications performance tuning.
Experience with Continuous Integration and Continuous Delivery environment and platforms, especially GIT, Maven, and Jenkins
Good communication skills - able to explain concepts to product managers and business partners in ways that are meaningful to them.
Hands on experience with relational and NoSQL databases such as DB2, Redis, Postgres, Couchbase etc.
Strong interpersonal communication skills and the ability to work well in a diverse team-focused environment.
Solid programming and scripting skills, with the ability to automate operational tasks using tools such as Python and Ansible.
Excellent problem-solving and troubleshooting abilities, with a focus on understanding and addressing runtime issues in a timely manner.
Strong communication and collaboration skills, with the ability to work effectively in a cross-functional team environment.