We are seeking an accomplished Subject Matter Expert (SME) specialized in Monitoring Tools and Linux administration. This role requires a deep understanding of monitoring solutions, strong leadership skills, and the ability to design, implement, and optimize monitoring architectures to ensure the reliability and performance of our IT infrastructure.
Key Responsibilities:
- Lead the design, implementation, and optimization of monitoring solutions using Splunk, Zabbix, SolarWinds and other relevant tools.
- Provide technical leadership and guidance to the team members ensuring best practices and standards are followe'd in monitoring tool configurations and operations.
- Collaborate with cross-functional teams to gather requirements, define monitoring strategies, and implement solutions that meet business needs.
- Configure and customize monitoring tools to monitor performance, availability, and security of IT systems, applications, and network infrastructure.
- Develop and maintain dashboards, alerts, reports, and automated responses within monitoring tools to provide real-time insights and facilitate proactive management.
- Implement machine learning and AI-driven analytics for predictive and prescriptive monitoring capabilities.
- Conduct regular health checks, audits, and performance tuning of monitoring environments to optimize efficiency and ensure scalability.
- Serve as a subject matter expert, providing training, mentoring, and support to team members and stakeholders on monitoring tools and best practices.
Qualifications:
- bachelors degree in computer science, Information Technology, or a related field.
- Minimum 8 years of hands-on experience in monitoring tools, with proficiency in Splunk and Zabbix.
- Strong understanding of Linux administration, scripting languages (eg, Python), and automation frameworks for monitoring tool customization and integration.
- Knowledge of machine learning concepts and their application to monitoring and analytics.
- Proven leadership skills, including the ability to lead projects, mentor team members, and drive initiatives forward.
- Effective communication skills, with the ability to convey complex technical concepts to non-technical stakeholders.
- Experience with cloud environments (eg, AWS, Azure) and monitoring cloud-native applications.
- Certifications in relevant technologies (eg, Splunk Certified Architect/Administrator, Zabbix Certified Professional) are highly desirable.