Requirements
Description & Requirements
What you will be doing:
- Monitor and report on infrastructure growth, health, availability, and utilization (server health metrics [Memory, CPU, I/O, etc.], storage of all types, database health, Web and network utilizations, virtual machines) in addition to discrete application components on Windows & Unix OSs with VMware & Azure virtualization hosted Azure cloud offerings.
- Maintain high system availability via SolarWinds monitoring and proactive issue resolution.
- Monitoring and analyzing the performance and usage of our applications by using Azure Application Insights.
- Write and optimize Kusto Query Language (KQL) queries to extract insights from Azure telemetry data, including metrics, logs, and traces.
- Develop and maintain custom dashboards in Azure Monitor to visualize and monitor application performance, availability, and resource utilization.
- Maintain healthy systems (perform clean ups and performance tweaks)
- Troubleshoot customer issues related on Infra and application hosted on Azure.
- Use Service Now to work on customer tickets by achieving SLA's.
- Perform scheduled maintenance and support windows updates activities after hours and weekends.
- Run PowerShell scripts and CI/CD pipeline where automation is used to complete tasks for Product deployments.
- Run SQL Scripts to restore SQL Server Database, good understanding of databases and SQL statements.
- Creating, deleting, and managing namespaces within Kubernetes clusters using K9s.
What you will likely bring:
- 3+ year of experience as a Site Reliability Engineer doing systems monitoring & supporting IT infrastructure.
- Experience working with Kubernetes Service and Windows servers.
- PHP / PowerShell / ARM templates / C# Scripting experience is A+
- Web site application monitoring skills are required, while Web site creation skills is A+
- Knowledge of IT infrastructure Architecture as it relates to Web Application Hosting and Support
- Proficiency in Kusto Query Language (KQL) with experience querying large datasets for performance monitoring and troubleshooting.
- Experience in creating and customizing dashboards in Azure Monitor
- Networking and Hardware configuration/troubleshooting.
- Experience with tools such as SolarWinds , Datadog. (Monitoring Tool )
- Experience with other Kubernetes management tools (e.g., k9s, kubectl) is a plus.
- Excellent written and verbal communication.
What could set you apart:
- Azure Server Administration.
About Epicor
At Epicor we know that success comes from working together. Everyone has a role to play, and it's the essential partnerships across our company that are crucial to our customers success and our growth as a business.
We're truly a team. Working in close partnership, we bring wide-ranging talents together in powerful collaborations. We think innovatively, share our knowledge generously, and constantly learn from our colleagues. We're proud of the success we achieve every day, but we never stop challenging ourselves and encouraging each other. Together, we go further and imagine an even brighter future.
Whatever your career journey, we'll help you find the right path. Through our training courses, mentorship, and continuous support, you'll get everything you need to thrive. At Epicor, your success is our success. And that success really matters, because we're the essential partners for the world's most essential businessesthe hardworking companies who make, move, and sell the things the world needs.
Equal Opportunities and Accommodations Statement
Epicor is committed to creating a workplace and global community where inclusion is valued; where you bring the whole and real youthat's who we're interested in. If you have interest in this or any role- but your experience doesn't match every qualification of the job description, that's okay- consider applying regardless.
We are an equal-opportunity employer.