As a Site Reliability Engineer atOneMind Cloud Services, your primary responsibility will be to manage ourdatacenter inventory and ensure the reliability and optimal performance of ourinfrastructure. This role requires a blend of skills in system administration,inventory management, and reliability engineering, with a focus on maintaininghigh uptime and efficiency.
Key Responsibilities:
- Oversee andmanage the datacenter inventory, ensuring all hardware and software componentsare accounted for and properly maintained.
- Implement andmaintain tools for monitoring and reporting on the health and performance ofdatacenter infrastructure.
- Conduct regularaudits of datacenter equipment to identify and rectify any issues proactively.
- Collaborate withIT and network teams to ensure seamless integration and operation of datacentercomponents.
- Develop andenforce best practices for inventory management and system reliability.
- Automaterepetitive tasks and processes to improve efficiency and accuracy in inventorymanagement.
- Respond to andresolve system outages and performance issues, ensuring minimal downtime.
- Stay currentwith emerging technologies and industry trends in datacenter management andreliability engineering.
Requirements
Qualifications:
- Bachelordegree in computer science, Information Technology, or a related field.
- Provenexperience in managing datacenter inventory and ensuring system reliability.
- Strong knowledgeof system administration and network management.
- Experience withmonitoring tools and automation software.
- Excellentproblem-solving skills and attention to detail.
- Ability to workin a fast-paced environment and manage multiple tasks simultaneously.
- Strongcommunication skills and the ability to collaborate effectively with variousteams.
Preferred Skills:
- Familiarity withcloud computing platforms and services.
- Experience withscripting languages for automation (e.g., Python, Bash).
- Knowledge ofbest practices in IT security and datacenter operations.