Link copied to clipboard
IT & Infrastructure

Site Reliability Engineer – Compute Operations ⭐ Featured

IBM India (Multiple Locations)
Full Time 3–7 years experience
About the Role

As a Site Reliability Engineer focused on Compute Operations at IBM, you will ensure the reliability, availability, and performance of large-scale compute infrastructure. You will design and implement automation to reduce manual toil, manage infrastructure as code, and build monitoring and alerting systems for proactive issue detection. The role involves troubleshooting complex production issues, performing capacity planning, and driving continuous improvements in system uptime and operational efficiency. You will collaborate with development and operations teams to define SLOs/SLIs and implement best practices for incident management, change management, and disaster recovery across cloud and on-premise compute environments.

You'll be redirected to the official careers portal

Similar Jobs You Might Like

Sales Planning Manager – Xactly Plan Expert

Wolters Kluwer company logo

Wolters Kluwer

India (Multiple Locations)
Xactly Plan Sales Compensation Planning Territory Planning Quota Management Data Analysis +6 more
Sales & Marketing Full Time 6-10 years experience

Principal Business Systems Analyst - Red Hat Training

Red Hat company logo

Red Hat

India (Multiple Locations)
Business Analysis Requirements Gathering Process Mapping JIRA Agile +6 more
Business & Consulting Full Time 8-14 years experience

Senior Salesforce Application Engineer- Support

Red Hat company logo

Red Hat

India (Multiple Locations)
Salesforce Apex Visualforce Lightning Web Components SOQL +7 more
Software Engineering Full Time 5-10 years experience

Senior Engineer

Fractal company logo

Fractal

India (Multiple Locations)
Python Java Microservices REST APIs Cloud Platforms +8 more
Software Engineering Full Time 5-9 years experience