SearchLondonJobs.co.uk

🏛️ London's Premier Job Portal

← Back to London Jobs

Confluent Incident Management Engineer

Company: IBM

Location: toronto, London

Posted: June 11, 2026

Apply for This Position

Submit Application

Position Details

Join Confluent as an Incident Management Engineer and enhance the reliability of our cloud infrastructure. Manage incident response and implement strategic improvements to prevent future incidents.
This role fuses technical engineering and strategic program ownership, dedicating 75% of your time to hands-on engineering tasks. You'll focus on building automation, analyzing systemic failures, and developing reliability enhancements, while also teaching and coordinating post-mortem processes with various teams. Your impact will help shape critical incident response standards.
Key Responsibilities:
• Design reliability improvements to mitigate incidents
• Manage Rootly configuration and its integration with relevant tools
• Maintain SLO/SLA frameworks using error budgets
• Review and edit customer-facing incident documentation
• Develop and deliver incident training programs
Requirements:
• Over 10 years in SRE or reliability engineering
• Experience with AWS,...