Core Responsibilities

Lead incident response from detection to resolution and conduct retrospective analysis. Take personal ownership of critical incidents and demonstrate leadership in coordinating with global teams. Organize post-incident recovery actions and customer communication, and participate in post-incident reviews to identify root causes.

Requirements

3+ years of experience in handling significant incidents in a 24/7 high-load environment. 5+ years in roles in engineering, product, or service delivery. Strong decision-making skills in uncertain conditions and experience in Public Cloud, Microservices, and monitoring systems.

Additional Information

Experience Level

Senior

Job Language

English

Employment Type

Full-time

Work Mode

Hybrid