Position Details
Elevate system reliability as a Senior Site Reliability Engineer (SRE) focused on production applications. Analyze performance and implement optimizations to ensure exceptional system uptime.
We are looking for a highly experienced SRE to oversee reliability, optimization, and incident response. Key responsibilities include implementing best practices for performance and scalability, troubleshooting issues, and leading incident resolutions. You will also engage in capacity planning and operational automation to enhance efficiency.
Key Responsibilities:
• Implement best practices for high availability
• Set up monitoring and troubleshoot incidents
• Analyze resource usage for optimization
• Automate operational workflows and integrations
• Lead incident resolution and performance monitoring
Requirements:
• Proficient in Dynatrace and ELK Stack
• Experience with monitoring tools, AI Ops
• Advanced skills in Python, PowerShell, Shell Scripting
• Knowled...