Participate in a 24/7/365 team that will track, support and remediate production incidents
Manage the maintenance and patching of varying complexity solutions across multiple cloud providers
Monitor the stability and health of our critical cloud projects
Develop and maintain a best in class monitoring and logging platform for varying cloud projects comprised of technologies such as Kubernetes, Docker and Airflow
Incorporate Real User Monitoring into the monitoring strategy
Govern our systems to ensure security standards and compliance is always met
Support the wider global business for access management and change control
Work closely with the in-house developers to help share knowledge and improve the quality of practices
Taking ownership of projects gaining a personal satisfaction in achieving high performing environments
Working in a DevOps capacity with a desire to automate everything
Thrive in a fast-paced environment trying to always achieve new goals
Requirements:
5+ years of experience in working with cloud technologies
Certified in a public cloud technology (AWS, GCP or Azure)
Strong communication skills and ability to work well in a diverse, globally distributed team
Experienced with SRE design principles and methodologies
Strong understanding of modern monitoring and logging technologies
Understand microservice architecture
Understand web performance enhancement technologies (CDN, Caching, etc)
Understand industry standard security best practices for OWASP
Hands-on experience with managing production Kubernetes environments