My job alerts

Staff Site Reliability Engineer I - Kubernetes

Careem

This job is no longer accepting applications

See open jobs at Careem.See open jobs similar to "Staff Site Reliability Engineer I - Kubernetes" Alpha Partners.

Software Engineering

Jordan Springs, VA, USA · Jordan

Posted on Thursday, May 11, 2023

Careem is building ‘the everything app’ for the greater Middle East, making it easier than ever to move around, order food and groceries, manage payments, and more. Careem is led by a powerful purpose to simplify and improve the lives of people and build an awesome organisation that inspires. Since 2012, Careem has created earnings for over 2.5 million Captains, simplified the lives of over 50 million customers, and built a platform for the region’s best talent to thrive and for entrepreneurs to scale their businesses. Careem operates in over 70 cities across 10 countries, from Morocco to Pakistan.

About the team

We are looking for engineers who will work within the Cloud Infrastructure Foundation team. The Infra Foundation team develops and maintain cloud-native technology for the Careem Service teams:

Highly scalable Kubernetes clusters
Highly reliable, secure and performant KONG based API Gateways
Cloud Access management automation and integration with k8s

About the role

We need expert, execution-focused engineers to help shape the future of the Careem platform and to help us scale our already sizable effort greatly. As an Staff SRE in Careem, you'll architect, build and maintain the above ecosystem required to ensure resilience, reliability of our services and speed up deployments with the aim of improving our products used by millions of customers every day. Key responsibilities include:

Make an impact from design phase, through development and operation of Kubernetes cluster and its ecosystem on AWS
Develop and integrate KONG API Gateway Plugins that are low latency and secure.
Build core services, tooling and create technical processes that simplify and enable engineers across multiple services
Identifying and automating and scale system configurations without compromising on security and reliability.
Participate in on-call rotations and help improve incident response

Qualifications

8 above years experience in architecting, developing, operating and troubleshooting Kubernetes clusters and/or other highly available systems at scale.
Preferable - hands on experience with deploying and operating KONG as API GW or Ingress controller.
Hands on with at least one of the following programming languages: Go, Python, Java, Rust, C++
Experience with infrastructure automation - such as terraform , Cloud Formation or Pulumi.
Strong Unix or Linux background, including concepts such as processes, network stack, and memory allocation
Experience with cloud-native services on AWS/GCP/Azure
Incident response and/or incident management experience
Experience on DevOps topics such as monitoring, CI/CD, security is a plus
Effective communication and collaboration skills: have the ability to drive and promote technical partnerships across teams

What we’ll provide you

In addition to a competitive long-term total compensation with salary and equity, we have a reward philosophy that expands beyond this. As a Careem colleague you will be able to:

Be part of a Remote-First organization that offers flexible ways of working from the office and home.
Work from any country in the world for 30 days a year
Use Unlimited Vacation days throughout the year
Access fitness reimbursements for health activities including: gym, health club and training classes.
Work and learn from great minds
Create impact in a region with untapped potential
Explore new opportunities to learn and grow every day

This job is no longer accepting applications

See open jobs at Careem.See open jobs similar to "Staff Site Reliability Engineer I - Kubernetes" Alpha Partners.

See more open positions at Careem

Alpha invests in incredible companies.

Staff Site Reliability Engineer I - Kubernetes