Senior Site Reliability Engineer


San Jose
Permanent
USD200000 - USD315000
Development and Engineering​
PR/539535_1744047986
Senior Site Reliability Engineer

Site Reliability Engineer

At the intersection of machine learning and large-scale infrastructure, the SRE team for our Applied Machine Learning group is redefining how intelligent systems operate at global scale. We blend the principles of software engineering with systems reliability to keep our AI and recommendation systems resilient, high-performing, and ever-evolving.

As a Site Reliability Engineer on this team, you'll be hands-on with some of the most advanced AI technologies, helping architect, maintain, and scale machine learning platforms that serve millions-if not billions-of users. You'll also play a critical role in optimizing system performance, making hardware and capacity recommendations, and automating everything possible.

What You'll Do:

  • Ensure our ML systems run smoothly, efficiently, and reliably-no matter how complex or large they get.

  • Dive deep into the guts of distributed systems to identify and resolve bottlenecks before they become outages.

  • Contribute to and lead the automation of infrastructure, pipelines, and operational routines.

  • Collaborate with engineering and hardware teams on capacity planning, architecture choices, and performance tuning.

What You Bring:

  • Deep knowledge of distributed systems and the experience to troubleshoot them with precision.

  • A Bachelor's or Master's in Computer Science or a closely related field focused on software development or systems engineering.

  • Solid programming chops in at least one of the following: Python, C/C++, or Go.

  • Strong foundation in algorithms, data structures, and computer science fundamentals.

Preferred Extras:

  • Experience designing and operating high-scale, high-availability systems.

  • Passion for writing clean, optimized code and automating away manual tasks.

  • Prior SRE experience in large distributed production environments.

FAQs

Congratulations, we understand that taking the time to apply is a big step. When you apply, your details go directly to the consultant who is sourcing talent. Due to demand, we may not get back to all applicants that have applied. However, we always keep your CV and details on file so when we see similar roles or see skillsets that drive growth in organisations, we will always reach out to discuss opportunities.

Yes. Even if this role isn’t a perfect match, applying allows us to understand your expertise and ambitions, ensuring you're on our radar for the right opportunity when it arises.

We also work in several ways, firstly we advertise our roles available on our site, however, often due to confidentiality we may not post all. We also work with clients who are more focused on skills and understanding what is required to future-proof their business. 

That's why we recommend registering your CV so you can be considered for roles that have yet to be created. 

Yes, we help with CV and interview preparation. From customised support on how to optimise your CV to interview preparation and compensation negotiations, we advocate for you throughout your next career move.

Handpicked roles for you