Baseten logo

Site Reliability Engineer (SRE)

BasetenSan Francisco Office
FullTimeremotefull-timedevops+5 more
Apply Now
Baseten logo

Site Reliability Engineer (SRE)

Baseten

Apply Now

Baseten is seeking a Site Reliability Engineer (SRE) to build and maintain scalable infrastructure for deploying machine learning models. The role involves automating processes, managing CI/CD pipelines, and collaborating with cross-functional teams to enhance system reliability and performance. The company has recently secured $150M in funding and aims to scale its team to meet growing customer demand.

Qualification

  • Bachelor's, Master's, or Ph.D. degree in a relevant field (not fully provided in the text).
  • Experience with cloud infrastructure and services.
  • Strong understanding of CI/CD processes and tools.
  • Proficiency in scripting and automation tools.
  • Experience with monitoring and incident management.

Responsibility

  • Build and maintain scalable infrastructure to support the deployment and operation of machine learning models.
  • Establish standards and best practices for reliability and performance across the infrastructure.
  • Automate processes when relevant, particularly for managing CI/CD pipelines.
  • Own products and projects end-to-end, functioning as both an engineer and a project manager.
  • Collaborate with cross-functional teams to understand project requirements and translate them into technical solutions.
  • Mentor junior team members and contribute to knowledge sharing within the organization.
  • Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems.

Similar Jobs