

Site Reliability Engineer (SRE)
Baseten
Baseten is seeking a Site Reliability Engineer (SRE) to build and maintain scalable infrastructure for deploying machine learning models. The role involves automating processes, managing CI/CD pipelines, and collaborating with cross-functional teams to enhance system reliability and performance. The company has recently secured $150M in funding and aims to scale its team to meet growing customer demand.
Qualification
- Bachelor's, Master's, or Ph.D. degree in a relevant field (not fully provided in the text).
- Experience with cloud infrastructure and services.
- Strong understanding of CI/CD processes and tools.
- Proficiency in scripting and automation tools.
- Experience with monitoring and incident management.
Responsibility
- Build and maintain scalable infrastructure to support the deployment and operation of machine learning models.
- Establish standards and best practices for reliability and performance across the infrastructure.
- Automate processes when relevant, particularly for managing CI/CD pipelines.
- Own products and projects end-to-end, functioning as both an engineer and a project manager.
- Collaborate with cross-functional teams to understand project requirements and translate them into technical solutions.
- Mentor junior team members and contribute to knowledge sharing within the organization.
- Navigate ambiguity and exercise good judgment on tradeoffs and tools needed to solve problems.




