
Senior Engineer, Compute Services (Kubernetes, Bare Metal)

Senior Engineer, Compute Services (Kubernetes, Bare Metal)

Senior Engineer, Compute Services (Kubernetes, Bare Metal)
CoreWeave
CoreWeave is seeking a Senior Engineer in Compute Services to build reliable infrastructure for AI applications. The role involves working with Kubernetes on bare-metal environments, utilizing tools like Python and Golang, and optimizing system reliability.
Qualification
- Proven experience provisioning Kubernetes using tools such as kubeadm, Cluster API, Kubeception, Kubespray, or similar
- Demonstrated ability debugging complex Kubernetes cluster issues and carrying out upgrades
- Proficiency in Python, Golang, and Bash scripting
- Experience with GitOps and DevOps practices
- Strong understanding of fault-tolerant architectures and reliability optimization
Responsibility
- Design, develop, and maintain automated tooling to provision Kubernetes control planes on bare-metal
- Use Python, Golang, and Bash to create tooling and go operators
- Perform day 2 lifecycle tasks and maintenance on running clusters
- Identify gaps and implement fault-tolerant architectures
- Optimize reliability using the Grafana ecosystem
- Design automated testing to validate build quality and stability
- Participate in an on-call rotation every two months serving as point of contact




