
Staff Engineer - Site Reliability Engineering

Staff Engineer - Site Reliability Engineering

Staff Engineer - Site Reliability Engineering
Aviatrix
The Staff Engineer - Site Reliability Engineering role at Aviatrix involves ensuring the reliability, availability, and performance of critical systems and services. The position requires independent ownership of significant components, mentoring junior engineers, and driving technical excellence through automation and best practices.
Qualification
- 6+ years of experience in a relevant engineering field
- Strong knowledge of Kubernetes and application lifecycle management
- Proficiency in Infrastructure as Code (IaC) practices
- Experience with automation and development in Golang and Python
- Ability to design and implement observability strategies
- Strong incident management and performance engineering skills
- Excellent collaboration and technical leadership abilities
Responsibility
- Design and implement complex application lifecycle management using Kubernetes
- Architect comprehensive Infrastructure as Code (IaC) solutions with advanced configurations
- Design and build automation frameworks and tools in Golang and Python
- Take ownership of significant system components, defining SLA targets and driving achievement
- Contribute to improvements in product security, quality, reliability, and performance
- Implement automation frameworks that scale across teams to eliminate manual work
- Design observability strategies and implement advanced monitoring and distributed tracing
- Lead major incident response and establish incident management processes
- Drive system-wide performance improvements and establish performance engineering practices
- Mentor junior engineers and provide technical guidance on complex problems




