

Senior Member of technical staff (Infrastructure)
hcompany
H is an innovative AI startup focused on developing agentic AI to automate complex tasks and enhance human potential. The Infrastructure team plays a crucial role in providing robust and scalable infrastructure for research and product engineering efforts, ensuring seamless access for engineers and researchers.
Qualification
- Relevant experience in ML Ops or Data Engineering.
- Experience architecting and deploying distributed systems on public cloud platforms (AWS, Azure, GCP).
- Knowledge of observability and monitoring tools (e.g., Datadog, Prometheus, Grafana).
- Proficiency in a modern programming language, ideally Python or JavaScript/TypeScript.
- Experience with containerization and orchestration tools (Docker, Kubernetes) is a plus.
- Familiarity with Infrastructure as Code tools (CDK, Terraform) is a plus.
- Experience in CI/CD management (GitHub Actions, GitLab CI, TeamCity) is a plus.
Responsibility
- Designing and managing infrastructure to support research in model and agent development, including training infrastructure, data pipelines, and inference.
- Supporting product engineering efforts on H Company's agent platform, including client-facing APIs and agent runtimes in various deployment scenarios.
- Setting up and maintaining observability and monitoring strategies.
- Mentoring and growing other engineers in infrastructure-related topics and general engineering practices.
- Ensuring the underlying infrastructure for public services is robust, reliable, and scalable.




