Software Engineer, Inference

modularai, United States / Canada
Full Time, Remote, Hybrid

Modular is seeking a Cloud Inference Engineer to join its team building advanced AI infrastructure. The role involves developing distributed LLM inference deployments integrated with the MAX stack, with the aim of making AI systems faster and more scalable. Candidates can work remotely or from the Los Altos, CA office, with a hybrid model for early-career professionals.

Qualifications

  • Experience in backend engineering and distributed systems development
  • Strong understanding of AI inference techniques
  • Proficiency in building high-performance, scalable systems
  • Ability to work collaboratively in a team environment
  • Familiarity with cloud deployment strategies

Responsibilities

  • Build and ship Modular’s LLM-focused inference services using advanced inference techniques
  • Develop distributed systems to support high-performance inference
  • Improve operational excellence through observability and multi-cloud deployments
  • Collaborate with a team of experts to push the boundaries of distributed inference systems
  • Ensure systems are repeatable for new model architectures
