Member of Technical Staff - ML Performance

Modal•New York

FullTimeon-site full-time machine-learning tensorflow cuda python performance-engineering gpu

Apply Now

Member of Technical Staff - ML Performance

Modal•New York

FullTimeon-site full-time machine-learning+5 more

Apply Now

Member of Technical Staff - ML Performance

Modal

Apply Now

Modal is a fast-growing AI infrastructure company providing GPU access and container solutions for AI teams. They are seeking experienced engineers to enhance the performance of ML systems, particularly in relation to language and diffusion models.

Qualification

5+ years of experience writing high-quality, high-performance code.
Experience with torch and high-level ML frameworks.
Familiarity with Nvidia GPU architecture and CUDA.
Experience with ML performance engineering and inference engines like vLLM or TensorRT.
Nice-to-have: familiarity with low-level operating system foundations such as Linux kernel and containers.

Responsibility

Contribute to open-source projects related to ML performance engineering.
Enhance Modal's container runtime for improved model throughput and latency.
Collaborate with the engineering team to optimize ML systems at scale.
Debug and resolve performance issues related to GPU utilization.
Implement high-performance algorithms for machine learning applications.

Member of Technical Staff - ML Performance

Member of Technical Staff - ML Performance

Member of Technical Staff - ML Performance

Qualification

Responsibility

Similar Jobs

Systems Engineer, Open Architecture, Active Clearance

Software Engineer I / II

Staff Software Engineer-Greenplum

DataOps Engineer (AI Platform Engineer)

Staff Backend Engineer-RiskOS

Similar Jobs

Similar Jobs

Systems Engineer, Open Architecture, Active Clearance

Software Engineer I / II

Staff Software Engineer-Greenplum

DataOps Engineer (AI Platform Engineer)

Staff Backend Engineer-RiskOS