
Member of Technical Staff - ML Research Engineer, Performance Optimization

Member of Technical Staff - ML Research Engineer, Performance Optimization

Member of Technical Staff - ML Research Engineer, Performance Optimization
Liquid AI
Liquid AI, a company spun out of MIT, is seeking a Member of Technical Staff - ML Research Engineer specializing in Performance Optimization. The role focuses on writing high-performance GPU kernels and optimizing AI models for various hardware architectures. The position offers the opportunity to work with cutting-edge technology in a collaborative environment.
Qualification
- Experience writing high-performance, custom GPU kernels for training or inference
- Understanding of low-level profiling tools and kernel tuning
- Experience integrating GPU kernels into frameworks like PyTorch
- Solid understanding of memory hierarchy and optimization for compute and memory-bound workloads
- Experience with CUDA, CUTLASS, C/C++, and PyTorch/Triton
Responsibility
- Write high-performance GPU kernels for inference workloads
- Optimize alternative architectures used at Liquid across all model parameter sizes
- Implement the latest techniques and ideas from research into low-level GPU kernels
- Continuously monitor, profile, and improve the performance of our inference pipelines
- Integrate GPU kernels into frameworks like PyTorch



