
Software Engineer, Ray Data

Anyscale
Anyscale is seeking a Software Engineer for the Ray Data team to enhance and scale the Ray Datasets library, which is integral to building distributed applications and machine learning pipelines. The role involves optimizing performance, integrating with ML training, and ensuring the stability of data processing.
Qualifications
- Strong programming skills in Python
- Experience with distributed systems and data processing
- Familiarity with Apache Arrow and Ray Core
- Knowledge of machine learning frameworks and libraries
- Ability to work collaboratively in a team environment
Responsibilities
- Build, optimize, and scale the Ray Datasets library
- Enhance data processing capabilities for distributed applications
- Work on performance optimization at large scale using Arrow primitives
- Integrate Ray Datasets with ML training and various data sources
- Develop stability and stress testing infrastructure
- Lead integration of streaming workloads into Ray




