
HPC Production Engineer

HPC Production Engineer
Jump Trading
Jump Trading Group is seeking a Production Engineer for their High Performance Computing Team in Chicago. The role involves designing, implementing, and maintaining high-performance computing and storage systems, while collaborating with researchers and team members to optimize infrastructure for quantitative research.
Qualification
- Hands-on experience managing Linux environments
- Strong software development background
- Experience with high performance computing systems
- Ability to work collaboratively across teams
- Familiarity with performance monitoring and fault monitoring systems
- Experience in building and maintaining production computing environments
- Willingness to participate in maintenance operations during evenings and weekends
- Strong communication skills for vendor management and collaboration with researchers
Responsibility
- Design, implement, maintain, and support high performance compute and storage systems
- Implement and support performance monitoring and fault monitoring systems
- Monitor systems and storage performance, including network components
- Build tooling to compile, package, install, and upgrade software and operating system components at scale
- Collaborate with team members to write code and testing infrastructures in multiple programming languages
- Develop and improve systems and user documentation
- Participate in large, coordinated maintenance operations, including evenings and weekends
- Work on global projects across a wide range of infrastructure
- Collaborate directly with researchers to optimize their use of HPC infrastructure
- Develop and monitor tools used to maintain a production computing environment
- Provide operational support on a rotating basis and as needed
- Manage relationships with outside vendors, including travel for meetings



