

Software Engineer, Data Infrastructure
OpenAI
The Software Engineer, Data Infrastructure role at OpenAI involves building and operating scalable data infrastructure systems that support high-performance compute and storage platforms. The team focuses on innovative data solutions that enhance AI-assisted workflows and analytics. This position is hybrid, based in San Francisco, CA, and offers relocation assistance.
Qualifications
- 4+ years in data infrastructure engineering or infrastructure engineering with a strong interest in data.
- Experience with platforms such as Spark, Kafka, Flink, Airflow, Trino, or Iceberg.
- Proficiency in infrastructure tooling like Terraform.
- Strong debugging skills for large-scale distributed systems.
- Interest in solving data infrastructure problems in the AI space.
Responsibilities
- Design, build, and maintain data infrastructure systems including distributed compute, data orchestration, distributed storage, and streaming infrastructure.
- Ensure scalability, reliability, and security of the data platform.
- Empower engineers and teammates with excellent data tooling and systems to enhance productivity.
- Collaborate with product, research, and analytics teams to develop technical foundations for new features and experiences.
- Participate in the on-call rotation for critical incidents to ensure system reliability.




