OpenAI logo

Data Engineer, Analytics

OpenAISan Francisco
FullTimepythonjavascala+5 more
Apply Now
OpenAI logo

Data Engineer, Analytics

OpenAI

Apply Now

OpenAI is seeking a Data Engineer to lead the development of data pipelines and core tables essential for powering analyses, safety systems, and product growth. The role involves collaboration with various teams to ensure data integrity and compliance while contributing to the responsible deployment of AI technology.

Qualification

  • 3+ years of experience as a data engineer and 8+ years of software engineering experience (including data engineering).
  • Proficiency in at least one programming language commonly used in Data Engineering, such as Python, Scala, or Java.
  • Experience with distributed processing technologies and frameworks, such as Hadoop, Flink, and distributed storage systems (e.g., HDFS, S3).
  • Expertise with ETL schedulers such as Airflow, Dagster, Prefect, or similar frameworks.
  • Solid understanding of Spark and ability to write, debug, and optimize Spark code.

Responsibility

  • Design, build and manage data pipelines for user event data integration into the data warehouse.
  • Develop canonical datasets to track key product metrics such as user growth, engagement, and revenue.
  • Collaborate with Infrastructure, Data Science, Product, Marketing, Finance, and Research teams to understand data needs and provide solutions.
  • Implement robust and fault-tolerant systems for data ingestion and processing.
  • Participate in data architecture and engineering decisions, leveraging strong experience and knowledge.
  • Ensure the security, integrity, and compliance of data according to industry and company standards.

Similar Jobs