Software Engineer, Safeguards

Anthropic•San Francisco, CA

Full Timepython typescript ai machine-learning security devops testing qa

Apply Now

Software Engineer, Safeguards

Anthropic•San Francisco, CA

Full Timepython typescript ai+5 more

Apply Now

Software Engineer, Safeguards

Anthropic

Apply Now

Anthropic is seeking experienced software engineers to join their Safeguards team, focusing on building safety and oversight mechanisms for AI systems. The role involves developing monitoring systems, abuse detection mechanisms, and collaborating with research teams to enhance model safety. Candidates should have a strong background in software engineering, particularly in integrity and abuse detection, and be proficient in Python and Typescript.

Qualification

Bachelor’s degree in Computer Science, Software Engineering, or comparable experience.
5-10+ years of experience in software engineering, focusing on integrity, spam, fraud, or abuse detection.
Proficiency in Python and Typescript.
Ability to work across the stack.
Strong communication skills to explain technical concepts to non-technical stakeholders.

Responsibility

Develop monitoring systems to detect unwanted behaviors from API partners and automate enforcement actions.
Build abuse detection mechanisms and infrastructure.
Surface abuse patterns to research teams to improve model training.
Create robust multi-layered defenses for real-time safety mechanisms at scale.
Analyze user reports of inappropriate content or accounts.

Software Engineer, Safeguards

Software Engineer, Safeguards

Software Engineer, Safeguards

Qualification

Responsibility

Similar Jobs

Backend Software Engineer (Evals) – Support Automation Engineering

Systems Engineer - Air

Software Engineer I / II

Manager, Quality Engineering

Enterprise Sales Engineer - Poland

Similar Jobs

Similar Jobs

Backend Software Engineer (Evals) – Support Automation Engineering

Systems Engineer - Air

Software Engineer I / II

Manager, Quality Engineering

Enterprise Sales Engineer - Poland