
Research Engineer, Interpretability

Anthropic
Anthropic is seeking a Research Engineer for its Interpretability team, which works to understand and improve the safety of AI systems through mechanistic interpretability. The role involves reverse-engineering neural networks to make AI models more trustworthy and reliable.
Qualifications
- Strong background in machine learning and AI
- Experience with neural networks and deep learning frameworks
- Proficiency in programming languages such as Python
- Familiarity with research methodologies and scientific writing
- Ability to work collaboratively in a fast-paced environment
Responsibilities
- Conduct research on mechanistic interpretability of neural networks
- Develop tools and methodologies for analyzing the internal workings of AI models
- Collaborate with a team of researchers and engineers
- Publish findings in relevant scientific forums
- Engage in discussions and presentations about interpretability challenges


