Mechanistic Interpretability Website

Welcome to the Mechanistic Interpretability Hub

Mechanistic interpretability is an exciting and rapidly evolving field that aims to uncover the inner workings of complex AI systems. Our mission is to be a hub for the latest research, resources, and community engagement around this important topic.

Whether you're a policymaker looking to understand the implications of AI, a researcher delving into the technical details, or simply curious about how these systems work, you'll find something valuable here.

Explore the Distillation

Get up to speed on the key research insights and breakthroughs from the past few years. Our curated summaries are tailored for both technical and non-technical audiences.

Dive into the Research

Discover in-depth content for researchers, including papers, methodologies, and open questions. Plus, find resources to help you get started in the field of mechanistic interpretability.

Connect with the Community

Join our vibrant Discord server to engage with other enthusiasts, participate in the weekly reading group, and collaborate on projects. We're always eager to learn from each other.

This website aims to strike a balance - providing authoritative and well-researched information, while maintaining an approachable and inviting tone. We want to empower a wide range of stakeholders to understand and engage with the fascinating world of mechanistic interpretability.

So dive in, explore, and let's work together to unravel the mysteries of AI!