cadentj/autointerp

autointerp

Simple, hackable implementation of automated interpretability.

Todo:

  • Caching and loading do not support padding at the moment.
  • Make a simple unified client, like the one in safety-tooling.
