Huge/interpretable-layer-cost

interpretable-layer-cost

A calculator for estimating the compute cost of building a sparse autoencoder layer into an LLM in order to make the concepts inside that LLM interpretable. Auto-published at https://huge.github.io/interpretable-layer-cost
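
To give a feel for the kind of estimate involved, here is a minimal Python sketch. It is not the formula used by the published calculator. It assumes a single-hidden-layer autoencoder (one encoder and one decoder matrix), the common rule of thumb of about 6 FLOPs per weight per training sample, and it ignores the cost of running the LLM to collect activations. All names (`SaeCostInputs`, `sae_weight_count`, `estimate_cost`) and the example numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class SaeCostInputs:
    """Hypothetical inputs; none of these names come from the calculator."""
    d_act: int               # width of the LLM activations the autoencoder reads
    expansion: float         # dictionary size divided by d_act
    n_samples: float         # number of activation vectors used for training
    gpu_flops: float         # peak FLOP/s of one GPU
    utilization: float       # fraction of peak actually achieved (0..1)
    usd_per_gpu_hour: float  # rental price of one GPU hour


def sae_weight_count(d_act: int, expansion: float) -> int:
    """Weights in the encoder plus decoder matrices (biases ignored)."""
    d_dict = int(d_act * expansion)
    return 2 * d_act * d_dict


def estimate_cost(inp: SaeCostInputs) -> dict:
    """Rough training cost, assuming ~6 FLOPs per weight per sample
    (about 2 for the forward pass and 4 for the backward pass)."""
    weights = sae_weight_count(inp.d_act, inp.expansion)
    train_flops = 6 * weights * inp.n_samples
    gpu_seconds = train_flops / (inp.gpu_flops * inp.utilization)
    gpu_hours = gpu_seconds / 3600
    return {
        "weights": weights,
        "train_flops": train_flops,
        "gpu_hours": gpu_hours,
        "usd": gpu_hours * inp.usd_per_gpu_hour,
    }


# Example with made-up numbers: 512-wide activations, 8x dictionary.
example = SaeCostInputs(
    d_act=512, expansion=8, n_samples=1e9,
    gpu_flops=1e14, utilization=0.5, usd_per_gpu_hour=2.0,
)
print(estimate_cost(example))
```

Keeping utilization and price as explicit inputs makes it easy to see how sensitive the result is to hardware assumptions, which usually dominate the uncertainty in such an estimate.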

todo:

  1. add an intro based on http://transformer-circuits.pub/2023/monosemantic-features/index.html#problem-setup (a short read)

  2. explain the parameters (as in http://transformer-circuits.pub/2023/monosemantic-features/index.html#problem-setup) and draw the structure of the inserted layer

  3. sketch OSS replication efforts and ideas to be explored or developed on public-weights LLMs

  4. instead of a parameter for the raw number of training samples, maybe allow setting a target precision derived from a scaling law
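
As an illustration of item 4, the sketch below inverts a power-law scaling curve to get a sample count from a target loss. The functional form `loss(n) = irreducible + a * n ** (-b)` and every coefficient are assumptions made for the example; no fitted scaling law for sparse autoencoders is implied, and the function names are hypothetical.

```python
def loss_at(n_samples: float, a: float, b: float, irreducible: float) -> float:
    """Assumed power-law curve: loss falls as n_samples grows."""
    return irreducible + a * n_samples ** (-b)


def samples_for_target_loss(target_loss: float, a: float, b: float,
                            irreducible: float) -> float:
    """Invert the assumed curve: how many samples reach target_loss?"""
    if a <= 0 or b <= 0:
        raise ValueError("a and b must be positive")
    if target_loss <= irreducible:
        raise ValueError("target is at or below the irreducible loss")
    return (a / (target_loss - irreducible)) ** (1.0 / b)


# Made-up coefficients, only to show the round trip.
n = samples_for_target_loss(target_loss=0.2, a=10.0, b=0.5, irreducible=0.1)
print(n, loss_at(n, a=10.0, b=0.5, irreducible=0.1))
```

The resulting sample count could then be fed into a cost estimate in place of a hand-picked number of training samples.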
