Mechanistic Interpretability
Publications by Category
[Franco and Crovella, 2024]
Gabriel Franco and Mark Crovella (2024).
Sparse Attention Decomposition Applied to Circuit Tracing.
Technical Report No. 2410.00340.
doi:10.48550/arXiv.2410.00340
[GitHub repository]