A Primer on the Inner Workings of Transformer-based Language Models Paper • 2405.00208 • Published Apr 30, 2024 • 9
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models Paper • 2404.07004 • Published Apr 10, 2024 • 6
Calibrating Reasoning in Language Models with Internal Consistency Paper • 2405.18711 • Published May 29, 2024 • 6
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 95 items • Updated 13 days ago • 96