SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation Paper • 2410.12761 • Published Oct 16, 2024
UPCORE: Utility-Preserving Coreset Selection for Balanced Unlearning Paper • 2502.15082 • Published 18 days ago • 1
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 104 items • Updated 4 days ago • 97
Debiasing Multimodal Models via Causal Information Minimization Paper • 2311.16941 • Published Nov 28, 2023 • 1
Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks Paper • 2309.17410 • Published Sep 29, 2023 • 4