SafeRoute: Adaptive Model Selection for Efficient and Accurate Safety Guardrails in Large Language Models • Paper • 2502.12464 • Published 23 days ago • 27
SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions • Paper • 2501.19377 • Published Jan 31 • 1
HarmAug: Effective Data Augmentation for Knowledge Distillation of Safety Guard Models • Paper • 2410.01524 • Published Oct 2, 2024 • 3