Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study Paper • 2304.06762 • Published Apr 13, 2023
CrossNER: Evaluating Cross-Domain Named Entity Recognition Paper • 2012.04373 • Published Dec 8, 2020
ChatQA: Building GPT-4 Level Conversational QA Models Paper • 2401.10225 • Published Jan 18, 2024
Multi-Stage Prompting for Knowledgeable Dialogue Generation Paper • 2203.08745 • Published Mar 16, 2022
RankRAG: Unifying Context Ranking with Retrieval-Augmented Generation in LLMs Paper • 2407.02485 • Published Jul 2, 2024