Answering Questions by Meta-Reasoning over Multiple Chains of Thought Paper • 2304.13007 • Published Apr 25, 2023 • 1
Making Retrieval-Augmented Language Models Robust to Irrelevant Context Paper • 2310.01558 • Published Oct 2, 2023 • 2
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? Paper • 2407.15711 • Published Jul 22, 2024 • 9
SCROLLS: Standardized CompaRison Over Long Language Sequences Paper • 2201.03533 • Published Jan 10, 2022 • 1