MedMCQA : A Large-scale Multi-Subject Multi-Choice Dataset for Medical domain Question Answering Paper • 2203.14371 • Published Mar 27, 2022
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions and program synthesis • 250 items • Updated 2 days ago • 43
Runtime error 98 😼 Fluently Playground v0.25 Generate images on modern models of the Fluently family
Executable Code Actions Elicit Better LLM Agents Paper • 2402.01030 • Published Feb 1, 2024 • 45 • 4
Executable Code Actions Elicit Better LLM Agents Paper • 2402.01030 • Published Feb 1, 2024 • 45
Journal Club Collection Candidate papers to read in the H4 journal club • 54 items • Updated Apr 21, 2024 • 31
Med-HALT: Medical Domain Hallucination Test for Large Language Models Paper • 2307.15343 • Published Jul 28, 2023 • 2