Agent Laboratory: Using LLM Agents as Research Assistants Paper โข 2501.04227 โข Published 5 days ago โข 67
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper โข 2412.19723 โข Published 16 days ago โข 78
view article Article ๐บ๐ฆโโฌ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram โข 10 days ago โข 37
view article Article Building Effective Agents with Anthropicโs Best Practices and smolagents โค๏ธ By Sri-Vigneshwar-DJ โข 8 days ago โข 4
A New Approach for Explainable Multiple Organ Annotation with Few Data Paper โข 1912.12932 โข Published Dec 30, 2019 โข 1
view article Article ๐ช๐บโ๏ธ EU AI Act: Systemic Risks in the First CoP Draft Comments โ๏ธ๐ช๐บ By yjernite โข Dec 12, 2024 โข 12
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials Paper โข 2412.09605 โข Published Dec 12, 2024 โข 27
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents Paper โข 2401.00812 โข Published Jan 1, 2024 โข 4
Code Agents are State of the Art Software Testers Paper โข 2406.12952 โข Published Jun 18, 2024 โข 1
Awesome Computer Use Agents Collection https://github.com/ranpox/awesome-computer-use โข 25 items โข Updated 25 days ago โข 7
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper โข 2412.04454 โข Published Dec 5, 2024 โข 59
view article Article ๐บ๐ฆโโฌ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram โข Dec 4, 2024 โข 76
ShowUI: One Vision-Language-Action Model for GUI Visual Agent Paper โข 2411.17465 โข Published Nov 26, 2024 โข 78
DynaSaur: Large Language Agents Beyond Predefined Actions Paper โข 2411.01747 โข Published Nov 4, 2024 โข 21
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 โข Nov 21, 2024 โข 35