- RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation
  Paper • 2501.13726 • Published
- RAG-Star: Enhancing Deliberative Reasoning with Retrieval Augmented Verification and Refinement
  Paper • 2412.12881 • Published • 1
- DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
  Paper • 2502.01142 • Published • 22
Shang Hong Sim
shanghong
AI & ML interests
Neural decoding, neuroengineering, signal processing
Recent Activity
Updated a collection 1 day ago: RAG
Organizations
Collections: 1
Models: 7
- shanghong/q-FrozenLake-4x4-custom • Reinforcement Learning
- shanghong/q-FrozenLake-4x4-test • Reinforcement Learning
- shanghong/q-FrozenLake-custommap-v2
- shanghong/q-FrozenLake-custommap • Reinforcement Learning
- shanghong/Taxi-v3 • Reinforcement Learning
- shanghong/q-FrozenLake-v1-4x4-noSlippery • Reinforcement Learning
- shanghong/ppo-LunarLander-v2 • Reinforcement Learning