Xiaotian Han

ahxt

AI & ML interests

Large Language Models, Graph Learning, Fairness

Recent Activity

commented on a paper 1 day ago
Thinking Preference Optimization
liked a model 13 days ago
deepseek-ai/DeepSeek-R1-Zero
View all activity

Organizations

None yet

ahxt's activity

upvoted an article 14 days ago
view article
Article

Proximal Policy Optimization (PPO)

23