arxiv:2405.00253
Qian Yang
QianYangMILA
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Grounding Computer Use Agents on Human Demonstrations
upvoted
a
paper
2 months ago
It Takes Two: Your GRPO Is Secretly DPO
upvoted
a
paper
7 months ago
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning
Organizations
None yet