Qian Yang's picture

5

Qian Yang

QianYangMILA

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Grounding Computer Use Agents on Human Demonstrations

upvoted a paper 2 months ago

It Takes Two: Your GRPO Is Secretly DPO

upvoted a paper 7 months ago

REARANK: Reasoning Re-ranking Agent via Reinforcement Learning

View all activity

Organizations

None yet

Papers 1

arxiv:2405.00253

models 1

QianYangMILA/tmp

datasets 0

None public yet