HanXiao

HanXiao1999 · Euphoria16

AI & ML interests

None yet

Recent Activity

updated a dataset 9 days ago

HanXiao1999/UI-Genie-Agent-16k

updated a collection 12 days ago

Organizations

None yet

upvoted 2 papers about 2 months ago

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Paper • 2510.14958 • Published Oct 16 • 22

DriveVLA-W0: World Models Amplify Data Scaling Law in Autonomous Driving

Paper • 2510.12796 • Published Oct 14 • 12

upvoted 2 papers 2 months ago

WebGen-Agent: Enhancing Interactive Website Generation with Multi-Level Feedback and Step-Level Reinforcement Learning

Paper • 2509.22644 • Published Sep 26 • 20

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

Paper • 2509.22651 • Published Sep 26 • 22

upvoted a paper 4 months ago

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Paper • 2507.21046 • Published Jul 28 • 82

upvoted 2 papers 6 months ago

Autoregressive Adversarial Post-Training for Real-Time Interactive Video Generation

Paper • 2506.09350 • Published Jun 11 • 48

UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents

Paper • 2505.21496 • Published May 27 • 38

upvoted 3 papers 7 months ago

MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning

Paper • 2505.10557 • Published May 15 • 47

WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch

Paper • 2505.03733 • Published May 6 • 17

LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

Paper • 2504.19838 • Published Apr 28 • 22

upvoted a paper 12 months ago

EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM

Paper • 2412.09618 • Published Dec 12, 2024 • 21

upvoted 2 papers about 1 year ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 56

MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code

Paper • 2410.08196 • Published Oct 10, 2024 • 47

upvoted 2 papers over 1 year ago

MAVIS: Mathematical Visual Instruction Tuning

Paper • 2407.08739 • Published Jul 11, 2024 • 33

Step-Controlled DPO: Leveraging Stepwise Error for Enhanced Mathematical Reasoning

Paper • 2407.00782 • Published Jun 30, 2024 • 25