Fangzhi Xu's picture

4 47 7

Fangzhi Xu

xufangzhi

·

http://xufangzhi.github.io

AI & ML interests

Natural Language Processing, Large Language Models, Neural Symbolic

Recent Activity

upvoted a collection 3 days ago

upvoted a paper 3 days ago

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

upvoted a paper 3 days ago

From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks

View all activity

Organizations

upvoted a collection 3 days ago

PaCo-RL

Data and Model collection for PaCo-RL • 9 items • Updated 3 days ago • 7

upvoted 2 papers 3 days ago

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling

Paper • 2512.04784 • Published 8 days ago • 23

From Imitation to Discrimination: Toward A Generalized Curriculum Advantage Mechanism Enhancing Cross-Domain Reasoning Tasks

Paper • 2512.02580 • Published 8 days ago • 27

updated a Space 9 days ago

Dynamic Energy Grid Simulator

Manage an energy grid by adjusting power sources

published a Space 9 days ago

Dynamic Energy Grid Simulator

Manage an energy grid by adjusting power sources

updated a Space about 1 month ago

TurnOnLights

Toggle bulbs to solve a logic puzzle

published a Space about 1 month ago

TurnOnLights

Toggle bulbs to solve a logic puzzle

updated a Space about 1 month ago

Trade

Environment for AI Trading

New activity in xufangzhi/Trade about 1 month ago

update

#1 opened about 1 month ago by

published a Space about 1 month ago

Trade

Environment for AI Trading

upvoted 3 papers about 1 month ago

ARISE: An Adaptive Resolution-Aware Metric for Test-Time Scaling Evaluation in Large Reasoning Models

Paper • 2510.06014 • Published Oct 7 • 10

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28 • 71

JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence

Paper • 2510.23538 • Published Oct 27 • 96

upvoted 2 papers about 2 months ago

LIBERO-Plus: In-depth Robustness Analysis of Vision-Language-Action Models

Paper • 2510.13626 • Published Oct 15 • 45

PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Paper • 2510.13809 • Published Oct 15 • 37

upvoted a collection about 2 months ago

LightReasoner Models

https://arxiv.org/abs/2510.07962 • 3 items • Updated Oct 19 • 5

upvoted 2 papers about 2 months ago

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

Paper • 2510.08189 • Published Oct 9 • 26

AutoPR: Let's Automate Your Academic Promotion!

Paper • 2510.09558 • Published Oct 10 • 51

upvoted a paper 2 months ago

The Era of Real-World Human Interaction: RL from User Conversations

Paper • 2509.25137 • Published Sep 29 • 18