Boundary-Guided Policy Optimization for Memory-Efficient RL of Diffusion Large Language Models
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
DeepPrune: Parallel Scaling without Inter-trace Redundancy
SIRI: Scaling Iterative Reinforcement Learning with Interleaved Compression
models
75
THU-KEG/LLaDA-8B-BGPO-sudoku
Reinforcement Learning
•
8B
•
Updated
•
7
•
1
THU-KEG/LLaDA-8B-BGPO-countdown
Reinforcement Learning
•
8B
•
Updated
•
6
•
1
THU-KEG/LLaDA-8B-BGPO-code
Reinforcement Learning
•
8B
•
Updated
•
14
•
1
THU-KEG/LLaDA-8B-BGPO-math
Reinforcement Learning
•
8B
•
Updated
•
8
•
1
THU-KEG/DeepPrune-Judge-4B
Text Classification
•
Updated
•
9
•
1
THU-KEG/SIRI-1.5B-low
Text Generation
•
2B
•
Updated
•
9
•
2
THU-KEG/SIRI-1.5B-high
Text Generation
•
2B
•
Updated
•
10
•
3
THU-KEG/SIRI-7B-low
Text Generation
•
8B
•
Updated
•
11
•
2
THU-KEG/SIRI-7B-high
Text Generation
•
8B
•
Updated
•
14
•
4
THU-KEG/LongWriter-Zero-32B
Text Generation
•
33B
•
Updated
•
53
•
•
110
datasets
20
THU-KEG/AgentIF
Viewer
•
Updated
•
707
•
149
•
5
THU-KEG/DeepPrune
Preview
•
Updated
•
36
•
1
THU-KEG/LinguaLens-Data
Viewer
•
Updated
•
7.25k
•
71
•
2
THU-KEG/RM-Bench
Viewer
•
Updated
•
1.33k
•
534
•
7
THU-KEG/LongWriter-Zero-RLData
Viewer
•
Updated
•
8.61k
•
91
•
21
THU-KEG/Arena-Write
Viewer
•
Updated
•
595
•
278
•
4
THU-KEG/LongStory
Viewer
•
Updated
•
5.28k
•
27
•
2
THU-KEG/IF-Verifier-Data
Viewer
•
Updated
•
131k
•
87
•
3
THU-KEG/VerInstruct
Viewer
•
Updated
•
27.5k
•
121
•
6
THU-KEG/MM-Math-Align
Viewer
•
Updated
•
4.02k
•
84
•
1