-
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper • 2505.03335 • Published • 188 -
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Paper • 2502.06781 • Published • 59 -
internlm/OREAL-7B
Text Generation • 8B • Updated • 84 • • 20 -
internlm/OREAL-32B
Text Generation • 33B • Updated • 283 • 24
AI-Insight
non-profit
AI & ML interests
None defined yet.
Recent Activity
-
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
Paper • 2505.19897 • Published • 104 -
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning
Paper • 2506.10521 • Published • 73 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19
-
meituan-longcat/LongCat-Flash-Omni
Any-to-Any • 561B • Updated • 162 • 100 -
LongCat-Flash-Omni Technical Report
Paper • 2511.00279 • Published • 22 -
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
Paper • 2510.15870 • Published • 89 -
nvidia/omnivinci
Feature Extraction • Updated • 6.94k • 163
-
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming
Paper • 2505.12925 • Published • 2 -
Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination
Paper • 2503.04149 • Published • 6 -
OSS-Bench: Benchmark Generator for Coding LLMs
Paper • 2505.12331 • Published • 2 -
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
Paper • 2506.09289 • Published • 2
-
internlm/Intern-S1
Image-Text-to-Text • 241B • Updated • 58.4k • 249 -
Intern-S1: A Scientific Multimodal Foundation Model
Paper • 2508.15763 • Published • 256 -
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Paper • 2408.01800 • Published • 89 -
openbmb/MiniCPM-V-4_5
Image-Text-to-Text • 9B • Updated • 49.5k • 1.02k
-
tencent/HunyuanOCR
Image-Text-to-Text • 1.0B • Updated • 493k • 654 -
HunyuanOCR Technical Report
Paper • 2511.19575 • Published • 20 -
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 23.4k • 1.39k -
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Paper • 2510.14528 • Published • 105
-
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
Paper • 2505.03335 • Published • 188 -
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
Paper • 2502.06781 • Published • 59 -
internlm/OREAL-7B
Text Generation • 8B • Updated • 84 • • 20 -
internlm/OREAL-32B
Text Generation • 33B • Updated • 283 • 24
-
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive Programming
Paper • 2505.12925 • Published • 2 -
Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination
Paper • 2503.04149 • Published • 6 -
OSS-Bench: Benchmark Generator for Coding LLMs
Paper • 2505.12331 • Published • 2 -
UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench
Paper • 2506.09289 • Published • 2
-
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
Paper • 2505.19897 • Published • 104 -
Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning
Paper • 2506.10521 • Published • 73 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19
-
internlm/Intern-S1
Image-Text-to-Text • 241B • Updated • 58.4k • 249 -
Intern-S1: A Scientific Multimodal Foundation Model
Paper • 2508.15763 • Published • 256 -
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Paper • 2408.01800 • Published • 89 -
openbmb/MiniCPM-V-4_5
Image-Text-to-Text • 9B • Updated • 49.5k • 1.02k
-
meituan-longcat/LongCat-Flash-Omni
Any-to-Any • 561B • Updated • 162 • 100 -
LongCat-Flash-Omni Technical Report
Paper • 2511.00279 • Published • 22 -
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM
Paper • 2510.15870 • Published • 89 -
nvidia/omnivinci
Feature Extraction • Updated • 6.94k • 163
-
tencent/HunyuanOCR
Image-Text-to-Text • 1.0B • Updated • 493k • 654 -
HunyuanOCR Technical Report
Paper • 2511.19575 • Published • 20 -
PaddlePaddle/PaddleOCR-VL
Image-Text-to-Text • 1.0B • Updated • 23.4k • 1.39k -
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
Paper • 2510.14528 • Published • 105