Efficient-Large-Model

community

AI & ML interests

None defined yet.

Recent Activity

AaronHuangWei authored a paper 7 days ago

MC#: Mixture Compressor for Mixture-of-Experts Large Models

AaronHuangWei authored a paper 7 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

AaronHuangWei submitted a paper 7 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

View all activity

AaronHuangWei

authored 2 papers 7 days ago

MC#: Mixture Compressor for Mixture-of-Experts Large Models

Paper • 2510.10962 • Published Oct 13, 2025

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 9 days ago • 48

AaronHuangWei

submitted a paper to Daily Papers 7 days ago

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Paper • 2512.20557 • Published 9 days ago • 48

JamesHujy

authored 2 papers 10 days ago

DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer

Paper • 2507.04947 • Published Jul 7, 2025 • 1

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published 13 days ago • 48

Boyiliee

authored a paper 15 days ago

FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos

Paper • 2512.10927 • Published 21 days ago • 5

yinhongxu

authored 12 papers 23 days ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 93

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

NaVILA: Legged Robot Vision-Language-Action Model for Navigation

Paper • 2412.04453 • Published Dec 5, 2024

EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos

Paper • 2507.12440 • Published Jul 16, 2025

3D Aware Region Prompted Vision Language Model

Paper • 2509.13317 • Published Sep 16, 2025 • 14

Test-Time Scaling Strategies for Generative Retrieval in Multimodal Conversational Recommendations

Paper • 2508.18132 • Published Aug 25, 2025

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published Oct 13, 2025 • 176

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published Oct 17, 2025 • 89

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published Oct 16, 2025 • 15

SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models

Paper • 2406.01584 • Published Jun 3, 2024

WorldModelBench: Judging Video Generation Models As World Models

Paper • 2502.20694 • Published Feb 28, 2025

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 111

Lawrence-cj

updated a collection 23 days ago

SANA-Video

🎬 SANA-Video: Efficient Video Generation with Block Linear Diffusion Transformer • 8 items • Updated 23 days ago • 6

Lawrence-cj

updated a model 23 days ago

Efficient-Large-Model/SANA-Video_2B_480p_LongLive_diffusers

Text-to-Video • Updated 23 days ago • 2