4 10 9

LIU Shih-yang

sliuau

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

upvoted a paper 4 days ago

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

liked a model 5 days ago

mistralai/Ministral-3-3B-Reasoning-2512

View all activity

Organizations

upvoted 2 papers 4 days ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published 11 days ago • 96

Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Paper • 2511.18890 • Published 14 days ago • 29

upvoted a paper about 1 month ago

DLER: Doing Length pEnalty Right - Incentivizing More Intelligence per Token via Reinforcement Learning

Paper • 2510.15110 • Published Oct 16 • 15

upvoted a collection 3 months ago

Reasoning Efficiency Research

Collection

Ultra-efficient reasoning model! SOTA Accuracy / CoT Length trade-offs • 3 items • Updated 4 days ago • 10

upvoted an article 8 months ago

Article

Open R1: Update #3

Mar 11

•

296

upvoted 2 articles 10 months ago

Article

Open R1: Update #2

Feb 10

•

218

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

•

887

upvoted a paper about 1 year ago

EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation

Paper • 2410.21271 • Published Oct 28, 2024 • 7

upvoted an article over 1 year ago

Article

Building DoRA Support for Embedding Layers in PEFT

Aug 23, 2024

•

upvoted a paper about 2 years ago

LLM-FP4: 4-Bit Floating-Point Quantized Transformers

Paper • 2310.16836 • Published Oct 25, 2023 • 14

LIU Shih-yang

AI & ML interests

Recent Activity

Organizations

sliuau's activity

Open R1: Update #3

Open R1: Update #2

Open-R1: a fully open reproduction of DeepSeek-R1

Building DoRA Support for Embedding Layers in PEFT