5 13 6

Shuo Zhang

Meteonis

00index

AI & ML interests

None yet

Recent Activity

authored a paper 3 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

upvoted a paper 3 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

liked a model 19 days ago

nex-agi/DeepSeek-V3.1-Nex-N1

View all activity

Organizations

authored a paper 3 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 4 days ago • 67

upvoted a paper 3 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 4 days ago • 67

liked a model 19 days ago

nex-agi/DeepSeek-V3.1-Nex-N1

671B • Updated 3 days ago • 101 • 26

updated 2 models 20 days ago

nex-agi/internlm3-8B-Nex-N1

9B • Updated 3 days ago • 46 • 12

nex-agi/Qwen3-32B-Nex-N1

33B • Updated 3 days ago • 44.8k • 14

New activity in nex-agi/Qwen3-30B-A3B-Nex-N1 20 days ago

Update chat_template.jinja

#1 opened 20 days ago by

Meteonis

authored a paper about 1 month ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21 • 83

upvoted a paper about 2 months ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21 • 83

New activity in openai/gpt-oss-120b 4 months ago

FlashInfer requires sm75+

#48 opened 4 months ago by

hrithiksagar-tih

upvoted a paper 9 months ago

DropletVideo: A Dataset and Approach to Explore Integral Spatio-Temporal Consistent Video Generation

Paper • 2503.06053 • Published Mar 8 • 138

upvoted a paper 10 months ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published Feb 24 • 73

liked a model 12 months ago

deepseek-ai/DeepSeek-V3-Base

685B • Updated Mar 27 • 10.5k • 1.68k

upvoted 2 collections 12 months ago

long-cot-dataset

Collection

16 items • Updated Dec 22, 2024 • 14

DeepSeek-V3

Collection

4 items • Updated 11 days ago • 278

liked a Space 12 months ago

QwQ-32B-Preview

🔍

922

QwQ-32B-Preview

upvoted a paper over 1 year ago

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Paper • 2408.13359 • Published Aug 23, 2024 • 24

commented a paper over 1 year ago

Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler

Paper • 2408.13359 • Published Aug 23, 2024 • 24 •

upvoted 2 papers over 1 year ago

In-Context Imitation Learning via Next-Token Prediction

Paper • 2408.15980 • Published Aug 28, 2024 • 10

Amuro & Char: Analyzing the Relationship between Pre-Training and Fine-Tuning of Large Language Models

Paper • 2408.06663 • Published Aug 13, 2024 • 16

liked a Space over 1 year ago

ChuanhuChatGPT

🐯

923

Chat with AI using text input

Shuo Zhang

AI & ML interests

Recent Activity

Organizations

Meteonis's activity

Update chat_template.jinja

FlashInfer requires sm75+

QwQ-32B-Preview

ChuanhuChatGPT