1 150 48

js

rldy

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

We Got Claude to Fine-Tune an Open Source LLM

liked a Space 4 days ago

OpenEvals/evaluation-guidebook

upvoted a paper 6 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

View all activity

Organizations

upvoted an article 4 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

5 days ago

•

349

upvoted a paper 6 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 6 days ago • 181

upvoted 2 papers 8 days ago

AnyTalker: Scaling Multi-Person Talking Video Generation with Interactivity Refinement

Paper • 2511.23475 • Published 10 days ago • 41

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published 11 days ago • 163

upvoted a paper 12 days ago

GigaEvo: An Open Source Optimization Framework Powered By LLMs And Evolution Algorithms

Paper • 2511.17592 • Published 21 days ago • 118

upvoted a paper 21 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published 21 days ago • 132

upvoted a paper 24 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 27 days ago • 194

upvoted an article about 1 month ago

Article

On the Shifting Global Compute Landscape

Oct 29

•

upvoted 2 papers about 1 month ago

Towards Robust Mathematical Reasoning

Paper • 2511.01846 • Published Nov 3 • 7

Visual Diffusion Models are Geometric Solvers

Paper • 2510.21697 • Published Oct 24 • 18

upvoted a paper about 2 months ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22 • 114

upvoted a paper 2 months ago

Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls

Paper • 2510.00184 • Published Sep 30 • 16

upvoted 8 papers 3 months ago

LIMI: Less is More for Agency

Paper • 2509.17567 • Published Sep 22 • 102

SWE-Bench Pro: Can AI Agents Solve Long-Horizon Software Engineering Tasks?

Paper • 2509.16941 • Published Sep 21 • 21

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

Paper • 2509.17627 • Published Sep 22 • 66

Hunyuan3D Studio: End-to-End AI Pipeline for Game-Ready 3D Asset Generation

Paper • 2509.12815 • Published Sep 16 • 39

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1 • 58

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

MotionFlux: Efficient Text-Guided Motion Generation through Rectified Flow Matching and Preference Alignment

Paper • 2508.19527 • Published Aug 27 • 10

CineScale: Free Lunch in High-Resolution Cinematic Visual Generation

Paper • 2508.15774 • Published Aug 21 • 20

js

AI & ML interests

Recent Activity

Organizations

rldy's activity

We Got Claude to Fine-Tune an Open Source LLM

On the Shifting Global Compute Landscape