Deniz Aybey's picture

Deniz Aybey PRO

denizaybey

·

https://sonne.technology

AI & ML interests

None yet

Organizations

upvoted a paper 3 days ago

NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation

Paper • 2512.05106 • Published 4 days ago • 13

upvoted a paper 4 days ago

WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning

Paper • 2512.02425 • Published 6 days ago • 22

upvoted 2 collections 6 days ago

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 6 days ago • 116

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 6 days ago • 70

upvoted 2 articles 13 days ago

Article

Diffusers welcomes FLUX-2

+6

13 days ago

•

158

Article

Continuous batching from first principles

+1

13 days ago

•

252

upvoted a collection 16 days ago

Apriel-H1

Introducing Apriel-H1 hybrids each blending Attention and Mamba State Space layers in varying proportions. • 8 items • Updated Nov 5 • 7

upvoted a collection 18 days ago

Olmo 3

Artifacts for the Olmo 3 release. • 9 items • Updated 7 days ago • 142

upvoted an article about 2 months ago

Article

Supercharge your OCR Pipelines with Open Models

+5

Oct 21

•

273

upvoted a collection about 2 months ago

Kumru

Kumru LLMs • 2 items • Updated Oct 8 • 9

upvoted a collection 2 months ago

Granite 4.0 Language Models

13 items • Updated 21 days ago • 193

upvoted 3 collections 3 months ago

Qwen3-Omni

6 items • Updated Oct 9 • 168

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 304

Qwen3-Next

4 items • Updated Sep 22 • 161

upvoted 6 collections 4 months ago

AFM-Models

The models and training dataset of the paper: Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL • 12 items • Updated Aug 6 • 16

Seed-OSS

Seed-OSS Open-Source Models • 3 items • Updated Aug 20 • 58

DINOv3

DINOv3: foundation models producing excellent dense features, outperforming SotA w/o fine-tuning - https://arxiv.org/abs/2508.10104 • 13 items • Updated Aug 21 • 398

qqWen-Series

Based off the Qwen-2.5 Series - model finetuned for the Q programming language. • 12 items • Updated Oct 22 • 10

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 391

Falcon-H1

Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 38 items • Updated Nov 6 • 56