Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

113,494

Full-text search

Active filters: trl

FluegelQueen/Coeur-Validation-Fin

Text Generation • 8B • Updated 15 days ago • 84 • 1

mradermacher/Tashkeel-350M-v2-GGUF

0.3B • Updated Oct 30 • 254 • 3

Lamapi/next-ocr

Image-Text-to-Text • 9B • Updated 23 days ago • 2.34k • 7

0xgr3y/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tall_tame_panther

Text Generation • 0.5B • Updated 20 days ago • 4.04k • 1

DmitriyYurckML/QwenFree

Text Generation • Updated 8 days ago • 42 • 2

Freakz3z/Qwen-JSON

Text Generation • 4B • Updated 6 days ago • 215 • 1

suv11235/olmOCR-7B-grpo-v3

Image-to-Text • 8B • Updated 7 days ago • 17 • 1

mradermacher/olmOCR-7B-grpo-v3-GGUF

8B • Updated 7 days ago • 407 • 1

mradermacher/olmOCR-7B-grpo-v3-i1-GGUF

8B • Updated 3 days ago • 2.02k • 1

AhmedZaky1/whisper-small-v1

Updated 7 days ago • 1

b1n1yam/gemma-2-27b-amharic-cpt

Updated 2 days ago • 1

Nonovogo/qwen3-0.6_Python_4R

Updated 4 days ago • 1

kingabzpro/ministral-3-bone-fracture

Updated 3 days ago • 1

zarnite/naizz-tts-v1

Updated 3 days ago • 1

MrIA-boy/LukMK1

Updated 3 days ago • 1

pankajmathur/qwen3-0.6b-codeforces-finetuned-full

Updated 2 days ago • 1

lewtun/dummy-trl-model

Reinforcement Learning • Updated Jan 24, 2023 • 7 • 1

ybelkada/gpt-neo-125m-detox

Reinforcement Learning • Updated Feb 17, 2023 • 9

ybelkada/gpt-neo-125m-detoxified-long-context

Reinforcement Learning • Updated Feb 17, 2023 • 4

dshin/flan-t5-ppo

Reinforcement Learning • Updated Mar 11, 2023 • 5 • 1

SummerSigh/T5-Base-Rule-Of-Thumb-RM

Reinforcement Learning • Updated Mar 12, 2023 • 2

dshin/flan-t5-ppo-testing

Reinforcement Learning • Updated Mar 12, 2023 • 7 • 1

SummerSigh/T5-Base-EvilPrompterRM

Reinforcement Learning • 0.2B • Updated Mar 18, 2023 • 7

dshin/flan-t5-ppo-testing-violation

Reinforcement Learning • Updated Mar 12, 2023 • 1

dshin/flan-t5-ppo-user-b

Reinforcement Learning • Updated Mar 12, 2023 • 2

dshin/flan-t5-ppo-user-h-use-violation

Reinforcement Learning • Updated Mar 13, 2023 • 4

dshin/flan-t5-ppo-user-f-use-violation

Reinforcement Learning • Updated Mar 13, 2023 • 4

dshin/flan-t5-ppo-user-e-use-violation

Reinforcement Learning • Updated Mar 13, 2023 • 4

dshin/flan-t5-ppo-user-a-use-violation

Reinforcement Learning • Updated Mar 13, 2023 • 4

dshin/flan-t5-ppo-user-h-batch-size-8-epoch-0

Reinforcement Learning • Updated Mar 13, 2023 • 3