-
-
-
-
-
-
Inference Providers
Active filters:
trl
FluegelQueen/Coeur-Validation-Fin
Text Generation
•
8B
•
Updated
•
84
•
1
mradermacher/Tashkeel-350M-v2-GGUF
0.3B
•
Updated
•
254
•
3
Lamapi/next-ocr
Image-Text-to-Text
•
9B
•
Updated
•
2.34k
•
7
0xgr3y/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tall_tame_panther
Text Generation
•
0.5B
•
Updated
•
4.04k
•
1
DmitriyYurckML/QwenFree
Text Generation
•
Updated
•
42
•
2
Freakz3z/Qwen-JSON
Text Generation
•
4B
•
Updated
•
215
•
1
suv11235/olmOCR-7B-grpo-v3
Image-to-Text
•
8B
•
Updated
•
17
•
1
mradermacher/olmOCR-7B-grpo-v3-GGUF
8B
•
Updated
•
407
•
1
mradermacher/olmOCR-7B-grpo-v3-i1-GGUF
8B
•
Updated
•
2.02k
•
1
AhmedZaky1/whisper-small-v1
b1n1yam/gemma-2-27b-amharic-cpt
Nonovogo/qwen3-0.6_Python_4R
kingabzpro/ministral-3-bone-fracture
zarnite/naizz-tts-v1
MrIA-boy/LukMK1
pankajmathur/qwen3-0.6b-codeforces-finetuned-full
lewtun/dummy-trl-model
Reinforcement Learning
•
Updated
•
7
•
1
ybelkada/gpt-neo-125m-detox
Reinforcement Learning
•
Updated
•
9
ybelkada/gpt-neo-125m-detoxified-long-context
Reinforcement Learning
•
Updated
•
4
dshin/flan-t5-ppo
Reinforcement Learning
•
Updated
•
5
•
1
SummerSigh/T5-Base-Rule-Of-Thumb-RM
Reinforcement Learning
•
Updated
•
2
dshin/flan-t5-ppo-testing
Reinforcement Learning
•
Updated
•
7
•
1
SummerSigh/T5-Base-EvilPrompterRM
Reinforcement Learning
•
0.2B
•
Updated
•
7
dshin/flan-t5-ppo-testing-violation
Reinforcement Learning
•
Updated
•
1
dshin/flan-t5-ppo-user-b
Reinforcement Learning
•
Updated
•
2
dshin/flan-t5-ppo-user-h-use-violation
Reinforcement Learning
•
Updated
•
4
dshin/flan-t5-ppo-user-f-use-violation
Reinforcement Learning
•
Updated
•
4
dshin/flan-t5-ppo-user-e-use-violation
Reinforcement Learning
•
Updated
•
4
dshin/flan-t5-ppo-user-a-use-violation
Reinforcement Learning
•
Updated
•
4
dshin/flan-t5-ppo-user-h-batch-size-8-epoch-0
Reinforcement Learning
•
Updated
•
3