Models

73,249

Full-text search

Active filters: reinforcement-learning

nvidia/NitroGen

Reinforcement Learning • Updated Feb 5 • 532

Adilbai/stock-trading-rl-agent

Reinforcement Learning • Updated Jan 8 • 396 • 144

diasAiMaster/unitree-go2-velocity-flat

Reinforcement Learning • Updated 3 days ago • 3

OpenEnvisionLab/Auto-Rubric-as-Reward

Text-to-Image • Updated about 22 hours ago • 2

saadxsalman/Q-SS-0.5B-Reasoning-Math

Text Generation • 0.5B • Updated about 13 hours ago • 2

edbeeching/decision-transformer-gym-hopper-expert

Reinforcement Learning • Updated Jun 29, 2022 • 300 • 20

mradermacher/Tifa-Deepsex-14b-CoT-i1-GGUF

Reinforcement Learning • 15B • Updated Feb 13, 2025 • 481 • 14

Open-Reasoner-Zero/Open-Reasoner-Zero-7B

Reinforcement Learning • 8B • Updated Apr 7, 2025 • 1.75k • 34

ValueFX9507/Tifa-DeepsexV3-14b-GGUF-Q6

Reinforcement Learning • 15B • Updated Jul 1, 2025 • 13.4k • 43

Veri-Code/ReForm-14B-RL-entropy

Text Generation • 15B • Updated 6 days ago • 27 • 3

tensorblock/Nellyw888_VeriReason-codeLlama-7b-RTLCoder-Verilog-GRPO-reasoning-tb-GGUF

Reinforcement Learning • 7B • Updated Jan 27 • 3 • 1

InfiX-ai/InfiGUI-G1-7B

Image-Text-to-Text • 8B • Updated Aug 12, 2025 • 110 • 12

Schrieffer/Llama-SARM-4B

Reinforcement Learning • 5B • Updated Dec 11, 2025 • 23 • 2

mradermacher/ATLAS-8B-Thinking-GGUF

Reinforcement Learning • 8B • Updated Sep 13, 2025 • 266 • 2

JonusNattapong/AI-XAUUSD-Trading

Reinforcement Learning • Updated Oct 10, 2025 • 34

PRIME-RL/P1-30B-A3B

Text Generation • 31B • Updated Oct 24, 2025 • 249 • 11

Freakz3z/Qwen-JSON

Text Generation • 4B • Updated Dec 3, 2025 • 39 • 3

zai-org/GLM-TTS

Text-to-Speech • Updated Jan 12 • 924 • 337

gudo7208/CAD-Coder

Text Generation • 8B • Updated Jan 9 • 517 • 3

exla-ai/openpie-0.6

Robotics • Updated Feb 4 • 121 • 21

PrimeIntellect/INTELLECT-3.1

Text Generation • 107B • Updated Feb 18 • 222 • 43

mradermacher/PulseMind-72B-i1-GGUF

Reinforcement Learning • 73B • Updated Jan 30 • 278 • 2

Dat1710/nexus-1.5b

Text Generation • 2B • Updated 6 days ago • 121 • 1

nvidia/GEAR-SONIC

Reinforcement Learning • Updated Apr 11 • 42

nvidia/EGM-8B

Image-Text-to-Text • 9B • Updated Apr 10 • 613 • 8

Tzafon/Northstar-CUA-Fast

Image-Text-to-Text • 5B • Updated Apr 2 • 2k • 5

jasonmsilvas1984/stock-trading-rl-agent

Reinforcement Learning • Updated Mar 6 • 1

waltgrace/poker-gemma4-26b-a4b-lora

Image-Text-to-Text • Updated 26 days ago • 2

Falconss1/VideoThinker-R1-Bias-3B

Video-Text-to-Text • 4B • Updated 21 days ago • 21 • 1

mradermacher/VideoThinker-R1-Bias-3B-GGUF

Question Answering • 3B • Updated 19 days ago • 674 • 1