Models (931)

Active filters: orpo, trl

baconnier/Finance_dolphin-2.9.1-yi-1.5-34b

Text Generation • 34B • Updated Jun 8, 2024 • 3

dayron24/OrpoLlama-3-8B

Text Generation • 8B • Updated Jun 9, 2024 • 3

dchoi44/llama-3-orpo-qlora

Updated Jun 24, 2024 • 2

WbjuSrceu/mistral-7b-instruct-v0.2

Text Generation • 7B • Updated Jun 10, 2024 • 3

iRyanBell/llama3.0-ARC1-II-8b

Text Generation • 8B • Updated Jun 21, 2024 • 25 • 1

retinol/eva_instr_1.2

Text Generation • 8B • Updated Jun 12, 2024 • 3

kevin009/llama

Text Generation • 8B • Updated Jun 13, 2024 • 2

damienbenveniste/HW2-orpo

Text Generation • 0.1B • Updated Jun 4, 2025 • 3

kevin009/llama19

Text Generation • 8B • Updated Jun 16, 2024 • 1

kevin009/llama21

Text Generation • 8B • Updated Jun 17, 2024 • 1

kevin009/llama22-instruct

Text Generation • 8B • Updated Jun 17, 2024 • 2

aisuko/ft-openelm-270m-ultrafeedback

Text Generation • 0.3B • Updated Jun 24, 2024 • 7

coorung/EEVE_10b_ORPO_trained

Text Generation • 11B • Updated Jun 24, 2024 • 3

Samhita/OrpoLlama-3-8B-Instruct-Copy

Updated Jun 24, 2024

ItchyChin/OrpoLlama-3-8B-memorize

Text Generation • 8B • Updated Jun 26, 2024 • 2

ishant0121/zephyr-orpo-141b-A35b

Text Generation • 1B • Updated Jun 26, 2024 • 3

Roshgupta/orpo-tiny-llama

Updated Jun 27, 2024

Roshgupta/orpo-phi3

Updated Jun 28, 2024

coorung/EEVE_10b_ORPO_trained_v2

Text Generation • 11B • Updated Jun 28, 2024 • 5

sert121/results

Updated Jun 30, 2024 • 1

ItchyChin/results

Updated Nov 11, 2024 • 2

sert121/llama3-lora-aligned-orpo

Updated Jun 30, 2024 • 2

sert121/llama3-lora-aligned-orpo-4epochs

Updated Jun 30, 2024 • 2

Sambaro/orpo-phi3

Updated Jul 3, 2024

sert121/llama3-lora-aligned-orpo-beta-0.2

Updated Jul 1, 2024 • 3

joswin03/ORPO-PHI-3

4B • Updated Jul 1, 2024

sert121/llama3-lora-aligned-orpo-low

Updated Jul 1, 2024 • 2

2v2/llama-3-bllossom-orpo-kdj

Text Generation • 8B • Updated Jul 4, 2024 • 4

sert121/orpo_run_yash_beta_0.1_preference_data_v8_for_orpo_thirds_data

Updated Jul 5, 2024 • 2

sert121/orpo_run_yash_beta_0.3_preference_data_v8_for_orpo

Updated Jul 5, 2024