25 13 2

Princeton NLP group

princeton-nlp

ashdev's profile picture

Laeeth's profile picture

akibc123's profile picture

https://princeton-nlp.github.io

princeton_nlp
princeton-nlp

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents

new activity 3 months ago

HuggingFaceTB/FineMath-Llama-3B:Hyperparameters

updated a collection 5 months ago

RLMT Experiments

View all activity

Organizations

princeton-nlp 's collections 6

RLMT Experiments

The *RLMT* collection. Coming soon!

princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct

8B • Updated Sep 22, 2025 • 1
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct

8B • Updated Sep 22, 2025 • 4
princeton-nlp/warm-start__sft__think__Llama-3.1-8B

8B • Updated Sep 22, 2025
princeton-nlp/warm-start__sft__think__Qwen2.5-7B

8B • Updated Sep 22, 2025 • 1

SWE-bench

SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.

princeton-nlp/SWE-bench

Viewer • Updated Mar 3, 2025 • 21.5k • 29.2k • 134
princeton-nlp/SWE-bench_Lite

Viewer • Updated Mar 3, 2025 • 323 • 71.6k • 53
princeton-nlp/SWE-bench_Multimodal

Viewer • Updated Jan 13, 2025 • 612 • 783 • 21
princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18, 2025 • 500 • 567k • 300

Sheared Llama

princeton-nlp/Sheared-LLaMA-1.3B

Text Generation • Updated Jan 23, 2024 • 9.7k • 99
princeton-nlp/Sheared-LLaMA-2.7B

Text Generation • Updated Jan 23, 2024 • 2.39k • 61
princeton-nlp/Sheared-LLaMA-1.3B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 772 • 10
princeton-nlp/Sheared-LLaMA-2.7B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 777 • 8

SimPO

This collections contains a list of SimPO and baseline models.

princeton-nlp/gemma-2-9b-it-SimPO

Text Generation • 9B • Updated Aug 2, 2024 • 717 • • 172
princeton-nlp/gemma-2-9b-it-DPO

Text Generation • 9B • Updated Jul 18, 2024 • 24 • • 9
princeton-nlp/Llama-3-Base-8B-SFT-IPO

Text Generation • 8B • Updated Jun 17, 2024 • 20 • • 1
princeton-nlp/Llama-3-Base-8B-SFT-DPO

Text Generation • 8B • Updated Jun 17, 2024 • 765 •

ProLong

ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K

princeton-nlp/Llama-3-8B-ProLong-64k-Base

Text Generation • 8B • Updated Oct 31, 2024 • 8.13k • • 5
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct

Text Generation • 8B • Updated Oct 31, 2024 • 8.4k • • 13
princeton-nlp/Llama-3-8B-ProLong-512k-Base

8B • Updated Oct 31, 2024 • 8.06k • 9
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

8B • Updated Oct 31, 2024 • 8.17k • 26

SimCSE

princeton-nlp/unsup-simcse-bert-base-uncased

Feature Extraction • Updated Nov 11, 2022 • 1.38k • • 5
princeton-nlp/unsup-simcse-bert-large-uncased

Feature Extraction • Updated Nov 15, 2022 • 33 • 1
princeton-nlp/unsup-simcse-roberta-base

Feature Extraction • Updated Jun 16, 2021 • 588 • 9
princeton-nlp/unsup-simcse-roberta-large

Feature Extraction • Updated Jun 16, 2021 • 92 • 3

RLMT Experiments

The *RLMT* collection. Coming soon!

princeton-nlp/warm-start__sft__think__Llama-3.1-8B-Instruct

8B • Updated Sep 22, 2025 • 1
princeton-nlp/warm-start__sft__nothink__Qwen2.5-7B-Instruct

8B • Updated Sep 22, 2025 • 4
princeton-nlp/warm-start__sft__think__Llama-3.1-8B

8B • Updated Sep 22, 2025
princeton-nlp/warm-start__sft__think__Qwen2.5-7B

8B • Updated Sep 22, 2025 • 1

SimPO

This collections contains a list of SimPO and baseline models.

princeton-nlp/gemma-2-9b-it-SimPO

Text Generation • 9B • Updated Aug 2, 2024 • 717 • • 172
princeton-nlp/gemma-2-9b-it-DPO

Text Generation • 9B • Updated Jul 18, 2024 • 24 • • 9
princeton-nlp/Llama-3-Base-8B-SFT-IPO

Text Generation • 8B • Updated Jun 17, 2024 • 20 • • 1
princeton-nlp/Llama-3-Base-8B-SFT-DPO

Text Generation • 8B • Updated Jun 17, 2024 • 765 •

SWE-bench

SWE-bench is a benchmark for evaluating Language Models and AI Systems on their ability resolve real world GitHub Issues.

princeton-nlp/SWE-bench

Viewer • Updated Mar 3, 2025 • 21.5k • 29.2k • 134
princeton-nlp/SWE-bench_Lite

Viewer • Updated Mar 3, 2025 • 323 • 71.6k • 53
princeton-nlp/SWE-bench_Multimodal

Viewer • Updated Jan 13, 2025 • 612 • 783 • 21
princeton-nlp/SWE-bench_Verified

Viewer • Updated Feb 18, 2025 • 500 • 567k • 300

ProLong

ProLong is a family of long-context models that are continued trained and supervised fine-tuned from Llama-3-8B, with a maximum context window of 512K

princeton-nlp/Llama-3-8B-ProLong-64k-Base

Text Generation • 8B • Updated Oct 31, 2024 • 8.13k • • 5
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct

Text Generation • 8B • Updated Oct 31, 2024 • 8.4k • • 13
princeton-nlp/Llama-3-8B-ProLong-512k-Base

8B • Updated Oct 31, 2024 • 8.06k • 9
princeton-nlp/Llama-3-8B-ProLong-512k-Instruct

8B • Updated Oct 31, 2024 • 8.17k • 26

Sheared Llama

princeton-nlp/Sheared-LLaMA-1.3B

Text Generation • Updated Jan 23, 2024 • 9.7k • 99
princeton-nlp/Sheared-LLaMA-2.7B

Text Generation • Updated Jan 23, 2024 • 2.39k • 61
princeton-nlp/Sheared-LLaMA-1.3B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 772 • 10
princeton-nlp/Sheared-LLaMA-2.7B-ShareGPT

Text Generation • Updated Dec 4, 2023 • 777 • 8

SimCSE

princeton-nlp/unsup-simcse-bert-base-uncased

Feature Extraction • Updated Nov 11, 2022 • 1.38k • • 5
princeton-nlp/unsup-simcse-bert-large-uncased

Feature Extraction • Updated Nov 15, 2022 • 33 • 1
princeton-nlp/unsup-simcse-roberta-base

Feature Extraction • Updated Jun 16, 2021 • 588 • 9
princeton-nlp/unsup-simcse-roberta-large

Feature Extraction • Updated Jun 16, 2021 • 92 • 3