Reliable and Efficient Amortized Model-Based Evaluation Datasets and Models for the REEval project stair-lab/reeval Viewer • Updated Jun 21 • 5.69M • 287 • 1 stair-lab/reeval-difficulty-for-helm Viewer • Updated Mar 18 • 217k • 100
Gathering Context for Decision Support with LLMs stair-lab/bosd_initial_dataset Viewer • Updated Jan 7 • 568 • 4
Dynamics of Learning Datasets and Models for the CodeInsights Projects stair-lab/code_insights_jsons Preview • Updated Dec 11, 2024 • 1 stair-lab/code_insights_csv Viewer • Updated Apr 16 • 3.07M • 22 • 1 stair-lab/code_insights_matrices Preview • Updated Dec 12, 2024 • 15 stair-lab/code-insights-llm_simulator Text Generation • 8B • Updated Sep 8, 2024
Nonmyopic Bayesian Optimization in Dynamic Cost Settings Datasets and Models for the Nonmyopic BO project stair-lab/semi_synthetic_protein_2p12_gemma_7b Viewer • Updated Dec 18, 2024 • 12.3k • 32 stair-lab/proteinea_fluorescence-embedding Viewer • Updated Dec 18, 2024 • 188k • 86
Finetuning and Comprehensive Evaluation of Vietnamese LLM stair-lab/MATH_vi Viewer • Updated Sep 1, 2024 • 25k • 42 • 2 stair-lab/VSMEC Viewer • Updated Sep 1, 2024 • 6.24k • 23 stair-lab/ViHSD Viewer • Updated Sep 1, 2024 • 30.7k • 13 stair-lab/VSFC Viewer • Updated Sep 1, 2024 • 14.6k • 12
Cultural Alignment akhilayerukola/NormAd Viewer • Updated Oct 25, 2024 • 2.63k • 286 • 2 ura-hcmut/ECLeKTic Preview • Updated Jun 5 • 22 • 1 ToxicityPrompts/PolyGuardPrompts Viewer • Updated Jun 23 • 29.3k • 195 • 2 SALT-NLP/CultureBank Viewer • Updated Apr 24, 2024 • 23k • 136 • 15
Reliable and Efficient Amortized Model-Based Evaluation Datasets and Models for the REEval project stair-lab/reeval Viewer • Updated Jun 21 • 5.69M • 287 • 1 stair-lab/reeval-difficulty-for-helm Viewer • Updated Mar 18 • 217k • 100
Nonmyopic Bayesian Optimization in Dynamic Cost Settings Datasets and Models for the Nonmyopic BO project stair-lab/semi_synthetic_protein_2p12_gemma_7b Viewer • Updated Dec 18, 2024 • 12.3k • 32 stair-lab/proteinea_fluorescence-embedding Viewer • Updated Dec 18, 2024 • 188k • 86
Gathering Context for Decision Support with LLMs stair-lab/bosd_initial_dataset Viewer • Updated Jan 7 • 568 • 4
Finetuning and Comprehensive Evaluation of Vietnamese LLM stair-lab/MATH_vi Viewer • Updated Sep 1, 2024 • 25k • 42 • 2 stair-lab/VSMEC Viewer • Updated Sep 1, 2024 • 6.24k • 23 stair-lab/ViHSD Viewer • Updated Sep 1, 2024 • 30.7k • 13 stair-lab/VSFC Viewer • Updated Sep 1, 2024 • 14.6k • 12
Dynamics of Learning Datasets and Models for the CodeInsights Projects stair-lab/code_insights_jsons Preview • Updated Dec 11, 2024 • 1 stair-lab/code_insights_csv Viewer • Updated Apr 16 • 3.07M • 22 • 1 stair-lab/code_insights_matrices Preview • Updated Dec 12, 2024 • 15 stair-lab/code-insights-llm_simulator Text Generation • 8B • Updated Sep 8, 2024
Cultural Alignment akhilayerukola/NormAd Viewer • Updated Oct 25, 2024 • 2.63k • 286 • 2 ura-hcmut/ECLeKTic Preview • Updated Jun 5 • 22 • 1 ToxicityPrompts/PolyGuardPrompts Viewer • Updated Jun 23 • 29.3k • 195 • 2 SALT-NLP/CultureBank Viewer • Updated Apr 24, 2024 • 23k • 136 • 15