DCAgent2/dev_set_v2_exp_gfi_swesmith_random_filtered_10K_glm_4_7_traces_jupiter_20260222_211730 Viewer • Updated 1 day ago • 294 • 3
DCAgent2/swebench_verified_random_100_folders_r2egym_nl2bash_stack_bugsseq_stack_php_v2_04369779 Viewer • Updated 1 day ago • 300 • 1
DCAgent2/swebench_verified_random_100_folders_bs64_rloo_n_noct_stri_micr_auto_conv_pref_884f30e7 Viewer • Updated 1 day ago • 300 • 2
DCAgent2/swebench_verified_random_100_folders_rl_base_code_contests_900s_160_20260222_212545 Viewer • Updated 1 day ago • 300 • 3
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_inferredbugs_32eps_65k_fixeps_2026cf6a6949 Viewer • Updated 1 day ago • 300 • 3
DCAgent2/swebench_verified_random_100_folders_glm46_swesmith_maxeps_131k_fixthink_20260210dfdc29 Viewer • Updated 1 day ago • 300 • 2
DCAgent2/dev_set_v2_exp_syh_tezos_askllm_hardened_glm_4_7_traces_jupiter_20260222_211728 Viewer • Updated 1 day ago • 292 • 2
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_taskmaster2_32eps_32k_fixeps_20260d6190a14 Viewer • Updated 1 day ago • 300 • 2
DCAgent2/swebench_verified_random_100_folders_r2egym_nl2bash_stack_bugsseq_crosscodeevalff90d816 Viewer • Updated 1 day ago • 300 • 2
DCAgent2/swebench_verified_random_100_folders_r2egym_nl2bash_stack_bugsseq_rl_crosscodee7b5a876e Viewer • Updated 1 day ago • 300 • 3
DCAgent2/swebench_verified_random_100_folders_rl_rl_conf_qwen_8b_ll_lr1e_5_bs64_yaml_modedb39190 Viewer • Updated 1 day ago • 300 • 3
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_gemini25flash_stackexchange_overfle1b941a3 Viewer • Updated 1 day ago • 300 • 3
DCAgent2/terminal_bench_2_r2egym_nl2bash_stack_bugsseq_crosscodeeval_python_v2_20260222_044006 Viewer • Updated 1 day ago • 267 • 2
DCAgent2/terminal_bench_2_exp_syh_tezos_stackoverflow_mixed_glm_4_7_traces_jupiter_202601c83c3b5 Viewer • Updated 1 day ago • 257 • 3
DCAgent2/terminal_bench_2_exp_syh_r2egym_swesmith_mixed_glm_4_7_traces_jupiter_20260222_044010 Viewer • Updated 1 day ago • 266 • 2
DCAgent2/terminal_bench_2_r2egym_nl2bash_stack_bugsseq_stack_php_v2_20260222_044012 Viewer • Updated 2 days ago • 267 • 2
DCAgent2/terminal_bench_2_r2egym_nl2bash_stack_bugsseq_rl_crosscodeeval_csharp_20260222_044008 Viewer • Updated 2 days ago • 267 • 4
DCAgent2/dev_set_v2_exp_uns_r2egym_33_6x_glm_4_7_traces_jupiter_20260221_222511 Viewer • Updated 2 days ago • 295 • 3
DCAgent2/terminal_bench_2_exp_gfi_staqc_askllm_filtered_10K_glm_4_7_traces_jupiter_20260d61cc4a3 Viewer • Updated 2 days ago • 259 • 3
DCAgent2/dev_set_v2_exp_syh_tezos_stackoverflow_mixed_glm_4_7_traces_jupiter_20260221_222504 Viewer • Updated 2 days ago • 285 • 5
DCAgent2/dev_set_v2_r2egym_nl2bash_stack_bugsseq_stack_php_v2_20260221_222500 Viewer • Updated 2 days ago • 297 • 3
DCAgent2/dev_set_v2_exp_uns_tezos_10x_glm_4_7_traces_jupiter_20260221_222502 Viewer • Updated 2 days ago • 289 • 2
DCAgent2/dev_set_v2_r2egym_nl2bash_stack_bugsseq_crosscodeeval_python_v2_20260221_222454 Viewer • Updated 2 days ago • 297 • 3
DCAgent2/dev_set_v2_r2egym_nl2bash_stack_bugsseq_rl_crosscodeeval_csharp_20260221_222456 Viewer • Updated 2 days ago • 297 • 5
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_1_0_traces_20260219_163757 Viewer • Updated 2 days ago • 264 • 2
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_kg_sandboxes_maxeps_32k_20260219_163802 Viewer • Updated 2 days ago • 263 • 2
DCAgent2/dev_set_v2_exp_swd_r2egym_wo_docker_glm_4_7_traces_20260221_125413 Viewer • Updated 3 days ago • 295 • 3
DCAgent2/dev_set_v2_glm46_Toolscale_tasks_traces_20260221_125415 Viewer • Updated 3 days ago • 297 • 3
DCAgent2/dev_set_v2_rl_rl_conf_qwen_8b_ll_lr1e_5_bs64_yaml_mode_path_r2eg_nl2b_stac_bugs24471e1b Viewer • Updated 3 days ago • 297 • 5
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_mind2web_sandboxes_maxeps_32k_202646ecac48 Viewer • Updated 3 days ago • 264 • 3