Index of /helm/safety/benchmark_output/runs/v1.7.0
Name
Last modified
Size
Description
Parent Directory
-
eval_cache/
16-May-2025 18:25
-
harm_bench:model=writer_palmyra-med/
19-May-2025 10:17
-
anthropic_red_team:model=writer_palmyra-x5/
19-May-2025 10:18
-
bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=writer_palmyra-med/
19-May-2025 10:18
-
harm_bench:model=writer_palmyra-x5/
19-May-2025 10:18
-
simple_safety_tests:model=writer_palmyra-med/
19-May-2025 10:18
-
bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=qwen_qwen3-235b-a22b-fp8-tput/
19-May-2025 10:18
-
simple_safety_tests:model=qwen_qwen3-235b-a22b-fp8-tput/
19-May-2025 10:18
-
anthropic_red_team:model=writer_palmyra-med/
19-May-2025 10:18
-
bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=writer_palmyra-x5/
19-May-2025 10:18
-
simple_safety_tests:model=writer_palmyra-x5/
19-May-2025 10:18
-
xstest:model=writer_palmyra-med/
19-May-2025 10:18
-
harm_bench:model=qwen_qwen3-235b-a22b-fp8-tput/
19-May-2025 10:18
-
xstest:model=writer_palmyra-x5/
19-May-2025 10:18
-
anthropic_red_team:model=qwen_qwen3-235b-a22b-fp8-tput/
19-May-2025 10:18
-
xstest:model=qwen_qwen3-235b-a22b-fp8-tput/
19-May-2025 10:18
-
Apache/2.2.15 (CentOS) Server at nlp.stanford.edu Port 443