Index of /helm/safety/benchmark_output/runs/v1.1.0

[ICO]  Name  Last modified  Size  Description

[DIR]  Parent Directory  -
[DIR]  xstest:model=deepseek-ai_deepseek-r1-hide-reasoning/  13-Feb-2025 12:41  -
[DIR]  bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=deepseek-ai_deepseek-r1-hide-reasoning/  13-Feb-2025 12:41  -
[DIR]  anthropic_red_team:model=deepseek-ai_deepseek-r1-hide-reasoning/  13-Feb-2025 12:41  -
[DIR]  harm_bench:model=deepseek-ai_deepseek-r1-hide-reasoning/  13-Feb-2025 12:41  -
[DIR]  simple_safety_tests:model=deepseek-ai_deepseek-r1-hide-reasoning/  13-Feb-2025 12:41  -
[DIR]  bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=deepseek-ai_deepseek-r1/  11-Feb-2025 14:33  -
[DIR]  xstest:model=openai_o1-2024-12-17/  10-Feb-2025 09:14  -
[DIR]  anthropic_red_team:model=openai_o1-2024-12-17/  10-Feb-2025 09:14  -
[DIR]  anthropic_red_team:model=deepseek-ai_deepseek-r1/  10-Feb-2025 09:14  -
[DIR]  xstest:model=deepseek-ai_deepseek-r1/  10-Feb-2025 09:14  -
[DIR]  bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=openai_o1-2024-12-17/  10-Feb-2025 09:14  -
[DIR]  harm_bench:model=openai_o1-2024-12-17/  10-Feb-2025 09:14  -
[DIR]  harm_bench:model=deepseek-ai_deepseek-r1/  10-Feb-2025 09:14  -
[DIR]  simple_safety_tests:model=openai_o1-2024-12-17/  10-Feb-2025 09:14  -
[DIR]  simple_safety_tests:model=deepseek-ai_deepseek-r1/  10-Feb-2025 09:14  -
[DIR]  xstest:model=openai_o3-mini-2025-01-31/  07-Feb-2025 17:07  -
[DIR]  xstest:model=openai_o1-mini-2024-09-12/  07-Feb-2025 17:07  -
[DIR]  harm_bench:model=openai_o3-mini-2025-01-31/  07-Feb-2025 17:07  -
[DIR]  harm_bench:model=openai_o1-mini-2024-09-12/  07-Feb-2025 17:07  -
[DIR]  simple_safety_tests:model=openai_o3-mini-2025-01-31/  07-Feb-2025 17:07  -
[DIR]  simple_safety_tests:model=openai_o1-mini-2024-09-12/  07-Feb-2025 17:07  -
[DIR]  bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=openai_o1-mini-2024-09-12/  07-Feb-2025 17:07  -
[DIR]  bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=openai_o3-mini-2025-01-31/  07-Feb-2025 17:07  -
[DIR]  anthropic_red_team:model=openai_o3-mini-2025-01-31/  07-Feb-2025 17:07  -
[DIR]  anthropic_red_team:model=openai_o1-mini-2024-09-12/  07-Feb-2025 17:07  -
[DIR]  bbq:subject=all,method=multiple_choice_joint,max_train_instances=0,model=deepseek-ai_deepseek-v3/  06-Feb-2025 13:17  -
[DIR]  xstest:model=deepseek-ai_deepseek-v3/  06-Feb-2025 13:17  -
[DIR]  anthropic_red_team:model=deepseek-ai_deepseek-v3/  06-Feb-2025 13:17  -
[DIR]  harm_bench:model=deepseek-ai_deepseek-v3/  06-Feb-2025 13:17  -
[DIR]  simple_safety_tests:model=deepseek-ai_deepseek-v3/  06-Feb-2025 13:17  -
[DIR]  eval_cache/  05-Feb-2025 12:50  -

Apache/2.2.15 (CentOS) Server at nlp.stanford.edu Port 443