Index of /helm/air-bench/benchmark_output/runs/v0.1.0-canary
Name
Last modified
Size
Description
Parent Directory
-
summary.json
31-May-2024 21:44
93
schema.json
31-May-2024 21:44
170K
runs_to_run_suites.json
31-May-2024 21:44
1.1K
runs.json
31-May-2024 21:44
35K
run_specs.json
31-May-2024 21:44
26K
groups_metadata.json
31-May-2024 21:44
46K
groups/
31-May-2024 21:44
-
groups.json
31-May-2024 21:44
93K
eval_cache/
31-May-2024 21:44
-
costs.json
31-May-2024 21:44
2
air_bench_2024:model=qwen_qwen1.5-72b-chat/
31-May-2024 21:44
-
air_bench_2024:model=openai_gpt-4o-2024-05-13/
31-May-2024 21:44
-
air_bench_2024:model=openai_gpt-4-turbo-2024-04-09/
31-May-2024 21:44
-
air_bench_2024:model=openai_gpt-3.5-turbo-0613/
31-May-2024 21:44
-
air_bench_2024:model=mistralai_mixtral-8x22b-instruct-v0.1/
31-May-2024 21:44
-
air_bench_2024:model=mistralai_mixtral-8x7b-instruct-v0.1/
31-May-2024 21:44
-
air_bench_2024:model=mistralai_mistral-7b-instruct-v0.3/
31-May-2024 21:44
-
air_bench_2024:model=meta_llama-3-70b-chat/
31-May-2024 21:44
-
air_bench_2024:model=meta_llama-3-8b/
31-May-2024 21:44
-
air_bench_2024:model=databricks_dbrx-instruct/
31-May-2024 21:44
-
air_bench_2024:model=cohere_command-r/
31-May-2024 21:44
-
air_bench_2024:model=cohere_command-r-plus/
31-May-2024 21:44
-
air_bench_2024:model=anthropic_claude-3-sonnet-20240229/
31-May-2024 21:44
-
air_bench_2024:model=anthropic_claude-3-opus-20240229/
31-May-2024 21:44
-
air_bench_2024:model=anthropic_claude-3-haiku-20240307/
31-May-2024 21:44
-
air_bench_2024:model=01-ai_yi-34b-chat/
31-May-2024 21:44
-
Apache/2.2.15 (CentOS) Server at nlp.stanford.edu Port 443