Index of /helm/benchmark_output/releases/v1.0.0-links/groups/latex

[ICO]NameLast modifiedSizeDescription

[DIR]Parent Directory  -  
[   ]core_scenarios_general_information.tex12-Jan-2024 16:20 17K 
[   ]core_scenarios_efficiency.tex12-Jan-2024 16:20 8.0K 
[   ]core_scenarios_accuracy.tex12-Jan-2024 16:20 7.1K 
[   ]legalbench_legalbench.tex12-Jan-2024 16:20 4.4K 
[   ]legalbench_legalbench_subset:corporate_lobbying.tex12-Jan-2024 16:20 4.3K 
[   ]math_chain_of_thought_math_chain_of_thought.tex12-Jan-2024 16:20 4.3K 
[   ]narrative_qa_narrative_qa_.tex12-Jan-2024 16:20 4.0K 
[   ]math_chain_of_thought_math_chain_of_thought_subject:intermediate_algebra,level:1,use_official_examples:False,use_chain_of_thought:True.tex12-Jan-2024 16:20 4.0K 
[   ]math_chain_of_thought_math_chain_of_thought_subject:geometry,level:1,use_official_examples:False,use_chain_of_thought:True.tex12-Jan-2024 16:20 4.0K 
[   ]math_chain_of_thought_math_chain_of_thought_subject:counting_and_probability,level:1,use_official_examples:False,use_chain_of_thought:True.tex12-Jan-2024 16:20 4.0K 
[   ]math_chain_of_thought_math_chain_of_thought_subject:prealgebra,level:1,use_official_examples:False,use_chain_of_thought:True.tex12-Jan-2024 16:20 3.9K 
[   ]math_chain_of_thought_math_chain_of_thought_subject:precalculus,level:1,use_official_examples:False,use_chain_of_thought:True.tex12-Jan-2024 16:20 3.9K 
[   ]math_chain_of_thought_math_chain_of_thought_subject:algebra,level:1,use_official_examples:False,use_chain_of_thought:True.tex12-Jan-2024 16:20 3.9K 
[   ]wmt_14_wmt_14_source_language:hi,target_language:en.tex12-Jan-2024 16:20 3.8K 
[   ]legalbench_legalbench_subset:function_of_decision_section.tex12-Jan-2024 16:20 3.8K 
[   ]wmt_14_wmt_14_source_language:de,target_language:en.tex12-Jan-2024 16:20 3.8K 
[   ]wmt_14_wmt_14_source_language:cs,target_language:en.tex12-Jan-2024 16:20 3.8K 
[   ]wmt_14_wmt_14_source_language:ru,target_language:en.tex12-Jan-2024 16:20 3.8K 
[   ]wmt_14_wmt_14_source_language:fr,target_language:en.tex12-Jan-2024 16:20 3.8K 
[   ]wmt_14_wmt_14.tex12-Jan-2024 16:20 3.8K 
[   ]legalbench_legalbench_subset:abercrombie.tex12-Jan-2024 16:20 3.6K 
[   ]math_chain_of_thought_math_chain_of_thought_subject:number_theory,level:1,use_official_examples:False,use_chain_of_thought:True.tex12-Jan-2024 16:20 3.6K 
[   ]natural_qa_openbook_longans_natural_qa_openbook_longans_mode:openbook_longans.tex12-Jan-2024 16:20 3.6K 
[   ]med_qa_med_qa_.tex12-Jan-2024 16:20 3.5K 
[   ]mmlu_mmlu_subject:econometrics.tex12-Jan-2024 16:20 3.5K 
[   ]mmlu_mmlu.tex12-Jan-2024 16:20 3.4K 
[   ]legalbench_legalbench_subset:proa.tex12-Jan-2024 16:20 3.4K 
[   ]natural_qa_closedbook_natural_qa_closedbook_mode:closedbook.tex12-Jan-2024 16:20 3.3K 
[   ]legalbench_legalbench_subset:international_citizenship_questions.tex12-Jan-2024 16:20 2.9K 
[   ]gsm_gsm_.tex12-Jan-2024 16:20 2.8K 
[   ]openbookqa_openbookqa_.tex12-Jan-2024 16:20 2.8K 
[   ]mmlu_mmlu_subject:us_foreign_policy.tex12-Jan-2024 16:20 2.7K 
[   ]mmlu_mmlu_subject:computer_security.tex12-Jan-2024 16:20 2.7K 
[   ]mmlu_mmlu_subject:abstract_algebra.tex12-Jan-2024 16:20 2.7K 
[   ]mmlu_mmlu_subject:college_chemistry.tex12-Jan-2024 16:20 2.7K 

Apache/2.2.15 (CentOS) Server at nlp.stanford.edu Port 443