![[ICO]](/icons/blank.gif) | Name | Last modified | Size | Description |
--- | --- | --- | --- | --- |
![[DIR]](/icons/back.gif) | Parent Directory | | - | |
![[DIR]](/icons/folder.gif) | gsm:model=snowflake_snowflake-arctic-instruct,stop=none/ | 17-Jun-2024 10:34 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=qwen_qwen2-72b-instruct,stop=none/ | 17-Jun-2024 10:34 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=openai_gpt-4o-2024-05-13,stop=none/ | 17-Jun-2024 10:34 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=qwen_qwen1.5-110b-chat,stop=none/ | 17-Jun-2024 10:34 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=openai_gpt-4-turbo-2024-04-09,stop=none/ | 17-Jun-2024 10:34 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=mistralai_mistral-7b-instruct-v0.3,stop=none/ | 17-Jun-2024 10:34 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=01-ai_yi-large-preview/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=cohere_command-r/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=cohere_command-r-plus/ | 14-Jun-2024 18:28 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:51 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:51 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:51 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:51 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=01-ai_yi-large-preview/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=snowflake_snowflake-arctic-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=01-ai_yi-large-preview/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=qwen_qwen2-72b-instruct/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=cohere_command-r/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=cohere_command-r-plus/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=qwen_qwen1.5-110b-chat/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=google_gemini-1.5-flash-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=openai_gpt-4o-2024-05-13/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=google_gemini-1.5-pro-001/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=cohere_command-r-plus/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=openai_gpt-4-turbo-2024-04-09/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=mistralai_mistral-7b-instruct-v0.3/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=cohere_command-r/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=01-ai_yi-large-preview/ | 14-Jun-2024 16:50 | - | |
![[DIR]](/icons/folder.gif) | eval_cache/ | 14-Jun-2024 15:11 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:39 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=google_gemini-1.5-flash-preview-0514/ | 15-May-2024 16:18 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=ru-en,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=hi-en,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=fr-en,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=de-en,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | wmt_14:language_pair=cs-en,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=openbook_longans,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | natural_qa:mode=closedbook,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | narrative_qa:model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=us_foreign_policy,method=multiple_choice_joint,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=econometrics,method=multiple_choice_joint,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=computer_security,method=multiple_choice_joint,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=college_chemistry,method=multiple_choice_joint,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | mmlu:subject=abstract_algebra,method=multiple_choice_joint,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | med_qa:model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=precalculus,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=prealgebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=number_theory,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=intermediate_algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=geometry,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=counting_and_probability,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | math:subject=algebra,level=1,use_official_examples=False,use_chain_of_thought=True,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=proa,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=international_citizenship_questions,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=function_of_decision_section,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=corporate_lobbying,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | legalbench:subset=abercrombie,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | gsm:model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=google_gemini-1.5-pro-preview-0409/ | 01-May-2024 22:07 | - | |
![[DIR]](/icons/folder.gif) | commonsense:dataset=openbookqa,method=multiple_choice_joint,model=google_gemini-1.0-pro-001/ | 01-May-2024 22:07 | - | |