You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
JustinLin610 ba2d85a13b first commit 1 year ago
..
EVALUATION.md first commit 1 year ago
evaluate_ceval.py first commit 1 year ago
evaluate_gsm8k.py first commit 1 year ago
evaluate_humaneval.py first commit 1 year ago
evaluate_mmlu.py first commit 1 year ago
gsm8k_prompt.txt first commit 1 year ago