兼欣 | 7eb9016908 | update agent benchmarks and add qwen-72b results | 1 year ago
yangapku | fc57dea277 | release latest models | 1 year ago
yangapku | b86a0f2c8a | update EVALUATION.md | 1 year ago
feihu.hf | 4864f7b278 | fix format problems in evaluation code; update ceval extraction rules | 1 year ago
兼欣 | 9139fbdf99 | release the evaluation benchmark for tool use; update tool use results to that of the hf version | 1 year ago
feihu.hf | 680a3e8bb8 | update EVALUATION.md | 1 year ago
JustinLin610 | ba2d85a13b | first commit | 1 year ago