122 Commits (204c2c59f49cfa7461e8e02d5ad2f6b3d082f08c)

Author SHA1 Message Date
yangapku 29fea23f87 update README 1 year ago
苏阳 23a01b0696 Add Docker image for CUDA-12.1. 1 year ago
苏阳 35023b6f2a Add multinode finetuning section into README. 1 year ago
feihu.hf ea86f6136a add run gptq 1 year ago
兼欣 508acdeb88 add openai version requirement (openai<1.0) 1 year ago
feihu.hf b7eb73d6ec update readme for vllm-gptq 1 year ago
兼欣 cadc4c7d1a fix typo 1 year ago
兼欣 7eb9016908 update agent benchmarks and add qwen-72b results 1 year ago
yangapku c4fdd89d20 update README 1 year ago
yangapku b1d80a9385 add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support 1 year ago
yangapku e8e15962d8 add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support 1 year ago
lukeming.lkm 845dc08474 add modelscope links for int8 models 1 year ago
Junyang Lin d082c2c926
Update README.md 1 year ago
JustinLin610 c908968cea update readme 1 year ago
JustinLin610 899bc5bb98 update news 1 year ago
JustinLin610 e6d8deb975 add french readme 1 year ago
yangapku 93963f8d1f add result of int8 models 1 year ago
JustinLin610 235aa8f71e update readme 1 year ago
yangapku 78352b5a79 update readme about batch inference 1 year ago
Wang Peng c73a065849
Update README.md, update batch infer 1 year ago
Junyang Lin 4eee29e790
Merge pull request #442 from QwenLM/logicwong-patch-2
Update README.md, add batch inference
1 year ago
lukeming.lkm e6f2a7af6d update readme 1 year ago
Wang Peng bef488ba2c
Update README.md, add batch inference 1 year ago
Yang An 1d5f3503fb
Update README.md 1 year ago
JustinLin610 ce1ca46099 update readme 1 year ago
Junyang Lin 12e4c8bda5
Update README.md 1 year ago
Junyang Lin c7cf15dbdc
Update README.md 1 year ago
Junyang Lin 581512f6b5
Update README.md 1 year ago
Junyang Lin ee5350521e
Update README.md 1 year ago
JustinLin610 3261c62f74 update citation 1 year ago
JustinLin610 360fca3f87 add citation 1 year ago
JustinLin610 6e987235d8 update readme 1 year ago
JustinLin610 0b55158031 update readme 1 year ago
JustinLin610 83eac494b2 update readme 1 year ago
JustinLin610 b5fad3d561 fix single-gpu qlora, and add profiling 1 year ago
Junyang Lin fc7e37a9e4
Update README.md 1 year ago
Yang An c586c20d85
Update README.md 1 year ago
yangapku 3e5ade9352 update readme 1 year ago
yangapku 04ee3ec9eb update readme 1 year ago
yangapku 26da1a2f9d update kvcache 1 year ago
simonJJJ 8c02bef17d qwen.cpp news 1 year ago
simonJJJ 0efa58245d qwen.cpp link 1 year ago
Junyang Lin a46024035b
Update README.md
typo
1 year ago
Junyang Lin 1e0821b3b1
Update README.md 1 year ago
Junyang Lin 111190e21e
Update README.md 1 year ago
feihu.hf d201cba3f4 update baseline scores 1 year ago
季仁 84b62b47c4 update 1 year ago
Iurnem 06ba6f08ae
Update README.md 1 year ago
Iurnem 9de10e77e9
Update README.md 1 year ago
yangapku fc57dea277 release latest models 1 year ago