From ce5f0e18c6a55a6aa0c4ac74f0f238239230e67f Mon Sep 17 00:00:00 2001 From: Yang An Date: Tue, 22 Aug 2023 08:42:13 +0800 Subject: [PATCH] Update README_JA.md --- README_JA.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README_JA.md b/README_JA.md index 008037c..3eff97f 100644 --- a/README_JA.md +++ b/README_JA.md @@ -254,7 +254,7 @@ BF16の精度とInt4の量子化レベルの下で、それぞれ2048個と8192 | Quantization Level | Peak Usage for Encoding 2048 Tokens | Peak Usage for Generating 8192 Tokens | | ------------------ | :---------------------------------: | :-----------------------------------: | | BF16 | 18.99GB | 24.40GB | -| In4 | 10.20GB | 15.61GB | +| Int4 | 10.20GB | 15.61GB | 上記のスピードとメモリーのプロファイリングは、[このスクリプト](https://qianwen-res.oss-cn-beijing.aliyuncs.com/profile.py)を使用しています。