```python
from utils import load_model_on_gpus
model = load_model_on_gpus('Qwen/Qwen-7B-Chat', num_gpus=2)
```
Then you can run the 7B chat model on 2 GPUs using the above scripts.
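As a quick check, here is a minimal sketch of chatting with the model once it is sharded across the two GPUs, assuming the tokenizer is created the usual way with `AutoTokenizer`:

```python
from transformers import AutoTokenizer
from utils import load_model_on_gpus

tokenizer = AutoTokenizer.from_pretrained('Qwen/Qwen-7B-Chat', trust_remote_code=True)
model = load_model_on_gpus('Qwen/Qwen-7B-Chat', num_gpus=2)

# model.chat returns the reply together with the updated conversation history.
response, history = model.chat(tokenizer, "Hi, introduce yourself.", history=None)
print(response)
```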
<br><br>
We also provide a pure C++ implementation of Qwen-LM and tiktoken; see [qwen.cpp](https://github.com/QwenLM/qwen.cpp) for details.
## Tool Usage
Qwen-Chat has been optimized for tool usage and function calling. Users can develop agents and LangChain applications, and even augment Qwen with a Python Code Interpreter.
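As an illustration, here is a minimal sketch of the ReAct-style prompting that underlies such tool use. The tool schema and prompt template below are simplified stand-ins rather than the exact format Qwen-Chat was trained with, and the `weather` tool is hypothetical:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical tool description; the canonical prompt format lives in this repo's examples.
TOOL = {
    "name": "weather",
    "description": "Look up the current weather for a city.",
    "parameters": '{"city": "string"}',
}

REACT_TEMPLATE = """Answer the following question. You have access to this tool:

{name}: {description} Parameters: {parameters}

Use the following format:
Question: the input question
Thought: what to do next
Action: the tool to use, one of [{name}]
Action Input: the tool input as JSON
Observation: the tool result
... (Thought/Action/Action Input/Observation can repeat)
Final Answer: the final answer

Question: {query}"""

tokenizer = AutoTokenizer.from_pretrained('Qwen/Qwen-7B-Chat', trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    'Qwen/Qwen-7B-Chat', device_map='auto', trust_remote_code=True
).eval()

prompt = REACT_TEMPLATE.format(query="What's the weather in Beijing?", **TOOL)

# In a full agent loop you would stop generation at "Observation:", execute the
# tool yourself, append its output to the prompt, and continue; here we only
# inspect the model's first Thought/Action/Action Input proposal.
response, _ = model.chat(tokenizer, prompt, history=None)
print(response)
```

For the exact prompts and a complete agent loop, refer to the ReAct prompting examples in this repository.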