add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support

2026-05-20 16:35:47 +08:00 · 2023-11-30 15:43:00 +08:00
parent e8e15962d8
commit b1d80a9385
5 changed files with 20 additions and 20 deletions
--- a/README.md
+++ b/README.md
@@ -30,10 +30,10 @@ In brief, we have strong base language models, which have been stably pretrained

 | Model     | Release Date | Max Length | System Prompt Enhancement | # of Pretrained Tokens | Minimum GPU Memory Usage of Finetuning (Q-Lora) | Minimum GPU Usage of Generating 2048 Tokens (Int4) | Tool Usage |
 |:----------|:------------:|:----------:|:-------------------------:|:----------------------:|:-----------------------------------------------:|:--------------------------------------------------:|:----------:|
-| Qwen-1.8B |   23.11.30   |    32K     |             √             |          2.2T          |                      5.8GB                      |                       2.9GB                        |     √      |  
-| Qwen-7B   |   23.08.03   |    32K     |             ×             |          2.4T          |                     11.5GB                      |                       8.2GB                        |     √      |   
-| Qwen-14B  |   23.09.25   |     8K     |             ×             |          3.0T          |                     18.7GB                      |                       13.0GB                       |     √      |
-| Qwen-72B  |   23.11.30   |    32K     |             √             |          3.0T          |                     61.4GB                      |                       48.9GB                       |     √      |   
+| Qwen-1.8B |   23.11.30   |    32K     |             ✅             |          2.2T          |                      5.8GB                      |                       2.9GB                        |     ✅      |  
+| Qwen-7B   |   23.08.03   |    32K     |             ❎             |          2.4T          |                     11.5GB                      |                       8.2GB                        |     ✅      |   
+| Qwen-14B  |   23.09.25   |     8K     |             ❎             |          3.0T          |                     18.7GB                      |                       13.0GB                       |     ✅      |
+| Qwen-72B  |   23.11.30   |    32K     |             ✅             |          3.0T          |                     61.4GB                      |                       48.9GB                       |     ✅      |   

 In this repo, you can figure out: