127 Commits

Author SHA1 Message Date
Junyang Lin
aa862758f3 Update README.md 2024-03-10 16:45:56 +08:00
Junyang Lin
cdf7ae5d37 Update README.md 2024-03-03 02:05:45 +08:00
Junyang Lin
85cb093f20 Update README.md 2024-02-06 00:26:40 +08:00
Ren Xuancheng
d40742b004 Update README.md (Added notes due to recent peft update.) 2024-02-01 20:41:31 +08:00
Wang Peng
1c34702e82 Update README.md 2024-01-31 12:19:05 +08:00
yangapku
29fea23f87 update README 2024-01-09 19:28:24 +08:00
苏阳
23a01b0696 Add Docker image for CUDA-12.1. 2024-01-08 14:22:05 +08:00
苏阳
35023b6f2a Add multinode finetuning section into README. 2023-12-27 11:15:38 +08:00
feihu.hf
ea86f6136a add run gptq 2023-12-25 20:24:57 +08:00
兼欣
508acdeb88 add openai version requirement (openai<1.0) 2023-12-21 10:37:27 +08:00
feihu.hf
b7eb73d6ec update readme for vllm-gptq 2023-12-14 16:25:00 +08:00
兼欣
cadc4c7d1a fix typo 2023-12-06 14:14:35 +08:00
兼欣
7eb9016908 update agent benchmarks and add qwen-72b results 2023-12-06 12:57:11 +08:00
yangapku
c4fdd89d20 update README 2023-11-30 19:27:39 +08:00
yangapku
b1d80a9385 add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support 2023-11-30 15:43:00 +08:00
yangapku
e8e15962d8 add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support 2023-11-30 15:29:13 +08:00
lukeming.lkm
845dc08474 add modelscope links for int8 models 2023-11-06 11:47:32 +08:00
Junyang Lin
d082c2c926 Update README.md 2023-10-20 21:50:15 +08:00
JustinLin610
c908968cea update readme 2023-10-20 01:50:47 +08:00
JustinLin610
899bc5bb98 update news 2023-10-17 20:50:35 +08:00
JustinLin610
e6d8deb975 add french readme 2023-10-17 20:28:36 +08:00
yangapku
93963f8d1f add result of int8 models 2023-10-17 19:56:11 +08:00
JustinLin610
235aa8f71e update readme 2023-10-16 16:55:27 +08:00
yangapku
78352b5a79 update readme about batch inference 2023-10-14 21:36:53 +08:00
Wang Peng
c73a065849 Update README.md, update batch infer 2023-10-12 13:29:06 +08:00
Junyang Lin
4eee29e790 Merge pull request #442 from QwenLM/logicwong-patch-2 (Update README.md, add batch inference) 2023-10-11 21:10:08 +08:00
lukeming.lkm
e6f2a7af6d update readme 2023-10-11 19:09:26 +08:00
Wang Peng
bef488ba2c Update README.md, add batch inference 2023-10-11 15:04:58 +08:00
Yang An
1d5f3503fb Update README.md 2023-10-10 17:46:21 +08:00
JustinLin610
ce1ca46099 update readme 2023-10-09 01:03:27 +08:00
Junyang Lin
12e4c8bda5 Update README.md 2023-10-08 16:33:12 +08:00
Junyang Lin
c7cf15dbdc Update README.md 2023-10-08 15:35:07 +08:00
Junyang Lin
581512f6b5 Update README.md 2023-10-08 15:32:23 +08:00
Junyang Lin
ee5350521e Update README.md 2023-10-08 10:24:35 +08:00
JustinLin610
3261c62f74 update citation 2023-10-08 00:41:56 +08:00
JustinLin610
360fca3f87 add citation 2023-10-08 00:36:40 +08:00
JustinLin610
6e987235d8 update readme 2023-10-07 21:54:57 +08:00
JustinLin610
0b55158031 update readme 2023-10-07 21:40:21 +08:00
JustinLin610
83eac494b2 update readme 2023-10-07 11:04:26 +08:00
JustinLin610
b5fad3d561 fix single-gpu qlora, and add profiling 2023-10-07 10:37:42 +08:00
Junyang Lin
fc7e37a9e4 Update README.md 2023-10-04 13:54:40 +08:00
Yang An
c586c20d85 Update README.md 2023-09-29 22:19:07 +08:00
yangapku
3e5ade9352 update readme 2023-09-28 17:10:20 +08:00
yangapku
04ee3ec9eb update readme 2023-09-26 16:59:08 +08:00
yangapku
26da1a2f9d update kvcache 2023-09-25 21:16:21 +08:00
simonJJJ
8c02bef17d qwen.cpp news 2023-09-25 17:59:01 +08:00
simonJJJ
0efa58245d qwen.cpp link 2023-09-25 15:23:58 +08:00
Junyang Lin
a46024035b Update README.md (typo) 2023-09-25 14:46:15 +08:00
Junyang Lin
1e0821b3b1 Update README.md 2023-09-25 14:44:46 +08:00
Junyang Lin
111190e21e Update README.md 2023-09-25 14:13:46 +08:00