兼欣
|
508acdeb88
|
add openai version requirement (openai<1.0)
|
2023-12-21 10:37:27 +08:00 |
|
feihu.hf
|
b7eb73d6ec
|
update readme for vllm-gptq
|
2023-12-14 16:25:00 +08:00 |
|
兼欣
|
cadc4c7d1a
|
fix typo
|
2023-12-06 14:14:35 +08:00 |
|
兼欣
|
7eb9016908
|
update agent benchmarks and add qwen-72b results
|
2023-12-06 12:57:11 +08:00 |
|
yangapku
|
c4fdd89d20
|
update README
|
2023-11-30 19:27:39 +08:00 |
|
yangapku
|
b1d80a9385
|
add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support
|
2023-11-30 15:43:00 +08:00 |
|
yangapku
|
e8e15962d8
|
add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support
|
2023-11-30 15:29:13 +08:00 |
|
lukeming.lkm
|
845dc08474
|
add modelscope links for int8 models
|
2023-11-06 11:47:32 +08:00 |
|
Junyang Lin
|
d082c2c926
|
Update README.md
|
2023-10-20 21:50:15 +08:00 |
|
JustinLin610
|
c908968cea
|
update readme
|
2023-10-20 01:50:47 +08:00 |
|
JustinLin610
|
899bc5bb98
|
update news
|
2023-10-17 20:50:35 +08:00 |
|
JustinLin610
|
e6d8deb975
|
add french readme
|
2023-10-17 20:28:36 +08:00 |
|
yangapku
|
93963f8d1f
|
add result of int8 models
|
2023-10-17 19:56:11 +08:00 |
|
JustinLin610
|
235aa8f71e
|
update readme
|
2023-10-16 16:55:27 +08:00 |
|
yangapku
|
78352b5a79
|
update readme about batch inference
|
2023-10-14 21:36:53 +08:00 |
|
Wang Peng
|
c73a065849
|
Update README.md, update batch infer
|
2023-10-12 13:29:06 +08:00 |
|
Junyang Lin
|
4eee29e790
|
Merge pull request #442 from QwenLM/logicwong-patch-2
Update README.md, add batch inference
|
2023-10-11 21:10:08 +08:00 |
|
lukeming.lkm
|
e6f2a7af6d
|
update readme
|
2023-10-11 19:09:26 +08:00 |
|
Wang Peng
|
bef488ba2c
|
Update README.md, add batch inference
|
2023-10-11 15:04:58 +08:00 |
|
Yang An
|
1d5f3503fb
|
Update README.md
|
2023-10-10 17:46:21 +08:00 |
|
JustinLin610
|
ce1ca46099
|
update readme
|
2023-10-09 01:03:27 +08:00 |
|
Junyang Lin
|
12e4c8bda5
|
Update README.md
|
2023-10-08 16:33:12 +08:00 |
|
Junyang Lin
|
c7cf15dbdc
|
Update README.md
|
2023-10-08 15:35:07 +08:00 |
|
Junyang Lin
|
581512f6b5
|
Update README.md
|
2023-10-08 15:32:23 +08:00 |
|
Junyang Lin
|
ee5350521e
|
Update README.md
|
2023-10-08 10:24:35 +08:00 |
|
JustinLin610
|
3261c62f74
|
update citation
|
2023-10-08 00:41:56 +08:00 |
|
JustinLin610
|
360fca3f87
|
add citation
|
2023-10-08 00:36:40 +08:00 |
|
JustinLin610
|
6e987235d8
|
update readme
|
2023-10-07 21:54:57 +08:00 |
|
JustinLin610
|
0b55158031
|
update readme
|
2023-10-07 21:40:21 +08:00 |
|
JustinLin610
|
83eac494b2
|
update readme
|
2023-10-07 11:04:26 +08:00 |
|
JustinLin610
|
b5fad3d561
|
fix single-gpu qlora, and add profiling
|
2023-10-07 10:37:42 +08:00 |
|
Junyang Lin
|
fc7e37a9e4
|
Update README.md
|
2023-10-04 13:54:40 +08:00 |
|
Yang An
|
c586c20d85
|
Update README.md
|
2023-09-29 22:19:07 +08:00 |
|
yangapku
|
3e5ade9352
|
update readme
|
2023-09-28 17:10:20 +08:00 |
|
yangapku
|
04ee3ec9eb
|
update readme
|
2023-09-26 16:59:08 +08:00 |
|
yangapku
|
26da1a2f9d
|
update kvcache
|
2023-09-25 21:16:21 +08:00 |
|
simonJJJ
|
8c02bef17d
|
qwen.cpp news
|
2023-09-25 17:59:01 +08:00 |
|
simonJJJ
|
0efa58245d
|
qwen.cpp link
|
2023-09-25 15:23:58 +08:00 |
|
Junyang Lin
|
a46024035b
|
Update README.md
typo
|
2023-09-25 14:46:15 +08:00 |
|
Junyang Lin
|
1e0821b3b1
|
Update README.md
|
2023-09-25 14:44:46 +08:00 |
|
Junyang Lin
|
111190e21e
|
Update README.md
|
2023-09-25 14:13:46 +08:00 |
|
feihu.hf
|
d201cba3f4
|
update baseline scores
|
2023-09-25 13:28:37 +08:00 |
|
季仁
|
84b62b47c4
|
update
|
2023-09-25 11:47:40 +08:00 |
|
Iurnem
|
06ba6f08ae
|
Update README.md
|
2023-09-25 11:35:01 +08:00 |
|
Iurnem
|
9de10e77e9
|
Update README.md
|
2023-09-25 11:12:08 +08:00 |
|
yangapku
|
fc57dea277
|
release latest models
|
2023-09-25 10:41:59 +08:00 |
|
Junyang Lin
|
fb52dd3308
|
Update README.md
|
2023-09-13 16:53:34 +08:00 |
|
Yang An
|
861086b66d
|
Update README.md
|
2023-09-12 11:29:32 +08:00 |
|
Keming (Luke) Lu
|
a145875018
|
Update README.md
|
2023-09-12 11:25:59 +08:00 |
|
JustinLin610
|
c5f7fa9487
|
update readme
|
2023-09-12 00:16:06 +08:00 |
|