JustinLin610
|
0b55158031
|
update readme
|
2023-10-07 21:40:21 +08:00 |
|
JustinLin610
|
83eac494b2
|
update readme
|
2023-10-07 11:04:26 +08:00 |
|
JustinLin610
|
b5fad3d561
|
fix single-gpu qlora, and add profiling
|
2023-10-07 10:37:42 +08:00 |
|
Junyang Lin
|
fc7e37a9e4
|
Update README.md
|
2023-10-04 13:54:40 +08:00 |
|
Yang An
|
c586c20d85
|
Update README.md
|
2023-09-29 22:19:07 +08:00 |
|
yangapku
|
3e5ade9352
|
update readme
|
2023-09-28 17:10:20 +08:00 |
|
yangapku
|
04ee3ec9eb
|
update readme
|
2023-09-26 16:59:08 +08:00 |
|
yangapku
|
26da1a2f9d
|
update kvcache
|
2023-09-25 21:16:21 +08:00 |
|
simonJJJ
|
8c02bef17d
|
qwen.cpp news
|
2023-09-25 17:59:01 +08:00 |
|
simonJJJ
|
0efa58245d
|
qwen.cpp link
|
2023-09-25 15:23:58 +08:00 |
|
Junyang Lin
|
a46024035b
|
Update README.md
typo
|
2023-09-25 14:46:15 +08:00 |
|
Junyang Lin
|
1e0821b3b1
|
Update README.md
|
2023-09-25 14:44:46 +08:00 |
|
Junyang Lin
|
111190e21e
|
Update README.md
|
2023-09-25 14:13:46 +08:00 |
|
feihu.hf
|
d201cba3f4
|
update baseline scores
|
2023-09-25 13:28:37 +08:00 |
|
季仁
|
84b62b47c4
|
update
|
2023-09-25 11:47:40 +08:00 |
|
Iurnem
|
06ba6f08ae
|
Update README.md
|
2023-09-25 11:35:01 +08:00 |
|
Iurnem
|
9de10e77e9
|
Update README.md
|
2023-09-25 11:12:08 +08:00 |
|
yangapku
|
fc57dea277
|
release latest models
|
2023-09-25 10:41:59 +08:00 |
|
Junyang Lin
|
fb52dd3308
|
Update README.md
|
2023-09-13 16:53:34 +08:00 |
|
Yang An
|
861086b66d
|
Update README.md
|
2023-09-12 11:29:32 +08:00 |
|
Keming (Luke) Lu
|
a145875018
|
Update README.md
|
2023-09-12 11:25:59 +08:00 |
|
JustinLin610
|
c5f7fa9487
|
update readme
|
2023-09-12 00:16:06 +08:00 |
|
JustinLin610
|
b1c10b956d
|
debug
|
2023-09-11 23:50:49 +08:00 |
|
JustinLin610
|
af22d5e0ce
|
add finetuning
|
2023-09-11 23:47:32 +08:00 |
|
yangapku
|
d5afb731c6
|
update readme to support easier load of model
|
2023-08-31 15:54:44 +08:00 |
|
JustinLin610
|
d76a9eb530
|
update readme
|
2023-08-30 23:29:59 +08:00 |
|
兼欣
|
9e80cc085c
|
add function calling support
|
2023-08-30 15:04:13 +08:00 |
|
yangapku
|
f1402ce523
|
update deployment in readme and cli_demo
|
2023-08-29 16:46:15 +08:00 |
|
Yang An
|
2167406b72
|
update speed profiling result after optimizing memory cost
|
2023-08-28 20:35:33 +08:00 |
|
Junyang Lin
|
97039ac230
|
Update README.md
|
2023-08-28 13:21:51 +08:00 |
|
yangapku
|
5dbbd1025b
|
update readme
|
2023-08-25 15:48:07 +08:00 |
|
yangapku
|
dad3b3a408
|
update README
|
2023-08-25 15:35:32 +08:00 |
|
yangapku
|
6f5f076ad1
|
update README
|
2023-08-25 14:54:14 +08:00 |
|
yangapku
|
d0cc30be23
|
update README
|
2023-08-25 14:52:39 +08:00 |
|
cyente
|
a3a5b3de47
|
add stop word on openai api ChatCompletion
|
2023-08-23 16:30:53 +08:00 |
|
Junyang Lin
|
4b6a2f0170
|
Update README.md
|
2023-08-23 01:28:49 +08:00 |
|
Junyang Lin
|
4ae8a9f340
|
Update README.md
|
2023-08-23 01:24:35 +08:00 |
|
Yang An
|
6446fe0437
|
Update README.md
|
2023-08-22 08:41:30 +08:00 |
|
Junyang Lin
|
f0ec7f7525
|
Update README.md
fix
|
2023-08-21 21:33:26 +08:00 |
|
yangapku
|
04f896f7d4
|
update new version of quantization and inference efficiency profiling result
|
2023-08-21 21:16:38 +08:00 |
|
JustinLin610
|
512f90a069
|
update gifs
|
2023-08-16 16:16:25 +08:00 |
|
yangapku
|
4957c33d18
|
update readme
|
2023-08-16 15:37:31 +08:00 |
|
Junyang Lin
|
70ad542cef
|
Update README.md
|
2023-08-15 15:08:13 +08:00 |
|
Haonan Li
|
e7072a49c0
|
add CMMLU evaluation results
|
2023-08-13 20:58:52 +04:00 |
|
JustinLin610
|
eca51b72cc
|
Merge remote-tracking branch 'origin/main' into update_ja_readme
|
2023-08-13 15:46:35 +08:00 |
|
Junyang Lin
|
7741635911
|
Merge pull request #177 from hanpenggit/main
参考ChatGLM2-6B的openai_api.py,适配Qwen-7B
|
2023-08-13 15:43:52 +08:00 |
|
JustinLin610
|
d7b6d26843
|
update readme
|
2023-08-13 13:22:12 +08:00 |
|
Junyang Lin
|
5379035498
|
Update README.md
|
2023-08-13 12:41:26 +08:00 |
|
Junyang Lin
|
afc571987f
|
Update README.md
|
2023-08-13 12:38:39 +08:00 |
|
yangapku
|
af6486a0a9
|
update efficiency profiling in readme
|
2023-08-12 17:17:20 +08:00 |
|