starlight-apk/Qwen
mirror of https://github.com/QwenLM/Qwen.git synced 2026-05-21 00:45:48 +08:00
3dc2d87188104cd6708d9f1fbe31d00904e12b8b
Qwen/eval
Latest commit: e7072a49c0 by Haonan Li, "add CMMLU evaluation results", 2023-08-13 20:58:52 +04:00
File | Last commit message | Commit date
evaluate_ceval.py | first commit | 2023-08-03 12:57:53 +08:00
evaluate_chat_ceval.py | add evaluation code for Qwen-7B-Chat | 2023-08-03 23:27:48 +08:00
evaluate_chat_gsm8k.py | add evaluation code for Qwen-7B-Chat | 2023-08-03 23:27:48 +08:00
evaluate_chat_humaneval.py | add evaluation code for Qwen-7B-Chat | 2023-08-03 23:27:48 +08:00
evaluate_chat_mmlu.py | add evaluation code for Qwen-7B-Chat | 2023-08-03 23:27:48 +08:00
evaluate_cmmlu.py | add CMMLU evaluation results | 2023-08-13 20:58:52 +04:00
evaluate_gsm8k.py | first commit | 2023-08-03 12:57:53 +08:00
evaluate_humaneval.py | first commit | 2023-08-03 12:57:53 +08:00
evaluate_mmlu.py | first commit | 2023-08-03 12:57:53 +08:00
evaluate_plugin.py | release the evaluation benchmark for tool use; update tool use results to that of the hf version | 2023-08-08 17:45:41 +08:00
EVALUATION.md | release the evaluation benchmark for tool use; update tool use results to that of the hf version | 2023-08-08 17:45:41 +08:00
gsm8k_prompt.txt | first commit | 2023-08-03 12:57:53 +08:00