Logo
Explore Help
Sign In
starlight-apk/Qwen
1
0
Fork 0
You've already forked Qwen
mirror of https://github.com/QwenLM/Qwen.git synced 2026-05-21 00:45:48 +08:00
Code Issues Packages Projects Releases Wiki Activity
Files
2252578da03234607101bba8d373e284523ae49d
Qwen/eval
History
Yang An 677180a653 Merge pull request #185 from Owen-Qin/fix_ceval
fix bug for ceval
2023-08-15 17:55:23 +08:00
..
evaluate_ceval.py
fix code
2023-08-15 11:03:24 +08:00
evaluate_chat_ceval.py
add evaluation code for Qwen-7B-Chat
2023-08-03 23:27:48 +08:00
evaluate_chat_gsm8k.py
add evaluation code for Qwen-7B-Chat
2023-08-03 23:27:48 +08:00
evaluate_chat_humaneval.py
add evaluation code for Qwen-7B-Chat
2023-08-03 23:27:48 +08:00
evaluate_chat_mmlu.py
add evaluation code for Qwen-7B-Chat
2023-08-03 23:27:48 +08:00
evaluate_cmmlu.py
add CMMLU evaluation results
2023-08-13 20:58:52 +04:00
evaluate_gsm8k.py
first commit
2023-08-03 12:57:53 +08:00
evaluate_humaneval.py
first commit
2023-08-03 12:57:53 +08:00
evaluate_mmlu.py
first commit
2023-08-03 12:57:53 +08:00
evaluate_plugin.py
release the evaluation benchmark for tool use; update tool use results to that of the hf version
2023-08-08 17:45:41 +08:00
EVALUATION.md
release the evaluation benchmark for tool use; update tool use results to that of the hf version
2023-08-08 17:45:41 +08:00
gsm8k_prompt.txt
first commit
2023-08-03 12:57:53 +08:00
© 2026 starlight-apk 版权所有