Logo
Explore Help
Sign In
starlight-apk/Qwen
1
0
Fork 0
You've already forked Qwen
mirror of https://github.com/QwenLM/Qwen.git synced 2026-05-20 16:35:47 +08:00
Code Issues Packages Projects Releases Wiki Activity
Files
1c282d421ce2e5aa8b2888425a41d0669a10a110
Qwen/eval
History
Sean 2b565da220 Update evaluate_plugin.py
change Old Evaluation Dataset (Version 20230803) to new version
2024-02-17 17:27:49 +08:00
..
evaluate_ceval.py
update evaluate scripts
2023-10-30 19:13:14 +08:00
evaluate_chat_ceval.py
specify repetition penalty
2023-10-13 11:44:48 +08:00
evaluate_chat_gsm8k.py
add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support
2023-11-30 15:29:13 +08:00
evaluate_chat_humaneval.py
specify repetition penalty
2023-10-13 11:44:48 +08:00
evaluate_chat_mmlu.py
add 72B and 1.8B Qwen models, add Ascend 910 and Hygon DCU support, add docker support
2023-11-30 15:29:13 +08:00
evaluate_cmmlu.py
update evaluate scripts
2023-10-30 19:13:14 +08:00
evaluate_gsm8k.py
fix format problems in evaluation code; update ceval extraction rules
2023-08-25 22:44:07 +08:00
evaluate_humaneval.py
fix format problems in evaluation code; update ceval extraction rules
2023-08-25 22:44:07 +08:00
evaluate_mmlu.py
update evaluate scripts
2023-10-30 19:13:14 +08:00
evaluate_plugin.py
Update evaluate_plugin.py
2024-02-17 17:27:49 +08:00
EVALUATION.md
update agent benchmarks and add qwen-72b results
2023-12-06 12:57:11 +08:00
gsm8k_prompt.txt
first commit
2023-08-03 12:57:53 +08:00
© 2026 starlight-apk 版权所有