Commit Graph

724 Commits

Author SHA1 Message Date
hiyouga d019956808 fix ChatGLM lm_head #494 2023-08-14 14:14:48 +08:00
hiyouga 20a29297b1 fix bug in webui 2023-08-14 11:38:42 +08:00
hiyouga ca08e5efd3 fix webui cache 2023-08-14 11:37:01 +08:00
hiyouga 2391a84e26 update readme_zh 2023-08-14 11:13:25 +08:00
hiyouga ec94274ca1 web UI integrating RLHF 2023-08-14 10:48:47 +08:00
hiyouga 2f2fd55d81 fix #480 2023-08-14 00:23:56 +08:00
hiyouga d69b1388e6 fix webui 2023-08-12 23:52:07 +08:00
hiyouga 9dc6a296e3 tiny fix 2023-08-12 22:02:43 +08:00
hiyouga 8545c11c45 fix rope scaling 2023-08-12 22:00:01 +08:00
hiyouga 8a79ded55d update readme 2023-08-12 21:29:06 +08:00
hiyouga 3ea1fa35d1 update readme 2023-08-12 21:25:19 +08:00
hiyouga 2618e0b5a7 update readme 2023-08-12 21:23:05 +08:00
hiyouga 1836c020c5 update readme 2023-08-12 21:00:11 +08:00
hiyouga fa940c17b8 support rope scaling, fix #475 #476 #478 2023-08-12 20:46:27 +08:00
hoshi-hiyouga 2eb0eca65f
Merge pull request #479 from hiyouga/feature-addCmdExport
add sft script preview in webui
2023-08-12 20:41:52 +08:00
codemayq 6bc8e9866d add sft script preview in webui 2023-08-12 13:53:55 +08:00
hiyouga dd51c24203 fix unusual output of 8bit models #278 #391 2023-08-12 00:25:29 +08:00
hiyouga a48cb0d474 Release v0.1.6 2023-08-11 23:25:57 +08:00
hiyouga 156710a995 Update README_zh.md 2023-08-11 14:06:02 +08:00
hiyouga d3844e97e3 add defaults 2023-08-11 13:56:26 +08:00
hiyouga d59f938959 fix stop word in baichuan template 2023-08-11 13:51:46 +08:00
hiyouga 9c6dd10514 fix baichuan template 2023-08-11 13:45:47 +08:00
hiyouga 3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
hoshi-hiyouga 685dae4eff
Merge pull request #451 from jovialchen/main
huggingface login for projects must login while running
2023-08-10 17:25:38 +08:00
hiyouga ad6e7c76c7 fix webui val size 2023-08-10 15:20:44 +08:00
jiongxuc 3e000c2b60 huggingface login for projects must login while running 2023-08-10 14:57:12 +08:00
hiyouga eb6e571cb7 fix template 2023-08-09 23:14:27 +08:00
hiyouga ac29f4d5f0 fix template 2023-08-09 23:10:20 +08:00
hiyouga d86ea314a1 support val set in streaming mode 2023-08-09 23:00:26 +08:00
hiyouga 572ea3bafb fix tokenizer 2023-08-09 17:52:15 +08:00
hiyouga ef5b299b18 Update wechat.jpg 2023-08-09 17:36:17 +08:00
niuba 2ec68d3398 add last_checkpoint support 2023-08-09 16:39:27 +08:00
hiyouga df946e6949 fix sft trainer 2023-08-09 16:35:03 +08:00
hiyouga 39cd8b6989 fix rm #420, fix template #426, fix #423 2023-08-09 16:23:31 +08:00
hoshi-hiyouga 2d90685358
fix llama2 template 2023-08-09 00:58:27 +08:00
hoshi-hiyouga 32fa5e8d70
fix tokenizer 2023-08-09 00:54:54 +08:00
hiyouga 3a720aac66 update webui 2023-08-09 00:26:11 +08:00
hiyouga eecc4b2131 fix tokenizer #417 2023-08-08 23:59:41 +08:00
hiyouga caa0eda27d fix bug 2023-08-08 21:28:28 +08:00
hiyouga 4b841a6b35 fix bug 2023-08-08 17:55:55 +08:00
hiyouga a9980617f5 fix chatml template #408 2023-08-08 17:44:39 +08:00
hiyouga 5453b93db0 update args spec 2023-08-07 15:23:35 +08:00
hiyouga 20cf27976f update readme 2023-08-07 15:02:02 +08:00
hiyouga cacd5b703d Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning 2023-08-07 13:59:16 +08:00
hiyouga 081345baca fix #376 2023-08-07 13:58:59 +08:00
hoshi-hiyouga da42d289ee
Merge pull request #382 from hiyouga/feature-updateReadme
add detailed model configs
2023-08-07 13:43:38 +08:00
hiyouga 220175ab24 update trainer 2023-08-07 13:34:35 +08:00
codemayq 293bd95712 add detailed model configs 2023-08-07 09:30:23 +08:00
hiyouga e21ae01356 fix qwen eos token 2023-08-06 13:31:17 +08:00
hiyouga 7f18d2a335 fix qwen tokenizer #361 2023-08-05 17:06:05 +08:00