2104 Commits

Author SHA1 Message Date
beat4ocean 7b45de6b9f fix KeyError: 'lang' bug 2023-08-20 15:32:36 +08:00
hiyouga 0676497104 fix ppo trainer #551 2023-08-20 14:07:11 +08:00
hiyouga 290be836b7 Update wechat.jpg 2023-08-19 18:03:36 +08:00
hiyouga 9c9009f49f Release v0.1.7 2023-08-18 17:21:27 +08:00
hiyouga d75e377b0f tiny fix 2023-08-18 13:07:35 +08:00
hiyouga 53e33418d0 support ppo score norm (trl 0.5.1.dev required) 2023-08-18 12:02:42 +08:00
hiyouga 9020524418 fix PPO trainer #551 , update readme 2023-08-18 11:43:10 +08:00
hiyouga e4eec9ddfd update readme 2023-08-18 01:51:55 +08:00
hiyouga 10cd6c9171 Update .gitignore 2023-08-18 01:43:42 +08:00
hiyouga 58f13e22da update training resuming 2023-08-18 01:41:17 +08:00
hoshi-hiyouga 7926432d27 Merge pull request #434 from niuba/main: add last_checkpoint support 2023-08-18 01:38:31 +08:00
hoshi-hiyouga 7252903245 Merge branch 'main' into main 2023-08-18 01:37:23 +08:00
hiyouga d125218cde support bf16 ppo #551 2023-08-18 00:40:32 +08:00
hiyouga 9f4c2adc9a fix ChatGLM2 ppo #527 #528 2023-08-18 00:34:59 +08:00
hiyouga be21fc83f9 fix generation bug #532 2023-08-17 22:21:34 +08:00
hiyouga b0ed0dec5e fix streaming in pt stage #548 #549 2023-08-17 17:59:26 +08:00
hiyouga ff0aa793b6 update readme 2023-08-17 11:00:22 +08:00
hiyouga 892fd39373 fix baichuan and intern template 2023-08-17 01:27:20 +08:00
hiyouga d9e62711a3 fix generation 2023-08-16 22:39:54 +08:00
hiyouga 7407d9daa1 fix system prompt 2023-08-16 01:35:52 +08:00
hiyouga 273135f595 fix baichuan template #481 2023-08-15 11:38:21 +08:00
hoshi-hiyouga 7f35487c4a Merge pull request #516 from liuyanyi/add_gitignore: [Enhance] Add .gitignore file 2023-08-15 11:25:40 +08:00
hiyouga af6c011fcb fix ChatGLM RLHF 2023-08-15 11:19:20 +08:00
hiyouga a7dd9611db Update wechat.jpg 2023-08-15 11:13:46 +08:00
Yanyi Liu 448478f938 Add .gitignore 2023-08-15 11:13:45 +08:00
hiyouga 80b4053602 alert pad_token source 2023-08-15 00:07:56 +08:00
hiyouga 9d0f6214b6 update webui 2023-08-14 22:45:26 +08:00
hoshi-hiyouga adb0f186e9 Merge pull request #511 from hiyouga/feature-autoTemplate: add template match and stage in webui 2023-08-14 22:44:04 +08:00
codemayq 0bf892ff1a auto match template when change model_name 2023-08-14 20:56:05 +08:00
codemayq 79c68e5527 add template match and stage in webui 2023-08-14 20:42:59 +08:00
hiyouga d019956808 fix ChatGLM lm_head #494 2023-08-14 14:14:48 +08:00
hiyouga 20a29297b1 fix bug in webui 2023-08-14 11:38:42 +08:00
hiyouga ca08e5efd3 fix webui cache 2023-08-14 11:37:01 +08:00
hiyouga 2391a84e26 update readme_zh 2023-08-14 11:13:25 +08:00
hiyouga ec94274ca1 web UI integrating RLHF 2023-08-14 10:48:47 +08:00
hiyouga 2f2fd55d81 fix #480 2023-08-14 00:23:56 +08:00
hiyouga d69b1388e6 fix webui 2023-08-12 23:52:07 +08:00
hiyouga 9dc6a296e3 tiny fix 2023-08-12 22:02:43 +08:00
hiyouga 8545c11c45 fix rope scaling 2023-08-12 22:00:01 +08:00
hiyouga 8a79ded55d update readme 2023-08-12 21:29:06 +08:00
hiyouga 3ea1fa35d1 update readme 2023-08-12 21:25:19 +08:00
hiyouga 2618e0b5a7 update readme 2023-08-12 21:23:05 +08:00
hiyouga 1836c020c5 update readme 2023-08-12 21:00:11 +08:00
hiyouga fa940c17b8 support rope scaling, fix #475 #476 #478 2023-08-12 20:46:27 +08:00
hoshi-hiyouga 2eb0eca65f Merge pull request #479 from hiyouga/feature-addCmdExport: add sft script preview in webui 2023-08-12 20:41:52 +08:00
codemayq 6bc8e9866d add sft script preview in webui 2023-08-12 13:53:55 +08:00
hiyouga dd51c24203 fix unusual output of 8bit models #278 #391 2023-08-12 00:25:29 +08:00
hiyouga a48cb0d474 Release v0.1.6 2023-08-11 23:25:57 +08:00
hiyouga 156710a995 Update README_zh.md 2023-08-11 14:06:02 +08:00
hiyouga d3844e97e3 add defaults 2023-08-11 13:56:26 +08:00