194 Commits

Author SHA1 Message Date
hiyouga 0cb260f453 update readme 2023-12-01 22:58:29 +08:00
hiyouga bd42c229b0 patch modelscope 2023-12-01 22:53:15 +08:00
hoshi-hiyouga 00f5c9ee16 Merge branch 'main' into feat/support_ms 2023-12-01 20:23:46 +08:00
yuze.zyz 5aa6751e52 add readme 2023-12-01 16:11:30 +08:00
hiyouga bf6f6aeefe fix #1696 2023-12-01 15:34:50 +08:00
hiyouga 509abe8864 add models 2023-11-30 19:16:13 +08:00
hiyouga 9d38e5687d add gpu requirement #1657 2023-11-29 12:05:03 +08:00
hiyouga 5085b00a1d update readme 2023-11-21 13:15:46 +08:00
hiyouga 9ea9380145 support GPTQ tuning #729 #1481 #1545 , fix chatglm template #1453 #1480 #1569 2023-11-20 22:52:11 +08:00
hiyouga 5021062493 update ppo trainer 2023-11-20 21:39:15 +08:00
hoshi-hiyouga 48211e3799 Merge pull request #1553 from hannlp/hans: Change the default argument settings for PPO training 2023-11-20 20:32:55 +08:00
hiyouga a2019c8b61 update benchmark 2023-11-18 11:30:01 +08:00
hiyouga 90212280d6 update readme 2023-11-18 11:15:56 +08:00
hiyouga 329134f58c add benchmark 2023-11-18 11:09:52 +08:00
Yuchen Han 7cab47b822 Update README_zh.md 2023-11-17 00:18:07 -08:00
hiyouga 72e6699547 update readme 2023-11-16 15:58:37 +08:00
hiyouga ce78303600 support full-parameter PPO 2023-11-16 02:08:04 +08:00
hiyouga 8350bcf85d add demo mode for web UI 2023-11-15 23:51:26 +08:00
hiyouga 1e19cf242a update readme and constants 2023-11-15 18:04:37 +08:00
hiyouga 88ab33254e fix dc link 2023-11-13 23:22:56 +08:00
hiyouga 442aefb925 refactor evaluation, upgrade trl to 074 2023-11-13 22:20:35 +08:00
hiyouga 3697a3dc9a refactor constants 2023-11-10 14:16:10 +08:00
hiyouga b3572659f5 update readme 2023-11-09 16:00:24 +08:00
hiyouga e1e04cb1f1 update readme (list in alphabetical order) 2023-11-06 17:18:12 +08:00
hiyouga a7eeb8e17c update templates 2023-11-06 12:25:47 +08:00
hiyouga cc8ffa10d8 update data readme (zh) 2023-11-02 23:42:49 +08:00
hiyouga a837172413 support sharegpt format, add datasets 2023-11-02 23:10:04 +08:00
hiyouga 640a520108 update projects 2023-10-29 22:53:47 +08:00
hiyouga 59f342e76f add projects 2023-10-29 22:07:13 +08:00
hiyouga 52fc24d166 fix vicuna template 2023-10-27 22:15:25 +08:00
hiyouga 4600c29e93 update readme 2023-10-27 19:19:03 +08:00
hiyouga 1c0ab9a908 support chatglm3 2023-10-27 19:16:28 +08:00
hiyouga 7b4acf7265 reimplement neftune 2023-10-22 16:15:08 +08:00
anvie 57fb40aa04 add NEFTune optimization 2023-10-21 13:24:10 +07:00
hiyouga b665e9e133 fix #1232 2023-10-20 23:28:52 +08:00
hiyouga 6496a99b7d fix #1217 2023-10-19 15:52:24 +08:00
hoshi-hiyouga 5f83a6e72c Update README_zh.md 2023-10-16 00:28:27 +08:00
hiyouga f5d0da4d2a update readme 2023-10-15 20:28:14 +08:00
hiyouga cb42676694 update readme 2023-10-13 13:53:43 +08:00
hiyouga c4102f306a update discord link 2023-10-12 21:44:28 +08:00
hiyouga 197c754d73 rename repository 2023-10-12 21:42:29 +08:00
hiyouga 8e2ed6b8ce update readme 2023-10-09 20:02:50 +08:00
hiyouga d11a545463 fix #1068 #1074 2023-09-28 14:39:16 +08:00
hiyouga 4eae061464 update readme 2023-09-27 21:57:47 +08:00
hiyouga 90375f600d support LongLoRA 2023-09-27 21:55:50 +08:00
hiyouga 4dd9b4d982 add CMMLU, update eval script 2023-09-23 21:10:17 +08:00
hiyouga badd2735b5 move file 2023-09-23 11:52:12 +08:00
hiyouga 465ee8119a add MMLU and C-Eval script 2023-09-23 00:34:17 +08:00
hiyouga 5cc7a44784 fix #1000 2023-09-22 15:00:48 +08:00
hiyouga 044d4425b4 update readme 2023-09-22 14:34:13 +08:00
hiyouga ace3f85a72 tiny fix 2023-09-21 15:25:29 +08:00
hiyouga acda45e463 update readme 2023-09-16 17:33:01 +08:00
hiyouga 026af87e7f add MathInstruct dataset 2023-09-13 22:30:14 +08:00
hiyouga d4be857e23 fix #762 #814 2023-09-12 16:10:10 +08:00
hiyouga ccb3553576 Release v0.1.8 2023-09-11 17:31:34 +08:00
hiyouga baac22f4f4 truncate readme 2023-09-10 21:04:20 +08:00
hiyouga 63611de7ae update readme 2023-09-10 21:01:20 +08:00
hiyouga 34005252df update readme 2023-09-10 20:52:21 +08:00
hiyouga d8aa1404be support FlashAttention2 2023-09-10 20:43:56 +08:00
hiyouga bca1a247bc support lora target auto find 2023-09-09 15:38:37 +08:00
hiyouga d8d82ca281 fix chatglm2 tokenizer 2023-09-09 13:50:29 +08:00
hiyouga 85b1f6632a fix baichuan templates 2023-09-07 18:54:14 +08:00
hiyouga 0531886e1f update baichuan2 template 2023-09-06 21:43:06 +08:00
hiyouga 60603a94c6 add Baichuan2 models 2023-09-06 18:40:11 +08:00
hiyouga a9d1fb72f7 refactor dataset_attr, add eos in pt, fix #757 2023-09-01 19:00:45 +08:00
codemayq 604f85487b add ad gen dataset 2023-08-27 20:35:32 +08:00
hiyouga 4318347d3f update template 2023-08-22 19:46:09 +08:00
hiyouga 9020524418 fix PPO trainer #551 , update readme 2023-08-18 11:43:10 +08:00
hiyouga e4eec9ddfd update readme 2023-08-18 01:51:55 +08:00
hiyouga 58f13e22da update training resuming 2023-08-18 01:41:17 +08:00
hiyouga ff0aa793b6 update readme 2023-08-17 11:00:22 +08:00
hiyouga 2391a84e26 update readme_zh 2023-08-14 11:13:25 +08:00
hiyouga 8a79ded55d update readme 2023-08-12 21:29:06 +08:00
hiyouga 3ea1fa35d1 update readme 2023-08-12 21:25:19 +08:00
hiyouga 2618e0b5a7 update readme 2023-08-12 21:23:05 +08:00
hiyouga 1836c020c5 update readme 2023-08-12 21:00:11 +08:00
hiyouga 156710a995 Update README_zh.md 2023-08-11 14:06:02 +08:00
hiyouga 3ec4351cfd support DPO training (2305.18290) 2023-08-11 03:02:53 +08:00
hiyouga 20cf27976f update readme 2023-08-07 15:02:02 +08:00
codemayq 293bd95712 add detailed model configs 2023-08-07 09:30:23 +08:00
hiyouga 87f8f830e2 support Qwen-7B, fix InternLM-7B inference 2023-08-03 15:53:32 +08:00
hiyouga c689857bbb release v0.1.5 2023-08-02 16:10:31 +08:00
hiyouga ccde51c5ea update readme 2023-08-01 18:48:27 +08:00
hiyouga ac88ce5233 fix RM save model 2023-08-01 11:56:17 +08:00
hiyouga 973a638665 release v0.1.4 2023-08-01 10:08:47 +08:00
hiyouga 62dca5bb82 update readme 2023-07-31 23:42:32 +08:00
hiyouga 0411a4b3e1 support streaming data, fix #284 #274 #268 2023-07-31 23:33:00 +08:00
hiyouga 5ee87138e4 update readme 2023-07-28 17:36:00 +08:00
hiyouga f5c2ccdde4 update dataset 2023-07-26 17:05:12 +08:00
hiyouga 00efa8a07f fix #242 2023-07-25 17:04:02 +08:00
hiyouga 182b425043 update dataset 2023-07-23 20:01:43 +08:00
hiyouga 6a2967ff7a Update README_zh.md 2023-07-22 14:31:16 +08:00
hiyouga 035c966d5c update readme, fix web ui postprocess 2023-07-22 14:29:22 +08:00
mrhan1993 9f0b57b370 add Chinese README based on GLM Efficient Tuning, add server_port to web 2023-07-21 16:57:58 +08:00