Commit Graph

1105 Commits

Author SHA1 Message Date
hiyouga 31ffbde24d update examples 2024-04-02 20:41:49 +08:00
hiyouga 11a6c1bad6 update readme 2024-04-02 20:37:37 +08:00
hiyouga 949e5fe638 update readme 2024-04-02 20:22:11 +08:00
hiyouga 92dab8a90b simplify readme 2024-04-02 20:07:43 +08:00
hiyouga b267aeb53f add moe aux loss control #3085 2024-04-02 14:26:31 +08:00
hiyouga 9ddbe2866a fix #3022 2024-04-02 13:58:39 +08:00
hiyouga a86ae17241 Update SECURITY.md 2024-04-01 23:30:03 +08:00
hiyouga dd73a0c248 set dev version 2024-04-01 23:24:08 +08:00
hiyouga 4a6ca621c0 fix #3083 2024-04-01 22:53:52 +08:00
hiyouga 54b7d34908 add qwen1.5 moe 2024-04-01 21:49:40 +08:00
hiyouga aee634cd20 fix #3077 2024-04-01 21:35:18 +08:00
hiyouga eb259cc573 support infer 4bit model on GPUs #3023 2024-04-01 17:34:04 +08:00
hiyouga d0842f6828 update webui 2024-04-01 16:23:28 +08:00
hiyouga 816d714146 fix ORPO loss 2024-04-01 14:42:41 +08:00
hiyouga 5b9b40403d fix IPO and ORPO loss 2024-04-01 14:37:53 +08:00
hiyouga 5907216a1c fix plots 2024-03-31 19:43:48 +08:00
hiyouga 68aaa4904b use log1p in orpo loss
https://github.com/huggingface/trl/pull/1491
2024-03-31 19:27:08 +08:00
hiyouga 099db6acc0 update readme 2024-03-31 18:46:34 +08:00
hoshi-hiyouga a81d88b780
Merge pull request #3066 from hiyouga/orpo
support ORPO
2024-03-31 18:42:48 +08:00
hiyouga 5195add324 support orpo in webui 2024-03-31 18:34:59 +08:00
hiyouga 17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga 27776c3474 tiny fix 2024-03-31 00:10:29 +08:00
hoshi-hiyouga de3564ff70
Merge pull request #3057 from marko1616/bugfix/lora-model-merge
Fix Llama model save for full param train
2024-03-31 00:07:20 +08:00
marko1616 d9a5134617 fix blank line contains whitespace 2024-03-30 23:46:55 +08:00
marko1616 eb178eaff3 Fix Llama model save for full param train 2024-03-30 23:45:04 +08:00
hiyouga 7a086ed333 support save args in webui #2807 #3046
some ideas are borrowed from @marko1616
2024-03-30 23:09:12 +08:00
hoshi-hiyouga 257f643a74
Merge pull request #3053 from lealaxy/main
Fix pile dataset download config
2024-03-30 20:41:43 +08:00
hiyouga 831c5321ac upgrade gradio to 4.21.0 2024-03-30 20:37:08 +08:00
li.yunhao 9c2ef9cdf4 fix pile datset hf hub url 2024-03-30 16:06:10 +08:00
hiyouga a0333bb0ce Update wechat.jpg 2024-03-29 16:55:53 +08:00
hiyouga ca793028c6 release v0.6.1 2024-03-29 11:36:08 +08:00
hiyouga c1fe6ce782 update readme 2024-03-28 22:02:32 +08:00
hiyouga 1e43319f9c add project 2024-03-28 20:24:27 +08:00
hiyouga 8d603f8820 fix #2982 2024-03-28 20:22:31 +08:00
hiyouga 6c94305e47 update readme 2024-03-28 18:35:11 +08:00
hiyouga b19c14870d fix #3010 2024-03-28 18:31:17 +08:00
hiyouga 8c77b10912 update trainers 2024-03-28 18:16:27 +08:00
hoshi-hiyouga 3bcd41b639 fix ds optimizer 2024-03-26 23:39:56 +08:00
hiyouga b29d5560f1 fix #2981 2024-03-26 17:53:04 +08:00
hiyouga 3164b4f11b fix bug 2024-03-26 17:30:12 +08:00
hiyouga 511f675402 fix #2961 2024-03-26 17:26:14 +08:00
hiyouga 7ea1a1f5b3 Update wechat.jpg 2024-03-26 16:24:42 +08:00
hiyouga ba70aca8fb release v0.6.0 (real) 2024-03-25 23:37:48 +08:00
hiyouga 98a42cbdaa tiny fix 2024-03-25 23:28:52 +08:00
hiyouga 7b3d8188f5 update readme 2024-03-25 23:06:13 +08:00
hoshi-hiyouga f633ac6646
Merge pull request #2967 from Tsumugii24/main
Update README_zh.md
2024-03-25 23:02:22 +08:00
Tsumugii24 1704599503 Update README.md 2024-03-25 22:54:38 +08:00
Tsumugii24 7aa77a3451 Update README_zh.md 2024-03-25 22:54:26 +08:00
hiyouga 1484f76a95 add arg check 2024-03-25 22:42:58 +08:00
hiyouga 6f2b563f12 release v0.6.0 2024-03-25 22:38:56 +08:00