Commit Graph

263 Commits

Author SHA1 Message Date
hiyouga e83e2fa897 update readme 2024-04-26 05:49:26 +08:00
hiyouga 27ba1b63ce update readme 2024-04-26 05:44:30 +08:00
hiyouga 44a43ee152 add olmo 1.7 2024-04-24 05:50:50 +08:00
hiyouga 07737a3d2d reenable sdpa and fast tok by default 2024-04-24 02:18:44 +08:00
hiyouga 1a13f05555 support phi-3 2024-04-24 00:28:53 +08:00
hiyouga db7f3b9784 update readme 2024-04-22 17:09:17 +08:00
hiyouga 836ca05586 update readme 2024-04-22 00:51:35 +08:00
hiyouga 34d66a3a85 update readme 2024-04-22 00:42:25 +08:00
hiyouga a1f1fac33b update readme and examples 2024-04-22 00:37:32 +08:00
hiyouga a83e7587a0 update readme 2024-04-22 00:21:01 +08:00
hiyouga f58425ab45 fix mod stuff 2024-04-21 18:11:10 +08:00
Marco 620add7b9f Added Mixture of Depths 2024-04-18 20:31:24 +02:00
hoshi-hiyouga 2aaaede247 support llama3 2024-04-19 01:13:50 +08:00
hiyouga 942362d008 fix #3324 2024-04-18 15:34:45 +08:00
hiyouga 3b43a3b7c5 tiny fix 2024-04-18 00:22:17 +08:00
hiyouga e2f1c6fc6a update readme 2024-04-17 23:40:49 +08:00
hiyouga cab0598fd0 add mixtral 8x22B models 2024-04-17 23:35:59 +08:00
hiyouga 5d62a51c12 update readme and gradio version 2024-04-16 18:09:16 +08:00
hiyouga e3d8fc75eb support badam for all stages 2024-04-16 17:44:48 +08:00
hiyouga cf52911fed update readme 2024-04-16 02:36:54 +08:00
hiyouga 6084eb7cf1 update readme 2024-04-16 02:35:36 +08:00
hiyouga 6543f3d449 add codegemma 2024-04-16 00:11:15 +08:00
hiyouga e0dbac2845 support cohere commandR #3184 2024-04-15 23:26:42 +08:00
hiyouga 9d4c949461 release v0.6.2 2024-04-11 20:08:51 +08:00
hiyouga a88fe8c1af update readme 2024-04-07 00:48:24 +08:00
hiyouga 7f6e412604 fix requires for windows 2024-04-03 21:56:43 +08:00
hiyouga 49a2dfaf90 update vllm example 2024-04-02 22:45:20 +08:00
hiyouga 66b0fe4e96 update readme 2024-04-02 22:17:48 +08:00
hiyouga 7765f337c7 add zh readme 2024-04-02 20:58:45 +08:00
hiyouga 11a6c1bad6 update readme 2024-04-02 20:37:37 +08:00
hiyouga 949e5fe638 update readme 2024-04-02 20:22:11 +08:00
hiyouga 92dab8a90b simplify readme 2024-04-02 20:07:43 +08:00
hiyouga 54b7d34908 add qwen1.5 moe 2024-04-01 21:49:40 +08:00
hiyouga aee634cd20 fix #3077 2024-04-01 21:35:18 +08:00
hiyouga 099db6acc0 update readme 2024-03-31 18:46:34 +08:00
hiyouga 17bf8a2c3a support ORPO 2024-03-31 18:29:50 +08:00
hiyouga c1fe6ce782 update readme 2024-03-28 22:02:32 +08:00
hiyouga 1e43319f9c add project 2024-03-28 20:24:27 +08:00
hiyouga 6c94305e47 update readme 2024-03-28 18:35:11 +08:00
hiyouga 8c77b10912 update trainers 2024-03-28 18:16:27 +08:00
hiyouga 7b3d8188f5 update readme 2024-03-25 23:06:13 +08:00
hoshi-hiyouga f633ac6646 Merge pull request #2967 from Tsumugii24/main: Update README_zh.md 2024-03-25 23:02:22 +08:00
Tsumugii24 1704599503 Update README.md 2024-03-25 22:54:38 +08:00
hiyouga 6f2b563f12 release v0.6.0 2024-03-25 22:38:56 +08:00
hiyouga a1c8c98c5f fix #2941 2024-03-24 00:28:44 +08:00
0xez 675ba41562 Update README.md, fix the release date of the paper 2024-03-21 22:14:48 +08:00
hiyouga 5eaa50fa01 add citation 2024-03-21 17:04:10 +08:00
hiyouga 0581bfdbc7 paper release 2024-03-21 13:49:17 +08:00
hiyouga bfe7a91289 update readme 2024-03-21 00:48:42 +08:00
hiyouga 8408225162 support fsdp + qlora 2024-03-21 00:36:06 +08:00