Commit Graph

1422 Commits

Author SHA1 Message Date
hiyouga 3010154adb fix slow op in dpo/orpo trainer 2024-05-03 23:06:52 +08:00
hiyouga 9585838ebe fix callback log multigpu #3559 2024-05-03 21:24:27 +08:00
hiyouga 5e6f808e3c enable tqdm in webui 2024-05-03 04:42:50 +08:00
hiyouga 17d2e5147e fix gen_args 2024-05-03 04:24:50 +08:00
hiyouga 530f6b49bb fix colab gradio 2024-05-03 03:54:46 +08:00
hiyouga 245fe47ece update webui and add CLIs 2024-05-03 02:58:23 +08:00
hiyouga 39e964a97a Update prepare.sh 2024-05-02 17:16:02 +08:00
hiyouga 9433c8c215 fix badam configs 2024-05-02 02:47:04 +08:00
hoshi-hiyouga f1c0eedeb3
Merge pull request #3487 from codemayq/main
support BAdam in WebUI
2024-05-02 02:38:01 +08:00
hoshi-hiyouga dcd53cb89a
Update train.py 2024-05-02 02:21:27 +08:00
hoshi-hiyouga 282b5d5b1f
Merge pull request #3490 from khazic/main
Added the second sharegpt format
2024-05-02 02:15:23 +08:00
hoshi-hiyouga d4d9180c40
Update README_zh.md 2024-05-02 02:14:55 +08:00
hoshi-hiyouga b072ec9d1b
Update README.md 2024-05-02 02:13:46 +08:00
zhaonx 42edc81585 "add support for vllm api stop parameter" 2024-04-30 17:17:09 +08:00
codingma b4a212f934
Merge branch 'hiyouga:main' into main 2024-04-30 10:02:41 +08:00
codingma d27e6a46b4 update wechat 2024-04-30 09:40:04 +08:00
Lao ce17eccf45
Update README_zh.md 2024-04-28 23:31:37 +08:00
khazic 288911fc7b Upgrade the second sharegpt format 2024-04-28 14:30:05 +08:00
khazic d1ba32e4bb added the second sharegpt format 2024-04-28 14:27:45 +08:00
codingma 26f7170393 support BAdam in WebUI 2024-04-28 11:31:34 +08:00
codingma e898fabbe3
Merge pull request #3484 from codemayq/main
update wechat
2024-04-28 08:40:08 +08:00
codingma 850f9b554f update wechat 2024-04-28 08:37:19 +08:00
hiyouga 32347901d4 fix setup 2024-04-28 03:49:13 +08:00
hiyouga b3e33c703e fix llava rlhf 2024-04-28 03:01:49 +08:00
hiyouga 4dbbce21d5 add models to 0.7.0 2024-04-28 01:50:30 +08:00
hiyouga 5ee04d418c update readme 2024-04-26 23:39:19 +08:00
hoshi-hiyouga 8f91420223
Merge pull request #3471 from BUAADreamer/main
add llava_150k en/zh mllm sft data
2024-04-26 23:36:41 +08:00
hoshi-hiyouga 456ad61ac5
Update dataset_info.json 2024-04-26 23:36:13 +08:00
hoshi-hiyouga c29b257007
Update dataset_info.json 2024-04-26 23:34:34 +08:00
BUAADreamer a177872010 add llava_150k en/zh mllm sft data 2024-04-26 23:18:58 +08:00
hiyouga 168f56683a release v0.7.0 2024-04-26 23:18:00 +08:00
hiyouga 031775ade8 update readme 2024-04-26 20:09:14 +08:00
hiyouga 375b25131b support Qwen1.5 110B 2024-04-26 19:59:22 +08:00
hiyouga fc67b736ba fix llava qlora 2024-04-26 18:00:23 +08:00
hiyouga cd3a960f81 add llava to llamaboard 2024-04-26 06:41:35 +08:00
hiyouga e83e2fa897 update readme 2024-04-26 05:49:26 +08:00
hoshi-hiyouga 20bc959e2f
Merge pull request #3454 from hiyouga/mllm
Support fine-tuning LLaVA-1.5 MLLM @BUAADreamer
2024-04-26 05:46:29 +08:00
hiyouga 27ba1b63ce update readme 2024-04-26 05:44:30 +08:00
hiyouga e057c8de48 support mllm hf inference 2024-04-26 05:34:58 +08:00
hoshi-hiyouga c20f750d11
Merge pull request #3450 from BUAADreamer/mllm
Add Multimodal LLM Finetuning
2024-04-26 05:30:30 +08:00
hoshi-hiyouga 7f3bd35c0e
Update preprocess.py 2024-04-26 04:10:28 +08:00
hoshi-hiyouga fcd09112d5
Update aligner.py 2024-04-26 03:48:34 +08:00
hoshi-hiyouga f62cadb258
Update parser.py 2024-04-26 03:35:39 +08:00
hoshi-hiyouga 3408af236f
Update loader.py 2024-04-26 03:33:07 +08:00
hoshi-hiyouga e16f128dc3
Update workflow.py 2024-04-26 03:29:12 +08:00
hoshi-hiyouga 7d812ed841
Update loader.py 2024-04-26 03:22:40 +08:00
hoshi-hiyouga f8c26e6a34
Update dataset_info.json 2024-04-26 03:03:36 +08:00
hoshi-hiyouga 5ef293387f
Update mllm_demo.json 2024-04-26 02:58:45 +08:00
hoshi-hiyouga 7dcae3dba3
Update and rename llava_instruct_example.json to mllm_demo.json 2024-04-26 02:57:54 +08:00
hoshi-hiyouga 860549b99b
update hparam name 2024-04-26 02:49:39 +08:00