hiyouga
|
a2ba69183b
|
fix bug in PPO stage
|
2023-07-05 19:14:10 +08:00 |
hiyouga
|
8e3540c62d
|
fix compute dtype
|
2023-07-05 15:13:00 +08:00 |
hiyouga
|
c136f362c1
|
support falcon model #72
|
2023-07-05 15:00:06 +08:00 |
hiyouga
|
966b5c70fc
|
Update wechat.jpg
|
2023-07-05 00:22:22 +08:00 |
hiyouga
|
cac87fd553
|
fix bleu score
|
2023-07-05 00:11:21 +08:00 |
hiyouga
|
395ed1cf1b
|
set use_cache before saving model
|
2023-07-04 23:18:20 +08:00 |
hiyouga
|
65e9ce2cdd
|
fix seq2seq predictions
|
2023-07-04 22:56:51 +08:00 |
hoshi-hiyouga
|
cb26f78923
|
Merge pull request #119 from codemayq/main
add the pre-built version of bitsandbytes library for windows user
|
2023-07-03 19:51:46 +08:00 |
codemayq
|
d3b30ecde3
|
add the pre-built version of bitsandbytes library for windows user
|
2023-07-03 13:58:10 +08:00 |
hiyouga
|
0db9d29111
|
Update auto_gptq.py
|
2023-07-02 20:56:11 +08:00 |
hiyouga
|
cf6d57fd3e
|
add autogptq
|
2023-07-02 20:36:37 +08:00 |
hiyouga
|
b8e1f09a2e
|
Update wechat.jpg
|
2023-06-30 15:45:20 +08:00 |
hiyouga
|
92fa515e97
|
fix typo
|
2023-06-30 10:09:59 +08:00 |
hiyouga
|
021b035c1e
|
Update README.md
|
2023-06-29 19:36:22 +08:00 |
hiyouga
|
f14bd729a8
|
rename evaluate.py
|
2023-06-29 15:40:39 +08:00 |
hiyouga
|
23a7266272
|
Update evaluate.py
|
2023-06-29 15:40:03 +08:00 |
hiyouga
|
70592035b8
|
Update README.md
|
2023-06-29 15:37:19 +08:00 |
hiyouga
|
3154fec979
|
add open assistant dataset
|
2023-06-28 23:09:33 +08:00 |
hiyouga
|
4d0fddba21
|
update loading logic
|
2023-06-28 12:07:16 +08:00 |
hiyouga
|
0a46313cca
|
fix loading best model
|
2023-06-28 01:55:12 +08:00 |
hiyouga
|
7826a8ca77
|
fix RM accuracy
|
2023-06-28 01:40:13 +08:00 |
hiyouga
|
9cb1af71f3
|
add star history
|
2023-06-27 23:56:29 +08:00 |
hiyouga
|
450910c1db
|
tiny fix
|
2023-06-27 23:54:24 +08:00 |
hiyouga
|
18f87c1b25
|
fix initializing data arguments
|
2023-06-27 22:50:23 +08:00 |
hiyouga
|
2e01abfda5
|
support save full model, replace BOS token
|
2023-06-27 21:40:11 +08:00 |
hiyouga
|
1c732e2537
|
fix decoding in seq2seq
|
2023-06-27 19:33:08 +08:00 |
hiyouga
|
33f2141507
|
Update wechat.jpg
|
2023-06-27 16:41:09 +08:00 |
hiyouga
|
4f3772b342
|
Update evaluate.py
|
2023-06-26 23:41:33 +08:00 |
hiyouga
|
5a0a9daf74
|
Create evaluate.py
|
2023-06-26 23:30:18 +08:00 |
hoshi-hiyouga
|
907e065454
|
Merge pull request #86 from Jingsong-Yan/main
Update README.md with baichuan-7b-rtx3090
|
2023-06-26 20:14:40 +08:00 |
Jingsong-Yan
|
90bb5b6f37
|
Update README.md with baichuan-7b-rtx3090
Add a description of the baichuan-7b-rtx3090 branch to the Changelog
|
2023-06-26 19:45:41 +08:00 |
hiyouga
|
993cabdd4c
|
Merge branch 'main' of https://github.com/hiyouga/LLaMA-Efficient-Tuning
|
2023-06-26 18:07:09 +08:00 |
hiyouga
|
1175948029
|
fix generation in seq2seq.py
|
2023-06-26 18:07:06 +08:00 |
hoshi-hiyouga
|
95b057f5af
|
Merge pull request #84 from wu-yy/patch-1
Update requirements.txt
|
2023-06-26 15:39:08 +08:00 |
蓝鲸123
|
e2a16d549e
|
Update requirements.txt
LlamaTokenizer requires the SentencePiece library but it was not found in your environment.
|
2023-06-26 15:36:19 +08:00 |
hiyouga
|
cec9760eb8
|
support prefixes, loading multiple local files
|
2023-06-26 15:32:40 +08:00 |
hiyouga
|
f030b09924
|
update api
|
2023-06-26 13:39:57 +08:00 |
hiyouga
|
d21cc71750
|
Update wechat.jpg
|
2023-06-25 23:41:11 +08:00 |
hiyouga
|
0697643358
|
update readme
|
2023-06-23 00:17:05 +08:00 |
hiyouga
|
614d3a996c
|
update API
|
2023-06-22 20:46:24 +08:00 |
hiyouga
|
76ecb8c222
|
match api with OpenAI format
|
2023-06-22 20:27:00 +08:00 |
hoshi-hiyouga
|
9324940b76
|
Merge pull request #68 from mMrBun/main
Compatible with OpenAI API.
|
2023-06-22 15:52:34 +08:00 |
Bun
|
6e4db0903f
|
Compatible with OpenAI API.
|
2023-06-21 14:45:04 +08:00 |
hiyouga
|
ded5aa3c3d
|
Update wechat.jpg
|
2023-06-19 19:46:04 +08:00 |
hiyouga
|
f621f7631a
|
add default template
|
2023-06-16 21:12:17 +08:00 |
hiyouga
|
334d1a6d26
|
add belle multiturn dataset
|
2023-06-16 20:01:16 +08:00 |
hiyouga
|
a6c4b141cd
|
fix freeze layers
|
2023-06-16 17:38:21 +08:00 |
hiyouga
|
fc4d8155b3
|
add source prefix
|
2023-06-16 16:32:17 +08:00 |
hiyouga
|
0574b590ef
|
support loading lora from hub
|
2023-06-16 00:02:17 +08:00 |
hiyouga
|
0cee6ad67f
|
support baichuan model
|
2023-06-15 16:02:01 +08:00 |