Henk
8376f12e21
Add NS mode
...
OPT supports newlines, but it also needs some of the behavior we use in S mode. NS mode is a more limited version of S mode that still handles the </s> token, but instead of replacing it with a new line we replace it empty and newlines are not converted.
In future if your Fairseq style model has newline support use NS mode, while if it needs artifically inserted newlines use S mode. This also means that people finetuning fairseq models to include newlines might benefit from testing their models on ns mode.
2022-05-13 10:44:12 +02:00
Gnome Ann
55079f672a
Fix typo in soft prompt patching code
2022-05-13 01:51:55 -04:00
Gnome Ann
29bb3f569b
Fix a bug in OPTForCausalLM where self.lm_head is the wrong size
2022-05-13 01:37:17 -04:00
Gnome Ann
defbb53b68
OPT breakmodel
2022-05-13 01:03:38 -04:00
Gnome Ann
b1d8797a54
Allow TPU Colab to load sharded HF models
2022-05-12 23:51:40 -04:00
Gnome Ann
4fa5f1cd6a
Add TPU support for OPT-350M
...
The 350M model seems to have a different structure than the other ones ???
2022-05-12 22:21:15 -04:00
Gnome Ann
dfa2aa7314
Merge branch 'united' into opt
2022-05-12 20:11:53 -04:00
Henk
5c4a087970
Disable S mode for OPT
2022-05-13 01:47:59 +02:00
Gnome Ann
f5e689a725
Upload maps/opt.json and update requirements
2022-05-12 19:09:31 -04:00
Henk
e98cc3cb16
OPT models
2022-05-12 23:55:21 +02:00
Gnome Ann
39b770d6ef
Merge branch 'united' into neox
2022-05-12 17:29:34 -04:00
Henk
376e76f5da
S mode for OPT
2022-05-12 02:18:14 +02:00
henk717
a1c7017ddc
Merge pull request #127 from VE-FORBRYDERNE/aria2
...
Handle aria2 properly when it exits with nonzero exit code
2022-05-11 22:57:45 +02:00
Gnome Ann
580dd0b2a3
Handle aria2 properly when it exits with nonzero exit code
2022-05-11 16:23:24 -04:00
henk717
05549de42d
Merge pull request #126 from VE-FORBRYDERNE/aria2
...
Aria2 downloader bug fixes
2022-05-11 21:58:31 +02:00
Gnome Ann
2ebba9488b
Change `force_download` back to False
...
This is to prevent fully downloaded models from being re-downloaded in
Colab.
2022-05-11 15:51:48 -04:00
Gnome Ann
6d481ca57e
Merge branch 'united' into aria2
2022-05-11 15:51:11 -04:00
Gnome Ann
c65272052a
aria2 now downloads to different filename and renames afterwards
...
This is to match the behaviour of the original transformers downloader
in order to deal with the rare case of someone downloading a model using
aria2, cancelling before it finishes, and then attempting to resume the
download with the normal transformers downloader.
2022-05-11 15:45:38 -04:00
Henk
6d27084e8a
Better Aria2 Defaults
...
Trunc prevents slow allocation on windows, force_download=True has proven a more reliable default. Since models are converted to local formats it does not impact local users. And because -c is used the impact of checking if the model is correct is desirable and minimal.
2022-05-11 21:38:33 +02:00
Gnome Ann
7a3f865e3f
Prevent aria2 from resuming cancelled downloads
...
Resumed downloads tend to be very slow.
The original transformers downloader didn't allow resuming downloads
either.
2022-05-11 15:14:37 -04:00
Gnome Ann
c81f3bd084
Use `--file-allocation=trunc` instead of `--file-allocation=none`
2022-05-11 14:51:43 -04:00
Gnome Ann
f96c878d83
Use aria2 even when all model files are already in cache
...
This allows aria2 to continue downloading a pytorch_model.bin after a
cancelled download.
2022-05-11 14:43:56 -04:00
Gnome Ann
f60c7d8492
Fix the behaviour of `aria2_hook()` when using `force_download`
2022-05-11 14:41:34 -04:00
Gnome Ann
5732a8f15a
Don't use `aria2_hook()` if `force_download=True` is used
2022-05-11 14:40:31 -04:00
henk717
903d593ce4
Merge pull request #125 from VE-FORBRYDERNE/aria2
...
Use aria2 to improve HF model download speeds in Colab
2022-05-11 07:55:53 +02:00
Gnome Ann
46cfa1367f
Add `--no_aria2` command line flag
2022-05-11 00:44:56 -04:00
Gnome Ann
f09959f9be
Fix patching code of `PreTrainedModel.from_pretrained()`
2022-05-11 00:41:53 -04:00
Gnome Ann
22b4f3c9df
Bug fixes for `aria2_hook()` when running Windows
2022-05-11 00:14:00 -04:00
Gnome Ann
82205722af
Fix logic of `aria2_hook()`
2022-05-10 23:46:29 -04:00
Gnome Ann
4b49d1c464
Make sure `vars.revision` is defined
2022-05-10 22:51:36 -04:00
Gnome Ann
4b693b4858
Fix the logic of `force_download` in utils.py
2022-05-10 22:47:03 -04:00
Gnome Ann
c1ef20bcff
Also enable aria2 downloading for non-sharded checkpoints
2022-05-10 22:43:41 -04:00
Gnome Ann
e115bb68e4
aria2 downloads in utils.py now use correct user agent
2022-05-10 22:22:46 -04:00
Gnome Ann
b97b2a02d6
Add `--revision` command line flag
2022-05-10 22:14:56 -04:00
Gnome Ann
937d9ee06a
Change default `model.save_pretrained` shard size to 500 MiB
2022-05-10 22:04:25 -04:00
Gnome Ann
a388c63023
Use aria2 to download split checkpoints
2022-05-10 21:28:13 -04:00
Henk
01e15d03d6
Remove play.ipnyb
...
Interactive Python doesn't work well on Jupyter, until they support what Colab can do this file is pointless.
2022-05-11 01:49:07 +02:00
Henk
7a9297adc3
Jupyter Git integration
2022-05-11 01:31:12 +02:00
Henk
f917d3438f
Updated models
2022-05-10 21:39:16 +02:00
henk717
7fcc1a9acb
Fix C1
2022-05-10 18:38:50 +02:00
Henk
c5462ec480
Better Jupyter
2022-05-09 02:41:00 +02:00
Henk
e09b939f04
Force Bash
2022-05-08 16:02:16 +02:00
Henk
0ca4917056
Linux Runtime Info
2022-05-08 00:24:14 +02:00
Henk
030df1a09f
Small installer fix
2022-05-07 20:22:33 +02:00
Henk
a3dc188c8f
Linux Installer Improvements
2022-05-01 15:58:37 +02:00
henk717
9f7c9c4b9e
Merge pull request #124 from Crafteko/united
...
Replaced Adventure 125M and added C1-1.3B to the menu
2022-04-29 17:38:55 +02:00
Gnome Ann
a61ba0d000
Upload map file for GPT-NeoX
2022-04-29 00:41:56 -04:00
subtlewave
9c83ef7fa9
Replaced Adventure 125M and added C1-1.3B
2022-04-28 22:35:04 +00:00
Henk
810f6614af
Cap GIT version for now
2022-04-27 18:20:43 +02:00
henk717
716951f059
Merge pull request #123 from VE-FORBRYDERNE/settings
...
Prevent the settings throttle from lagging the sliders
2022-04-27 03:17:53 +02:00