vfbd
|
b20d80ca2a
|
Add vocab padding to embedding bias in gptj.json
|
2022-11-02 19:02:09 -04:00 |
vfbd
|
d9e7ca5b48
|
Upload map file for BLOOM
|
2022-07-07 17:48:00 -04:00 |
Gnome Ann
|
5e3c7c07ae
|
Merge branch 'main' into neox
|
2022-06-21 19:30:51 -04:00 |
Gnome Ann
|
a7e3ef71aa
|
Add final layer norm to OPT
|
2022-06-21 16:36:26 -04:00 |
Gnome Ann
|
4fa5f1cd6a
|
Add TPU support for OPT-350M
The 350M model seems to have a different structure than the other ones ???
|
2022-05-12 22:21:15 -04:00 |
Gnome Ann
|
f5e689a725
|
Upload maps/opt.json and update requirements
|
2022-05-12 19:09:31 -04:00 |
Gnome Ann
|
a61ba0d000
|
Upload map file for GPT-NeoX
|
2022-04-29 00:41:56 -04:00 |
Gnome Ann
|
2d38e90509
|
Remove lm_head.weight from maps/xglm.json
|
2022-04-20 12:56:57 -04:00 |
Gnome Ann
|
4625158d30
|
Fix typo in previous commit
|
2022-03-05 12:56:42 -05:00 |
Gnome Ann
|
0a258a6282
|
Support for loading HF models on TPU with `--colab_tpu`
|
2022-03-05 12:33:33 -05:00 |
Gnome Ann
|
58a2c18821
|
Add lazy torch loading support to transformers backend
|
2022-03-04 00:33:10 -05:00 |