henk717 | a7f652f293 | Merge pull request #101 from VE-FORBRYDERNE/neox: GPT-NeoX-20B support in Colab TPU instances | 2022-03-19 09:56:15 +01:00 |
Gnome Ann | 05fc46b253 | Changing this again to divide by 8 | 2022-03-19 02:09:41 -04:00 |
Gnome Ann | b1125a6705 | Add EOS and padding token to default NeoX badwords | 2022-03-19 01:30:02 -04:00 |
Gnome Ann | 6c20d0d657 | Nevermind, dividing by 4 is actually correct... | 2022-03-19 00:55:04 -04:00 |
Gnome Ann | f16b61ec77 | Should divide NeoX replicated parameters by 8 (not by 4). Also, suppresses the PyTorch 1.11 warning about transposing tensors with ndim != 2 in the new code | 2022-03-19 00:48:33 -04:00 |
Gnome Ann | c2c139e940 | Change default PE type for NeoX to `neox_rotary` | 2022-03-19 00:26:04 -04:00 |
Gnome Ann | 85a4959efa | Merge branch 'united' into neox | 2022-03-18 11:19:03 -04:00 |
henk717 | f581fe89cb | Torch version changes | 2022-03-17 21:11:36 +01:00 |
henk717 | 9e9c1c3fe0 | Merge pull request #100 from VE-FORBRYDERNE/patch: Add PyTorch 1.11 support for lazy loader | 2022-03-17 21:06:38 +01:00 |
Gnome Ann | c444260eac | Silence PyTorch warning about transposing tensors with dimension != 2 | 2022-03-17 15:16:56 -04:00 |
Gnome Ann | ef21ab9c91 | PyTorch 1.9 lazy loader compatibility bugfix | 2022-03-17 14:10:51 -04:00 |
Gnome Ann | eaf190469d | Add PyTorch 1.11 support for lazy loader | 2022-03-17 12:51:41 -04:00 |
henk717 | 9235754eb9 | Dependency Fixes | 2022-03-17 00:35:59 +01:00 |
henk717 | a3e5e052b3 | Newer umamba + slope tweak | 2022-03-16 18:34:02 +01:00 |
Gnome Ann | 95c4251db9 | Print two newlines before loading HF models | 2022-03-15 13:58:53 -04:00 |
Gnome Ann | 9e2848e48f | Show parameter count when loading GPT-NeoX in Colab TPU instance | 2022-03-15 13:55:27 -04:00 |
Gnome Ann | 9dc48b15f0 | Add custom badwords and pad token ID for GPT-NeoX | 2022-03-14 23:31:49 -04:00 |
Gnome Ann | 88f247d535 | GPT-NeoX-20B support in Colab TPU instances | 2022-03-14 23:14:20 -04:00 |
ebolam | e65015aed4 | Merge branch 'Web-UI' of https://github.com/ebolam/KoboldAI into HEAD | 2022-03-14 16:43:42 -04:00 |
ebolam | 36fef6bfbc | Delete base.yml | 2022-03-14 16:43:15 -04:00 |
ebolam | bc5f30610d | Removed base.yml | 2022-03-14 16:42:49 -04:00 |
henk717 | 4892556059 | Model saving for colab mode | 2022-03-13 11:22:44 +01:00 |
henk717 | ccadeabbde | Merge pull request #99 from VE-FORBRYDERNE/model-patch: Model loading fixes | 2022-03-13 11:10:15 +01:00 |
Gnome Ann | 2b8c46338e | Change current working directory to KoboldAI folder | 2022-03-13 01:22:11 -05:00 |
Gnome Ann | 48d07adb54 | Also fallback to generic GPT2 tokenizer in Colab TPU instances | 2022-03-12 23:19:35 -05:00 |
ebolam | 8ae0a4a3e7 | Online Services Working now (without a way to test as I don't have accounts) | 2022-03-12 14:21:11 -05:00 |
henk717 | d29a629320 | Merge pull request #98 from ebolam/united: Fix for retry | 2022-03-12 16:52:07 +01:00 |
ebolam | 45eed78d21 | Merge branch 'united' of https://github.com/ebolam/KoboldAI into united | 2022-03-12 10:33:01 -05:00 |
ebolam | b55e5a8e0b | Retry Bug Fix | 2022-03-12 10:32:27 -05:00 |
henk717 | 2e1b3c82f9 | Merge pull request #97 from ebolam/united: Fix for retry causing issues for future redo actions | 2022-03-11 17:41:49 +01:00 |
ebolam | ae854bab3d | Fix for retry causing issues for future redo actions | 2022-03-11 11:40:55 -05:00 |
ebolam | 772ae2eb80 | Added model info to show model load progress in UI | 2022-03-11 11:31:41 -05:00 |
henk717 | 2c66461c14 | Merge pull request #96 from VE-FORBRYDERNE/dlpack: Use DLPack to convert PyTorch tensors to JAX arrays | 2022-03-10 22:00:38 +01:00 |
Gnome Ann | a99eb8724d | Use DLPack to convert PyTorch tensors to JAX arrays | 2022-03-10 15:12:42 -05:00 |
henk717 | b02d5e8696 | Allows missing model_config again | 2022-03-10 19:59:10 +01:00 |
henk717 | 172a548fa1 | Fallback to generic GPT2 Tokenizer | 2022-03-10 19:52:15 +01:00 |
henk717 | 68281184bf | Remove Lowmem from TPU | 2022-03-09 19:21:15 +01:00 |
henk717 | 9dee9b5c6d | Ignore incorrect problems | 2022-03-09 12:03:37 +01:00 |
henk717 | a28e553412 | Remove unused gettokenids | 2022-03-09 11:59:33 +01:00 |
ebolam | 0943926f6a | Fix for lazy loading | 2022-03-07 19:52:44 -05:00 |
ebolam | bfc07073e3 | layer count fix | 2022-03-07 19:33:24 -05:00 |
ebolam | d8ab58892d | saved layer value fix | 2022-03-07 19:21:55 -05:00 |
ebolam | da53d7edb3 | Custom Path Load fix | 2022-03-07 18:54:11 -05:00 |
ebolam | d1a64e25da | Custom Model Load Fix | 2022-03-07 18:44:37 -05:00 |
ebolam | 70f1c2da9c | Added stub for model name feedback | 2022-03-07 14:20:25 -05:00 |
ebolam | d0553779ab | Bug Fix | 2022-03-07 12:33:35 -05:00 |
ebolam | 6a08fe2f10 | Added scroll bars to the model load menu | 2022-03-07 12:04:41 -05:00 |
ebolam | c50fe77a7d | Load Fix | 2022-03-07 11:57:33 -05:00 |
ebolam | 49fc854e55 | Added saving of breakmodel values so that it defaults to it on next load | 2022-03-07 11:49:34 -05:00 |
ebolam | 2cf6b6e650 | Merge branch 'henk717:united' into united | 2022-03-07 11:31:14 -05:00 |