Gnome Ann
67e28d2b5c
Typical sampling needs to use nansum instead of sum
...
If `probs` is zero then `log_probs` will be negative infinity, and the
calculation of `neg_entropy` would then give NaN because zero times
infinity is a mathematically indeterminate value.
We need to use nansum so that those NaN values are treated as zeros to
ignore them in the entropy calculation.
2022-03-28 00:02:31 -04:00
henk717
77ae893f4d
Merge pull request #106 from VE-FORBRYDERNE/typical
...
Typical sampling
2022-03-28 00:14:09 +02:00
Gnome Ann
e2cd49d552
Typo fix in `TypicalLogitsWarper`
2022-03-27 17:08:57 -04:00
Gnome Ann
bbd0a83fef
Fix `TypicalLogitsWarper` argument typing
2022-03-27 16:59:23 -04:00
Gnome Ann
d5989d4c62
Hide division by zero warning in JAX typical filter
...
This warning happens when `np.log` gets an input containing zeros.
In that case, NumPy will throw a warning and output negative infinity.
Negative infinity is the correct behaviour here, so we can safely ignore
the warning.
2022-03-27 16:57:12 -04:00
Gnome Ann
20e48b11d7
Typical sampling
2022-03-27 16:25:50 -04:00
henk717
e4c72ca2e5
Merge pull request #104 from VE-FORBRYDERNE/retry-randomgame
...
Allow regenerating random story using Retry button
2022-03-24 12:57:04 +01:00
Gnome Ann
0348970b19
Make sure AI is not busy when using retry to regenerate random story
2022-03-23 22:09:35 -04:00
Gnome Ann
4832dd6f37
Allow regenerating random story using Retry button
...
Commit b55e5a8e0b
removed this feature, so
this commit adds it back.
2022-03-23 13:39:46 -04:00
henk717
38d78d10db
Merge pull request #103 from VE-FORBRYDERNE/neox
...
Divide GPT-NeoX replicated bias layers by 4 again instead of by 8
2022-03-21 02:19:32 +01:00
henk717
cf99f02ca5
Merge branch 'main' into united
2022-03-20 19:22:53 +01:00
henk717
20eab085dd
Fix AutoSave Toggle
2022-03-20 19:12:11 +01:00
henk717
5c795609e4
KML Fix
2022-03-20 13:10:56 +01:00
Gnome Ann
73aecc0510
Divide NeoX replicated bias layers by 4 again instead of by 8
2022-03-20 01:04:55 -04:00
henk717
f1487a4551
New Linux Runtime
2022-03-20 00:00:21 +01:00
henk717
a7f652f293
Merge pull request #101 from VE-FORBRYDERNE/neox
...
GPT-NeoX-20B support in Colab TPU instances
2022-03-19 09:56:15 +01:00
Gnome Ann
05fc46b253
Changing this again to divide by 8
2022-03-19 02:09:41 -04:00
Gnome Ann
b1125a6705
Add EOS and padding token to default NeoX badwords
2022-03-19 01:30:02 -04:00
Gnome Ann
6c20d0d657
Nevermind, dividing by 4 is actually correct...
2022-03-19 00:55:04 -04:00
Gnome Ann
f16b61ec77
Should divide NeoX replicated parameters by 8 (not by 4)
...
Also, suppresses the PyTorch 1.11 warning about transposing tensors with
ndim != 2 in the new code
2022-03-19 00:48:33 -04:00
Gnome Ann
c2c139e940
Change default PE type for NeoX to `neox_rotary`
2022-03-19 00:26:04 -04:00
Gnome Ann
85a4959efa
Merge branch 'united' into neox
2022-03-18 11:19:03 -04:00
henk717
f581fe89cb
Torch version changes
2022-03-17 21:11:36 +01:00
henk717
9e9c1c3fe0
Merge pull request #100 from VE-FORBRYDERNE/patch
...
Add PyTorch 1.11 support for lazy loader
2022-03-17 21:06:38 +01:00
Gnome Ann
c444260eac
Silence PyTorch warning about transposing tensors with dimension != 2
2022-03-17 15:16:56 -04:00
Gnome Ann
ef21ab9c91
PyTorch 1.9 lazy loader compatibility bugfix
2022-03-17 14:10:51 -04:00
Gnome Ann
eaf190469d
Add PyTorch 1.11 support for lazy loader
2022-03-17 12:51:41 -04:00
henk717
9235754eb9
Dependency Fixes
2022-03-17 00:35:59 +01:00
henk717
a3e5e052b3
Newer umamba + slope tweak
2022-03-16 18:34:02 +01:00
Gnome Ann
95c4251db9
Print two newlines before loading HF models
2022-03-15 13:58:53 -04:00
Gnome Ann
9e2848e48f
Show parameter count when loading GPT-NeoX in Colab TPU instance
2022-03-15 13:55:27 -04:00
Gnome Ann
9dc48b15f0
Add custom badwords and pad token ID for GPT-NeoX
2022-03-14 23:31:49 -04:00
Gnome Ann
88f247d535
GPT-NeoX-20B support in Colab TPU instances
2022-03-14 23:14:20 -04:00
henk717
4892556059
Model saving for colab mode
2022-03-13 11:22:44 +01:00
henk717
ccadeabbde
Merge pull request #99 from VE-FORBRYDERNE/model-patch
...
Model loading fixes
2022-03-13 11:10:15 +01:00
Gnome Ann
2b8c46338e
Change current working directory to KoboldAI folder
2022-03-13 01:22:11 -05:00
Gnome Ann
48d07adb54
Also fallback to generic GPT2 tokenizer in Colab TPU instances
2022-03-12 23:19:35 -05:00
henk717
d29a629320
Merge pull request #98 from ebolam/united
...
Fix for retry
2022-03-12 16:52:07 +01:00
ebolam
45eed78d21
Merge branch 'united' of https://github.com/ebolam/KoboldAI into united
2022-03-12 10:33:01 -05:00
ebolam
b55e5a8e0b
Retry Bug Fix
2022-03-12 10:32:27 -05:00
henk717
2e1b3c82f9
Merge pull request #97 from ebolam/united
...
Fix for retry causing issues for future redo actions
2022-03-11 17:41:49 +01:00
ebolam
ae854bab3d
Fix for retry causing issues for future redo actions
2022-03-11 11:40:55 -05:00
henk717
2c66461c14
Merge pull request #96 from VE-FORBRYDERNE/dlpack
...
Use DLPack to convert PyTorch tensors to JAX arrays
2022-03-10 22:00:38 +01:00
Gnome Ann
a99eb8724d
Use DLPack to convert PyTorch tensors to JAX arrays
2022-03-10 15:12:42 -05:00
henk717
b02d5e8696
Allows missing model_config again
2022-03-10 19:59:10 +01:00
henk717
172a548fa1
Fallback to generic GPT2 Tokenizer
2022-03-10 19:52:15 +01:00
henk717
68281184bf
Remove Lowmem from TPU
2022-03-09 19:21:15 +01:00
henk717
9dee9b5c6d
Ignore incorrect problems
2022-03-09 12:03:37 +01:00
henk717
a28e553412
Remove unused gettokenids
2022-03-09 11:59:33 +01:00
henk717
7434c9221b
Expand OAI Setting Compatibility
2022-03-07 08:56:47 +01:00