KoboldAI-Client

mirror of https://github.com/KoboldAI/KoboldAI-Client.git synced 2025-06-05 21:59:24 +02:00

Author	SHA1	Message	Date
Gnome Ann	67e28d2b5c	Typical sampling needs to use nansum instead of sum. If `probs` is zero then `log_probs` will be negative infinity, and the calculation of `neg_entropy` would then give NaN because zero times infinity is a mathematically indeterminate value. We need to use nansum so that those NaN values are treated as zeros to ignore them in the entropy calculation.	2022-03-28 00:02:31 -04:00
henk717	77ae893f4d	Merge pull request #106 from VE-FORBRYDERNE/typical Typical sampling	2022-03-28 00:14:09 +02:00
Gnome Ann	e2cd49d552	Typo fix in `TypicalLogitsWarper`	2022-03-27 17:08:57 -04:00
Gnome Ann	bbd0a83fef	Fix `TypicalLogitsWarper` argument typing	2022-03-27 16:59:23 -04:00
Gnome Ann	d5989d4c62	Hide division by zero warning in JAX typical filter. This warning happens when `np.log` gets an input containing zeros. In that case, NumPy will throw a warning and output negative infinity. Negative infinity is the correct behaviour here, so we can safely ignore the warning.	2022-03-27 16:57:12 -04:00
Gnome Ann	20e48b11d7	Typical sampling	2022-03-27 16:25:50 -04:00
henk717	e4c72ca2e5	Merge pull request #104 from VE-FORBRYDERNE/retry-randomgame Allow regenerating random story using Retry button	2022-03-24 12:57:04 +01:00
Gnome Ann	0348970b19	Make sure AI is not busy when using retry to regenerate random story	2022-03-23 22:09:35 -04:00
Gnome Ann	4832dd6f37	Allow regenerating random story using Retry button. Commit `b55e5a8e0b` removed this feature, so this commit adds it back.	2022-03-23 13:39:46 -04:00
henk717	38d78d10db	Merge pull request #103 from VE-FORBRYDERNE/neox Divide GPT-NeoX replicated bias layers by 4 again instead of by 8	2022-03-21 02:19:32 +01:00
henk717	cf99f02ca5	Merge branch 'main' into united	2022-03-20 19:22:53 +01:00
henk717	20eab085dd	Fix AutoSave Toggle	2022-03-20 19:12:11 +01:00
henk717	5c795609e4	KML Fix	2022-03-20 13:10:56 +01:00
Gnome Ann	73aecc0510	Divide NeoX replicated bias layers by 4 again instead of by 8	2022-03-20 01:04:55 -04:00
henk717	f1487a4551	New Linux Runtime	2022-03-20 00:00:21 +01:00
henk717	a7f652f293	Merge pull request #101 from VE-FORBRYDERNE/neox GPT-NeoX-20B support in Colab TPU instances	2022-03-19 09:56:15 +01:00
Gnome Ann	05fc46b253	Changing this again to divide by 8	2022-03-19 02:09:41 -04:00
Gnome Ann	b1125a6705	Add EOS and padding token to default NeoX badwords	2022-03-19 01:30:02 -04:00
Gnome Ann	6c20d0d657	Nevermind, dividing by 4 is actually correct...	2022-03-19 00:55:04 -04:00
Gnome Ann	f16b61ec77	Should divide NeoX replicated parameters by 8 (not by 4). Also, suppresses the PyTorch 1.11 warning about transposing tensors with ndim != 2 in the new code	2022-03-19 00:48:33 -04:00
Gnome Ann	c2c139e940	Change default PE type for NeoX to `neox_rotary`	2022-03-19 00:26:04 -04:00
Gnome Ann	85a4959efa	Merge branch 'united' into neox	2022-03-18 11:19:03 -04:00
henk717	f581fe89cb	Torch version changes	2022-03-17 21:11:36 +01:00
henk717	9e9c1c3fe0	Merge pull request #100 from VE-FORBRYDERNE/patch Add PyTorch 1.11 support for lazy loader	2022-03-17 21:06:38 +01:00
Gnome Ann	c444260eac	Silence PyTorch warning about transposing tensors with dimension != 2	2022-03-17 15:16:56 -04:00
Gnome Ann	ef21ab9c91	PyTorch 1.9 lazy loader compatibility bugfix	2022-03-17 14:10:51 -04:00
Gnome Ann	eaf190469d	Add PyTorch 1.11 support for lazy loader	2022-03-17 12:51:41 -04:00
henk717	9235754eb9	Dependency Fixes	2022-03-17 00:35:59 +01:00
henk717	a3e5e052b3	Newer umamba + slope tweak	2022-03-16 18:34:02 +01:00
Gnome Ann	95c4251db9	Print two newlines before loading HF models	2022-03-15 13:58:53 -04:00
Gnome Ann	9e2848e48f	Show parameter count when loading GPT-NeoX in Colab TPU instance	2022-03-15 13:55:27 -04:00
Gnome Ann	9dc48b15f0	Add custom badwords and pad token ID for GPT-NeoX	2022-03-14 23:31:49 -04:00
Gnome Ann	88f247d535	GPT-NeoX-20B support in Colab TPU instances	2022-03-14 23:14:20 -04:00
henk717	4892556059	Model saving for colab mode	2022-03-13 11:22:44 +01:00
henk717	ccadeabbde	Merge pull request #99 from VE-FORBRYDERNE/model-patch Model loading fixes	2022-03-13 11:10:15 +01:00
Gnome Ann	2b8c46338e	Change current working directory to KoboldAI folder	2022-03-13 01:22:11 -05:00
Gnome Ann	48d07adb54	Also fallback to generic GPT2 tokenizer in Colab TPU instances	2022-03-12 23:19:35 -05:00
henk717	d29a629320	Merge pull request #98 from ebolam/united Fix for retry	2022-03-12 16:52:07 +01:00
ebolam	45eed78d21	Merge branch 'united' of https://github.com/ebolam/KoboldAI into united	2022-03-12 10:33:01 -05:00
ebolam	b55e5a8e0b	Retry Bug Fix	2022-03-12 10:32:27 -05:00
henk717	2e1b3c82f9	Merge pull request #97 from ebolam/united Fix for retry causing issues for future redo actions	2022-03-11 17:41:49 +01:00
ebolam	ae854bab3d	Fix for retry causing issues for future redo actions	2022-03-11 11:40:55 -05:00
henk717	2c66461c14	Merge pull request #96 from VE-FORBRYDERNE/dlpack Use DLPack to convert PyTorch tensors to JAX arrays	2022-03-10 22:00:38 +01:00
Gnome Ann	a99eb8724d	Use DLPack to convert PyTorch tensors to JAX arrays	2022-03-10 15:12:42 -05:00
henk717	b02d5e8696	Allows missing model_config again	2022-03-10 19:59:10 +01:00
henk717	172a548fa1	Fallback to generic GPT2 Tokenizer	2022-03-10 19:52:15 +01:00
henk717	68281184bf	Remove Lowmem from TPU	2022-03-09 19:21:15 +01:00
henk717	9dee9b5c6d	Ignore incorrect problems	2022-03-09 12:03:37 +01:00
henk717	a28e553412	Remove unused gettokenids	2022-03-09 11:59:33 +01:00
henk717	7434c9221b	Expand OAI Setting Compatibility	2022-03-07 08:56:47 +01:00
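
The messages for commits `67e28d2b5c` and `d5989d4c62` describe two related NumPy behaviours: `np.log` of zero yields negative infinity (with a warning), and multiplying that by a zero probability yields NaN, which `sum` propagates and `nansum` skips. The sketch below only illustrates those behaviours. It is not the repository's code: the function name `neg_entropy_demo` and the example `probs` array are hypothetical, and the real typical-sampling filter may be structured differently.

```python
import numpy as np


def neg_entropy_demo(probs):
    """Return (naive, nan_safe) negative entropy of a probability vector.

    Hypothetical helper for illustration only; not taken from KoboldAI.
    """
    # np.log(0) returns -inf and emits a divide-by-zero RuntimeWarning.
    # -inf is the intended value, so only the warning is silenced here.
    # 0 * -inf also triggers an "invalid value" warning, silenced likewise.
    with np.errstate(divide="ignore", invalid="ignore"):
        log_probs = np.log(probs)
        terms = probs * log_probs  # zero-probability entries become NaN

    naive = float(np.sum(terms))        # NaN as soon as one term is NaN
    nan_safe = float(np.nansum(terms))  # NaN terms are treated as zero
    return naive, nan_safe


# Two equally likely tokens and one token with zero probability.
probs = np.array([0.5, 0.5, 0.0])
naive, nan_safe = neg_entropy_demo(probs)
print("sum:   ", naive)
print("nansum:", nan_safe)
```

With this input, the plain `sum` result is NaN, while `nansum` gives the negative entropy of the two nonzero entries, which is `-ln 2`. Treating the NaN terms as zero matches the usual convention that a zero-probability outcome contributes nothing to the entropy.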


984 Commits