0cc4m
4d34f9b7de
Move 4-bit loading code to separate inference_model file
2023-04-16 14:20:13 +02:00
0cc4m
05b1d36803
Merge latestgptq branch onto one-some's model-structure-and-maybe-rwkv branch
2023-04-16 09:06:12 +02:00
somebody
f9fb5eba89
Remove debug
2023-04-15 18:56:49 -05:00
somebody
5dd67d027a
Workaround for socketio context errors for loading
2023-04-15 18:54:21 -05:00
somebody
08b4e317ff
Fix double slashing
2023-04-15 13:30:05 -05:00
somebody
d3a73aaeba
Fix api
2023-04-15 13:17:20 -05:00
somebody
4dcf570407
Fix legacy model loading
2023-04-15 12:57:35 -05:00
somebody
2a977feb3e
Merge branch 'model-structure-and-maybe-rwkv' of https://github.com/one-some/KoboldAI into model-structure-and-maybe-rwkv
2023-04-15 11:51:39 -05:00
somebody
a2ae87d1b7
Utils: Support safetensors aria2 download
2023-04-15 11:51:16 -05:00
0cc4m
fff2385173
Merge upstream changes
2023-04-15 18:35:54 +02:00
Henk
b68860b3de
Workaround to make --host work again
2023-04-15 18:31:39 +02:00
one-some
1b500c7179
Merge pull request #5 from LostRuins/concedo_api
Added stop sequences functionality for API calls
2023-04-15 10:51:31 -05:00
somebody
2b950f08d3
Remove legacy no accelerate fallback code
Was causing issues with disk cache. The old code had an
`and not utils.HAS_ACCELERATE` check preceding it (a variable which no
longer exists), and since disk cache is accelerate-only, there was no disk
handling code in here. Anyway, it's bad, so blast it
2023-04-15 10:47:31 -05:00
Henk
67334bd698
Pin accelerate version
2023-04-15 17:45:00 +02:00
somebody
b2e6fcfe3a
Remove line that sets disk_layers to None always
whoops
2023-04-15 10:41:10 -05:00
Henk
3eda7269f7
Fix incorrect host merge
2023-04-15 14:58:24 +02:00
Concedo
dd01cf1a93
Merge branch 'concedo_api' of https://github.com/LostRuins/KoboldAI into concedo_api
2023-04-15 18:10:28 +08:00
Concedo
9705b7b79c
Increase API version (+1 squashed commit)
Squashed commits:
[c168c08] Added stop sequences functionality for API calls
2023-04-15 18:09:53 +08:00
Concedo
d22423e4be
increase API version
2023-04-15 18:09:29 +08:00
Concedo
c168c08245
Added stop sequences functionality for API calls
2023-04-15 18:00:11 +08:00
somebody
ea8df4c0d3
Merge branch 'united' of https://github.com/henk717/KoboldAI into model-structure-and-maybe-rwkv
2023-04-14 20:38:56 -05:00
somebody
38c53191d3
Possible fix for cache download issue
2023-04-14 20:25:03 -05:00
Henk
bde9c6980f
Transformers 4.28 support
2023-04-14 14:13:46 +02:00
henk717
03ab4b25af
Merge branch 'KoboldAI:main' into united
2023-04-14 13:59:55 +02:00
biscober
35f908e147
Update install_requirements.bat (#7)
* Update install_requirements.bat
Move the command that dismounts the temp B drive to after the pip install command, which requires the B drive to still be mounted
* Update install_requirements.bat
cmd /k not necessary
* Update install_requirements.bat
add quotes (probably not required but w/e)
2023-04-11 04:37:48 +02:00
0cc4m
687d107d20
Update README, remove steps that are no longer required
2023-04-10 22:46:12 +02:00
0cc4m
b628aec719
Automatic installation of the quant_cuda module during install_requirements
Kepler (K40+) and Maxwell support
2023-04-10 22:37:16 +02:00
henk717
2385a34098
Merge pull request #325 from YellowRoseCx/patch-1
Add IP Whitelisting to --host
2023-04-10 14:08:09 +02:00
somebody
334c09606b
Fix for tokenizer stuff on pythia
2023-04-09 18:23:58 -05:00
somebody
3e8e3a18b0
Fix for custom gpt2
2023-04-09 18:23:52 -05:00
somebody
f73a8bb808
Merge branch 'united' of https://github.com/henk717/KoboldAI into model-structure-and-maybe-rwkv
2023-04-09 14:38:09 -05:00
somebody
fedbffd07b
Small fixes
Typos galore!
2023-04-09 13:35:28 -05:00
0cc4m
7efd314428
Improve guide
2023-04-07 20:10:24 +02:00
0cc4m
ccf34a5edc
Fix merge issues with upstream, merge changes
2023-04-07 19:51:07 +02:00
0cc4m
636c4e5a52
Update gptq repo
2023-04-07 11:48:57 +02:00
YellowRoseCx
ac98cd6dd1
add IP_whitelisting to koboldai_settings.py
2023-04-05 21:27:59 -05:00
YellowRoseCx
71e5d23a5b
Add IP whitelisting to --host
2023-04-05 21:23:24 -05:00
0cc4m
40092cc9fa
Improve guide formatting
2023-04-05 21:49:13 +02:00
0cc4m
8b4375307c
Update file formatting section in guide
2023-04-05 21:10:40 +02:00
0cc4m
e4f8a9344c
Merge pull request #1 from Digitous/patch-1
Add install instructions
2023-04-05 21:08:14 +02:00
Henk
80e4b9e536
Merge branch 'main' into united
2023-04-05 00:22:30 +02:00
henk717
29c2d4b7a6
Removing Pygmalion from the TPU colab to get it unbanned
2023-04-04 19:51:18 +02:00
henk717
fd12214091
Clean the description of the GPU colab
2023-04-04 19:40:22 +02:00
henk717
bb51127bbf
We no longer support Pygmalion on Colab due to Google's Pygmalion ban
2023-04-04 19:37:15 +02:00
Henk
4b71da1714
Horde settings in the UI
2023-04-04 17:20:43 +02:00
0cc4m
ce6761e744
Fix issue causing "expected scalar type Float but found Half" RuntimeErrors
2023-04-04 07:46:53 +02:00
Henk
8bf533da9a
Pin Accelerate Version
2023-04-04 01:47:59 +02:00
somebody
8412f83ce5
Breakmodel: Fix typo
2023-04-03 18:41:18 -05:00
0cc4m
b9df9b6f59
Improve CPU offloading speed significantly when offloading less than half of the layers
2023-04-03 20:27:17 +02:00
0cc4m
5abdecad2c
Merge pull request #5 from 0cc4m/cpu-offload-1
CPU Offloading Support
2023-04-03 06:52:48 +02:00