Commit Graph

4611 Commits

Author SHA1 Message Date
db0
a655f8f066 adjust for stop mechanism 2023-08-21 15:56:27 +02:00
db0
45661ddc75 switch to AI Horde Worker 2023-08-21 15:52:17 +02:00
Henk
955db1567e Keep the usual temp folder instead of ours 2023-08-21 14:29:37 +02:00
Henk
57e5f51d63 AutoGPTQ for Colab 2023-08-21 14:08:14 +02:00
Henk
5917737676 Don't disable exllama 2023-08-21 13:17:30 +02:00
Henk
8daa2f1adc Update Optimum on Git HF 2023-08-21 02:01:34 +02:00
Henk
3dd0e91fbb Preliminary HF GPTQ changes 2023-08-21 01:58:52 +02:00
Llama
070cfd339a Strip the eos token from exllama generations.
The end-of-sequence (</s>) token indicates the end of a generation.
When a token sequence containing </s> is decoded, an extra (wrong)
space is inserted at the beginning of the generation. To avoid this,
strip the eos token out of the result before returning it.
The eos token was getting stripped later, so this doesn't change
the output except to avoid the spurious leading space.
2023-08-19 17:40:23 -07:00
Henk
6f557befa9 GPTQ --revision support 2023-08-19 15:17:29 +02:00
Henk
d93631c889 GPTQ improvements 2023-08-19 14:45:45 +02:00
Henk
13b68c67d1 Basic GPTQ Downloader 2023-08-19 13:02:50 +02:00
henk717
029e8736c0 Merge pull request #438 from one-some/another-api-fix
More api fixes
2023-08-19 02:43:38 +02:00
somebody
45486a47b0 WI: Fix UID keys being str
...again
2023-08-18 19:27:02 -05:00
Henk
80e784d3ea Polish 2023-08-19 01:39:31 +02:00
Henk
5ae64354ee NSFW menu updates 2023-08-19 01:20:09 +02:00
Henk
f8987cb2f0 Adventure ram update 2023-08-19 00:21:22 +02:00
Henk
79bc1d6610 First instruct batch 2023-08-18 23:15:21 +02:00
Henk
ff999657d2 Chat List Update 2023-08-18 22:42:15 +02:00
Llama
dda5acd5d5 Merge pull request #33 from henk717/united
Merge united.
2023-08-18 13:19:24 -07:00
Henk
87934ee393 AutoGPTQ Exllama compile 2023-08-18 21:49:05 +02:00
henk717
3f28503b87 Update huggingface.yml 2023-08-15 22:19:18 +02:00
somebody
213d7a55d4 Fixup 2023-08-14 13:30:22 -05:00
somebody
19029939c2 API: somewhat-thoroughly automatically test WI api 2023-08-14 02:04:49 -05:00
somebody
4bd04d02ab Try to fix wi 2023-08-14 01:36:27 -05:00
somebody
b9da974eb7 GenericHFTorch: Change use_4_bit to quantization in __init__ 2023-08-14 00:56:40 -05:00
henk717
1c65528dbf Fix BnB on Colab 2023-08-14 03:28:29 +02:00
Llama
d8d9890f46 Merge pull request #32 from henk717/united
Merge united
2023-08-13 14:10:35 -07:00
Henk
e90903946d AutoGPTQ updates 2023-08-13 17:36:17 +02:00
Henk
116a88b46c Better stable diffusion 2023-08-13 16:45:31 +02:00
henk717
89a805a0cc Merge pull request #411 from one-some/fixing-time
Fix most of the API
2023-08-13 13:43:13 +02:00
henk717
dae9a6eb5a Merge branch 'KoboldAI:main' into united 2023-08-12 02:18:11 +02:00
henk717
ee93fe6e4a Add model cleaner 2023-08-11 22:39:49 +02:00
Henk
1e87c05e68 Fix discord link 2023-08-11 17:36:41 +02:00
henk717
a9dbe2837e Merge branch 'KoboldAI:main' into united 2023-08-11 00:36:09 +02:00
henk717
9cb93d6b4c Add some 13B's for easier beta testing 2023-08-10 23:56:44 +02:00
henk717
2938a9993a Merge pull request #434 from one-some/united
UI: Change mobile aspect ratio threshold from 7/5 to 5/6
2023-08-10 19:57:56 +02:00
henk717
5f1be7c482 Merge pull request #435 from one-some/token-stream-newline-fix
UI: Fix token streaming gobbling trailing whitespace
2023-08-10 19:55:50 +02:00
Henk
2628726e1c Dont use exllama on fail 2023-08-10 19:34:08 +02:00
Henk
9c7ebe3b04 Better AutoGPTQ fallback 2023-08-10 18:10:48 +02:00
Henk
f2d7ef3aca AutoGPTQ breakmodel 2023-08-10 17:41:31 +02:00
Henk
54addfc234 AutoGPTQ fallback 2023-08-10 17:18:53 +02:00
Henk
1b253ce95f 4-bit dependency fixes 2023-08-10 17:08:48 +02:00
Henk
6143071b27 Make settings folder early 2023-08-08 14:51:15 +02:00
somebody
9704c86aee Actually do pre-wrap instead
just pre makes long texts without whitespace not wrap
2023-08-07 21:13:01 -05:00
somebody
906d1f2522 Merge branch 'united' of https://github.com/henk717/KoboldAI into fixing-time 2023-08-07 16:22:04 -05:00
somebody
7f2085ffe8 UI: Fix token streaming gobbling trailing whitespace
which ended up being mostly newlines
2023-08-07 16:00:49 -05:00
somebody
1632f3c684 UI: Change mobile aspect ratio threshold from 7/5 to 5/6 2023-08-07 13:59:57 -05:00
Henk
824050471b Default to new UI 2023-08-07 20:03:09 +02:00
henk717
0f8cf0dc2c Merge pull request #433 from LostRuins/concedo_united
updated lite to v54
2023-08-07 18:37:50 +02:00
Concedo
06d6364b6b updated lite to v54 2023-08-07 23:43:27 +08:00
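
Note on commit 070cfd339a ("Strip the eos token from exllama generations"): the message describes removing the end-of-sequence token from the generated token ids before they are decoded, so that the decoder does not insert a spurious leading space. The following is only a minimal sketch of that idea, not the code from the commit. The function name `strip_eos`, the plain list-of-ints representation, and the eos id `2` used in the usage lines are all assumptions made for illustration.

```python
from typing import List


def strip_eos(token_ids: List[int], eos_token_id: int) -> List[int]:
    """Return a copy of token_ids with trailing eos tokens removed.

    Hypothetical helper: the real change lives in the exllama backend
    and may differ in how and where the token is stripped.
    """
    end = len(token_ids)
    # Walk backwards past any eos tokens at the end of the generation.
    while end > 0 and token_ids[end - 1] == eos_token_id:
        end -= 1
    # Slicing returns a new list, so the caller's list is left untouched.
    return token_ids[:end]


# Usage with made-up token ids; 2 stands in for the eos token id.
generated = [15043, 3186, 2]
cleaned = strip_eos(generated, eos_token_id=2)
```

The cleaned list would then be passed to the tokenizer's decode step in place of the raw generation. Per the commit message, the eos token was already being stripped later on, so the visible output changes only by losing the extra leading space.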