db0
a655f8f066
adjust for stop mechanism
2023-08-21 15:56:27 +02:00
db0
45661ddc75
switch to AI Horde Worker
2023-08-21 15:52:17 +02:00
Henk
955db1567e
Keep the usual temp folder instead of ours
2023-08-21 14:29:37 +02:00
Henk
57e5f51d63
AutoGPTQ for Colab
2023-08-21 14:08:14 +02:00
Henk
5917737676
Don't disable exllama
2023-08-21 13:17:30 +02:00
Henk
8daa2f1adc
Update Optimum on Git HF
2023-08-21 02:01:34 +02:00
Henk
3dd0e91fbb
Preliminary HF GPTQ changes
2023-08-21 01:58:52 +02:00
Llama
070cfd339a
Strip the eos token from exllama generations.
...
The end-of-sequence (</s>) token indicates the end of a generation.
When a token sequence containing </s> is decoded, an extra (wrong)
space is inserted at the beginning of the generation. To avoid this,
strip the eos token out of the result before returning it.
The eos token was getting stripped later, so this doesn't change
the output except to avoid the spurious leading space.
2023-08-19 17:40:23 -07:00
Henk
6f557befa9
GPTQ --revision support
2023-08-19 15:17:29 +02:00
Henk
d93631c889
GPTQ improvements
2023-08-19 14:45:45 +02:00
Henk
13b68c67d1
Basic GPTQ Downloader
2023-08-19 13:02:50 +02:00
henk717
029e8736c0
Merge pull request #438 from one-some/another-api-fix
...
More api fixes
2023-08-19 02:43:38 +02:00
somebody
45486a47b0
WI: Fix UID keys being str
...
...again
2023-08-18 19:27:02 -05:00
Henk
80e784d3ea
Polish
2023-08-19 01:39:31 +02:00
Henk
5ae64354ee
NSFW menu updates
2023-08-19 01:20:09 +02:00
Henk
f8987cb2f0
Adventure ram update
2023-08-19 00:21:22 +02:00
Henk
79bc1d6610
First instruct batch
2023-08-18 23:15:21 +02:00
Henk
ff999657d2
Chat List Update
2023-08-18 22:42:15 +02:00
Llama
dda5acd5d5
Merge pull request #33 from henk717/united
...
Merge united.
2023-08-18 13:19:24 -07:00
Henk
87934ee393
AutoGPTQ Exllama compile
2023-08-18 21:49:05 +02:00
henk717
3f28503b87
Update huggingface.yml
2023-08-15 22:19:18 +02:00
somebody
213d7a55d4
Fixup
2023-08-14 13:30:22 -05:00
somebody
19029939c2
API: somewhat-thoroughly automatically test WI api
2023-08-14 02:04:49 -05:00
somebody
4bd04d02ab
Try to fix wi
2023-08-14 01:36:27 -05:00
somebody
b9da974eb7
GenericHFTorch: Change use_4_bit to quantization in __init__
2023-08-14 00:56:40 -05:00
henk717
1c65528dbf
Fix BnB on Colab
2023-08-14 03:28:29 +02:00
Llama
d8d9890f46
Merge pull request #32 from henk717/united
...
Merge united
2023-08-13 14:10:35 -07:00
Henk
e90903946d
AutoGPTQ updates
2023-08-13 17:36:17 +02:00
Henk
116a88b46c
Better stable diffusion
2023-08-13 16:45:31 +02:00
henk717
89a805a0cc
Merge pull request #411 from one-some/fixing-time
...
Fix most of the API
2023-08-13 13:43:13 +02:00
henk717
dae9a6eb5a
Merge branch 'KoboldAI:main' into united
2023-08-12 02:18:11 +02:00
henk717
ee93fe6e4a
Add model cleaner
2023-08-11 22:39:49 +02:00
Henk
1e87c05e68
Fix discord link
2023-08-11 17:36:41 +02:00
henk717
a9dbe2837e
Merge branch 'KoboldAI:main' into united
2023-08-11 00:36:09 +02:00
henk717
9cb93d6b4c
Add some 13B's for easier beta testing
2023-08-10 23:56:44 +02:00
henk717
2938a9993a
Merge pull request #434 from one-some/united
...
UI: Change mobile aspect ratio threshold from 7/5 to 5/6
2023-08-10 19:57:56 +02:00
henk717
5f1be7c482
Merge pull request #435 from one-some/token-stream-newline-fix
...
UI: Fix token streaming gobbling trailing whitespace
2023-08-10 19:55:50 +02:00
Henk
2628726e1c
Dont use exllama on fail
2023-08-10 19:34:08 +02:00
Henk
9c7ebe3b04
Better AutoGPTQ fallback
2023-08-10 18:10:48 +02:00
Henk
f2d7ef3aca
AutoGPTQ breakmodel
2023-08-10 17:41:31 +02:00
Henk
54addfc234
AutoGPTQ fallback
2023-08-10 17:18:53 +02:00
Henk
1b253ce95f
4-bit dependency fixes
2023-08-10 17:08:48 +02:00
Henk
6143071b27
Make settings folder early
2023-08-08 14:51:15 +02:00
somebody
9704c86aee
Actually do pre-wrap instead
...
just pre makes long texts without whitespace not wrap
2023-08-07 21:13:01 -05:00
somebody
906d1f2522
Merge branch 'united' of https://github.com/henk717/KoboldAI into fixing-time
2023-08-07 16:22:04 -05:00
somebody
7f2085ffe8
UI: Fix token streaming gobbling trailing whitespace
...
which ended up being mostly newlines
2023-08-07 16:00:49 -05:00
somebody
1632f3c684
UI: Change mobile aspect ratio threshold from 7/5 to 5/6
2023-08-07 13:59:57 -05:00
Henk
824050471b
Default to new UI
2023-08-07 20:03:09 +02:00
henk717
0f8cf0dc2c
Merge pull request #433 from LostRuins/concedo_united
...
updated lite to v54
2023-08-07 18:37:50 +02:00
Concedo
06d6364b6b
updated lite to v54
2023-08-07 23:43:27 +08:00