Llama
812df5ea56
Merge pull request #65 from pi6am/feat/exllama-badwords
Add the eos token to exllama bad words.
2023-08-27 17:03:25 -07:00
Llama
08ff7c138c
Add the eos token to exllama bad words.
The bos token was already hardcoded as a bad word id.
Store badwords in a list and iterate over them during generation.
Add the Llama eos token to the list of bad words.
Also support "single line mode", which adds the newline token (id 13) to badwords.
2023-08-27 16:34:52 -07:00
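A minimal sketch of the bad-words approach this commit describes: keep the banned token ids in a list and mask each one before sampling. This is not the exllama backend's actual code. The names `build_badwords` and `mask_badwords` are hypothetical, and the bos/eos ids (1 and 2) are assumptions about the Llama tokenizer; only the newline id (13) comes from the commit message.

```python
import math

BOS_ID = 1       # assumed Llama bos token id (not stated in the commit)
EOS_ID = 2       # assumed Llama eos token id (not stated in the commit)
NEWLINE_ID = 13  # newline token id, as given in the commit message


def build_badwords(single_line=False):
    """Return the list of token ids that must never be generated."""
    badwords = [BOS_ID, EOS_ID]
    if single_line:
        # Single line mode: also forbid the newline token.
        badwords.append(NEWLINE_ID)
    return badwords


def mask_badwords(logits, badwords):
    """Return a copy of logits with every bad word id set to -inf."""
    masked = list(logits)
    for token_id in badwords:
        masked[token_id] = -math.inf
    return masked
```

Setting a logit to negative infinity gives that token zero probability after softmax, so iterating over the list at each generation step is enough to suppress every listed token.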
Henk
3e0b8279f2
Rename GPTQ loading
2023-08-27 20:51:14 +02:00
Llama
0d150e412e
Merge pull request #64 from pi6am/fix/multinomial-workaround
Resample to work around a bug in torch.multinomial
2023-08-26 22:42:21 -07:00
Llama
b7e38b4757
Resample to work around a bug in torch.multinomial
There is a bug in PyTorch 2.0.1 that allows torch.multinomial to
sometimes choose elements that have zero probability. Since
this is uncommon we can continue to use torch.multinomial as
long as we verify that the results are valid. If they aren't,
try again until the probability of each selected token is positive.
2023-08-26 22:26:26 -07:00
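The resampling workaround above can be sketched as a validate-and-retry loop. This is a library-free illustration, not the project's code: `sample_with_resample` and `reference_sampler` are hypothetical names, the sampler is passed in as a parameter (in the real code it would be `torch.multinomial`), and the `max_tries` cap is an added assumption, since the commit message only says to try again until the result is valid.

```python
import random


def reference_sampler(probs, num_samples):
    """Stand-in sampler: draw indices in proportion to their weights."""
    return random.choices(range(len(probs)), weights=probs, k=num_samples)


def sample_with_resample(probs, num_samples, sampler=reference_sampler,
                         max_tries=100):
    """Sample indices, retrying while any pick has zero probability.

    max_tries is an assumption added here to guarantee termination.
    """
    for _ in range(max_tries):
        picks = sampler(probs, num_samples)
        # Accept the draw only if every selected index has positive probability.
        if all(probs[i] > 0 for i in picks):
            return picks
    raise RuntimeError("sampler kept returning zero-probability indices")
```

Because invalid draws are rare, the check usually passes on the first attempt, so the extra cost is one comparison per sampled token.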
Henk
290f2ce05e
CPU only warning
2023-08-26 00:03:28 +02:00
db0
4b2d591354
avoid conflicting sys args
2023-08-25 15:05:36 +02:00
Henk
f40236c04a
Modern llama tokenizer
2023-08-25 14:27:44 +02:00
Henk
2887467eec
Safetensors 0.3.3
2023-08-24 14:30:44 +02:00
henk717
d86f61151b
Working revision support
2023-08-23 22:07:37 +02:00
Henk
39c1b39b4a
Fix markers
2023-08-23 21:42:06 +02:00
Henk
5d9f180489
Fix typo
2023-08-23 21:36:26 +02:00
Henk
85810cd3fd
AutoGPTQ for Colab
2023-08-23 21:30:58 +02:00
Henk
c20ea949d7
Fix duplicate safetensors
2023-08-23 21:02:11 +02:00
Henk
91155ed2f3
HF dependencies
2023-08-23 20:34:40 +02:00
Llama
b1895de518
Merge pull request #63 from pi6am/feat/exllama-stoppers
Add stopper hooks support to exllama
2023-08-22 23:14:00 -07:00
Llama
b96d5d8646
Add stopper hooks support to exllama
2023-08-22 23:06:16 -07:00
Henk
f66173f2a0
Git gonna git
2023-08-22 20:43:44 +02:00
Henk
2b6dcbe55e
New Horde Worker
2023-08-22 20:40:04 +02:00
Henk
3f438fda53
Scribe name instead of worker name
2023-08-22 18:56:23 +02:00
Henk
e5aca6fdad
Cleaned horde
2023-08-22 18:43:29 +02:00
Henk
69c794506b
HF 4.32
2023-08-22 17:48:00 +02:00
Henk
4b482a0619
Pending trick
2023-08-22 14:58:44 +02:00
Henk
b41bf99b55
Cleanup
2023-08-22 14:00:05 +02:00
Henk
179c4ad07f
Restore UI
2023-08-22 13:58:02 +02:00
Henk
f570787077
Allow worker to stop
2023-08-22 13:38:28 +02:00
0cc4m
22fd49937a
Merge pull request #62 from pi6am/fix/exllama-eos-space
Strip the eos token from exllama generations.
2023-08-22 08:06:02 +02:00
Henk
401cc1609a
Kaiemb branch
2023-08-21 19:05:35 +02:00
db0
148a7c21b8
using stop()
2023-08-21 19:02:15 +02:00
Henk
d9815d4b1f
New worker fixes
2023-08-21 17:52:21 +02:00
Henk
8abb5746f8
Add bridge back
2023-08-21 16:50:17 +02:00
Henk
7b8fba31f7
Git is stubborn
2023-08-21 16:46:02 +02:00
Henk
be8f527911
Horde URL fixes
2023-08-21 16:44:58 +02:00
Henk
a7251fa599
Bridge settings
2023-08-21 16:44:09 +02:00
Henk
e2d56db195
Fix bridge reference
2023-08-21 16:27:53 +02:00
db0
a655f8f066
adjust for stop mechanism
2023-08-21 15:56:27 +02:00
db0
45661ddc75
switch to AI Horde Worker
2023-08-21 15:52:17 +02:00
Henk
955db1567e
Keep the usual temp folder instead of ours
2023-08-21 14:29:37 +02:00
Henk
57e5f51d63
AutoGPTQ for Colab
2023-08-21 14:08:14 +02:00
Henk
5917737676
Don't disable exllama
2023-08-21 13:17:30 +02:00
Henk
8daa2f1adc
Update Optimum on Git HF
2023-08-21 02:01:34 +02:00
Henk
3dd0e91fbb
Preliminary HF GPTQ changes
2023-08-21 01:58:52 +02:00
Llama
070cfd339a
Strip the eos token from exllama generations.
The end-of-sequence (</s>) token indicates the end of a generation.
When a token sequence containing </s> is decoded, an extra (wrong)
space is inserted at the beginning of the generation. To avoid this,
strip the eos token out of the result before returning it.
The eos token was getting stripped later, so this doesn't change
the output except to avoid the spurious leading space.
2023-08-19 17:40:23 -07:00
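A minimal sketch of the fix described in this commit: drop the eos token id from the generated ids before decoding, so the decoder never sees `</s>`. The helper names are hypothetical, the eos id of 2 is an assumption about the Llama tokenizer, and `decode` stands in for whatever tokenizer decode function the backend uses.

```python
EOS_ID = 2  # assumed Llama eos token id (not stated in the commit)


def strip_eos(token_ids, eos_id=EOS_ID):
    """Return the token ids with every eos token removed."""
    return [t for t in token_ids if t != eos_id]


def decode_generation(token_ids, decode, eos_id=EOS_ID):
    """Decode a generation after stripping eos.

    `decode` is a caller-supplied function mapping token ids to text.
    """
    return decode(strip_eos(token_ids, eos_id))
```

Filtering at the token level, rather than trimming the decoded string, avoids having to guess whether a leading space was spurious or part of the generation.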
Henk
6f557befa9
GPTQ --revision support
2023-08-19 15:17:29 +02:00
Henk
d93631c889
GPTQ improvements
2023-08-19 14:45:45 +02:00
Henk
13b68c67d1
Basic GPTQ Downloader
2023-08-19 13:02:50 +02:00
henk717
029e8736c0
Merge pull request #438 from one-some/another-api-fix
More api fixes
2023-08-19 02:43:38 +02:00
somebody
45486a47b0
WI: Fix UID keys being str
...again
2023-08-18 19:27:02 -05:00
Henk
80e784d3ea
Polish
2023-08-19 01:39:31 +02:00
Henk
5ae64354ee
NSFW menu updates
2023-08-19 01:20:09 +02:00