somebody
23e54b6658
WI: Workaround for Chrome order weirdness
...
Chrome fires `blur()` before deleting nodes, meaning the -1 WI was
getting sent after being deleted, resulting in two
`delete_new_world_info_entry` packets being sent to the browser.
Really, it would be better to not do this full WI reset/sync cycle and
just send state changes and update accordingly. That would stop all the
WI weirdness probably.
2023-07-31 12:30:37 -05:00
somebody
8cc0a8cab9
WI: Fix UI1 WI errors
2023-07-30 14:25:09 -05:00
somebody
93233068cc
Merge branch 'united' of https://github.com/henk717/KoboldAI into wi-fixes
2023-07-30 13:52:20 -05:00
henk717
21d20854e4
Merge pull request #414 from one-some/submit-ctx-menu
...
Submit context menu
2023-07-30 01:58:44 +02:00
henk717
276efa66e6
Merge pull request #428 from one-some/do-not-add-bugs
...
Workaround for option panel popping up too much
2023-07-30 01:50:05 +02:00
henk717
e94b4b9bc2
Merge pull request #431 from one-some/united
...
Move pickle stuff into modeling/pickling.py
2023-07-29 18:58:20 +02:00
somebody
eb95c45ff1
Fix import
2023-07-28 16:03:55 -05:00
somebody
184c3d9302
Move pickle stuff into modeling/pickling.py
...
Ended up not moving to utils.py because most of the stuff in there
isn't really model related, and it feels messy to just throw whatever in
there. Originally the file was named "modeling/utils.py" and was going
to be a place for assorted model-related functions, but I think this is
better.
2023-07-28 15:38:29 -05:00
Henk
37babe1edd
Allow _rebuild_meta_tensor_no_storage
2023-07-28 15:04:25 +02:00
Henk
889fe8d548
Fix Peft
2023-07-26 19:35:55 +02:00
henk717
af5580638e
Update README_GPTQ.md
2023-07-26 16:16:39 +02:00
somebody
b20f320b22
Redo workaround
2023-07-25 22:46:02 -05:00
somebody
e2b3fa1ead
Merge branch 'united' of https://github.com/henk717/KoboldAI into submit-ctx-menu
2023-07-25 15:00:27 -05:00
Henk
79226ea66d
Hide TPU API during load
2023-07-25 21:51:03 +02:00
henk717
e799178436
Merge pull request #423 from one-some/fix-introduced-bugs
...
Fix streaming bugs
2023-07-25 18:58:16 +02:00
henk717
4497b09056
Merge pull request #424 from one-some/qptq-lazy
...
gptq is soooo lazy
2023-07-25 02:59:50 +02:00
somebody
c80de5120c
Cleanup
2023-07-24 19:45:33 -05:00
somebody
ad4528b5a6
critical change
2023-07-24 17:17:57 -05:00
somebody
a73420c49c
really really really sketchy breakmodel implementation
...
im gonna go lie down for an extended period of time
2023-07-24 17:15:59 -05:00
somebody
ec040620ec
Merge branch 'united' of https://github.com/henk717/KoboldAI into qptq-lazy
2023-07-24 13:26:16 -05:00
somebody
34aa333c44
Last debug
2023-07-24 13:11:06 -05:00
somebody
43a4abaf63
Remove even more debug
2023-07-24 13:10:33 -05:00
somebody
929917efe9
Remove shrieking
2023-07-24 13:09:43 -05:00
somebody
4a6cccb002
Import fix
2023-07-24 13:09:15 -05:00
somebody
a6aafb2525
GPTQ: Patch QuantLinear to not use CPU RAM
2023-07-24 13:07:30 -05:00
somebody
9cc6972c1c
Shh!
2023-07-24 11:30:33 -05:00
somebody
30640acca7
Editor: Don't allow editing or syncing during generation
...
Only bad things can come from that!
Also filter out stream buffer when fixing dirty game text (just in
case!)
2023-07-24 11:26:20 -05:00
somebody
fc7fa991d5
Streaming: Fix streaming not being cleaned up before commentator speaks
...
super duper critical
2023-07-24 10:57:24 -05:00
henk717
04a5e05692
Merge pull request #422 from one-some/fix-prioritization
...
Fix prioritization (probably)
2023-07-24 17:40:36 +02:00
somebody
2fb877db40
Backends: Probably fix sorting
2023-07-24 10:28:22 -05:00
somebody
81e4c8a807
Backends: Fix GPTQ priority
2023-07-24 10:25:44 -05:00
somebody
ba313883b6
Merge branch 'united' of https://github.com/henk717/KoboldAI into submit-ctx-menu
2023-07-24 10:09:38 -05:00
somebody
1df03d9a27
Basic
2023-07-23 20:54:04 -05:00
Henk
30495cf8d8
Fix GPT2
2023-07-24 02:05:07 +02:00
Henk
9fc9cb92f7
Fancy streaming by default
2023-07-24 01:39:16 +02:00
henk717
c60a29e544
Merge pull request #419 from one-some/streaming-fix-2
...
Rework single-gen streaming
2023-07-24 01:31:51 +02:00
Henk
70d2da55e5
Readme changes
2023-07-24 01:03:46 +02:00
somebody
1371150cbd
Merge branch 'united' of https://github.com/henk717/KoboldAI into wi-fixes
2023-07-23 18:00:28 -05:00
somebody
f4593ed04b
Streaming: Fix bad streamingwindow sync
...
sometimes the good solution is the best solution
2023-07-23 17:50:53 -05:00
somebody
8de610df8c
Streaming: Rework single-gen streaming
...
Now has its own packet and element seperate from actions. Fixes several
bugs related to desyncing. Also adds smooth typing effect
(fixes https://github.com/henk717/KoboldAI/issues/263 )
2023-07-23 17:32:52 -05:00
Henk
a963c97acb
Make 4-bit the default part 2
2023-07-24 00:06:20 +02:00
Henk
3409853dfc
Remove GPTQ for Colab
2023-07-23 23:26:35 +02:00
Henk
70dddf9fdc
Prioritize GPTQ
2023-07-23 23:22:02 +02:00
henk717
ae9ec38ae2
Merge pull request #418 from one-some/united
...
index prioritization
2023-07-23 23:09:42 +02:00
Henk
0f913275a9
4-bit as Default
2023-07-23 23:08:11 +02:00
somebody
3aa677ce11
Indexed prioritization
...
not a great way to get that data
2023-07-23 16:04:23 -05:00
Henk
89637ae9d7
GPTQ Requirements
2023-07-23 22:51:47 +02:00
henk717
1facc73b66
Merge pull request #367 from 0cc4m/4bit-plugin
...
GPTQ module
2023-07-23 22:32:20 +02:00
0cc4m
73953068c0
Remove exllama backend, pending further fixes
2023-07-23 22:12:31 +02:00
0cc4m
973aea12ea
Only import big python modules for GPTQ once they get used
2023-07-23 22:07:34 +02:00