509 Commits

Author SHA1 Message Date
Gnome Ann
bf4e7742ac Patch GPTJForCausalLM, if it exists, to support soft prompting 2021-10-28 17:18:28 -04:00
Gnome Ann
40b4631f6c Clamp input_ids in place
Apparently transformers maintains an internal reference to input_ids
(to use for repetition penalty) so we have to clamp the internal
version, too, because otherwise transformers will throw an out-of-bounds
error upon attempting to access token IDs that are not in the
vocabulary.
2021-10-28 16:52:39 -04:00
Gnome Ann
24d5d63c9f Use the correct generation min and max when using soft prompt 2021-10-28 16:39:59 -04:00
Gnome Ann
511817132a Don't change the shape of transformer.wte 2021-10-28 15:39:59 -04:00
Gnome Ann
a1ae11630a Make sure to cast vars.sp to the correct dtype 2021-10-28 13:22:07 -04:00
Gnome Ann
1556bd32a5 Use torch.where to inject the soft prompt instead of torch.cat 2021-10-28 13:20:14 -04:00
Gnome Ann
248e0bd24b Fix soft prompt loading code 2021-10-28 00:29:42 -04:00
Gnome Ann
4e3cc93020 Merge branch 'united' into sp 2021-10-23 11:45:03 -04:00
henk717
7b73d7cfdd Single Line Mode
Adds Single Line mode, optimized for things like chatbot testing and other cases where you want to have control over what happens after a paragraph.

This can also be used as a foundation for a chatbot optimized interface mode.
2021-10-23 17:30:48 +02:00
Gnome Ann
9e82ce34a6 HTML-escape strings in the soft prompt selection menu 2021-10-22 14:25:25 -04:00
Gnome Ann
1f449a9dda Soft prompt support (6B Colabs not supported yet) 2021-10-22 14:18:10 -04:00
Gnome Ann
3501f03153 Create settings directory if it doesn't exist when using InferKit/OAI 2021-10-21 23:33:32 -04:00
henk717
0f38dbc0ed Using VE's fork for now
Switching the official huggingface to VE's fork for the time being until some of these changes land upstream.
2021-10-19 11:33:49 +02:00
henk717
fa0f8af1d6
Merge branch 'KoboldAI:main' into united 2021-10-15 08:23:06 +02:00
henk717
9513240dfb
Version bump
Since VE fixed important things in the editor i want users to be able to see this easier
2021-10-15 08:22:32 +02:00
henk717
fd7f9b7edf
Merge pull request #76 from VE-FORBRYDERNE/editor
Fix enter key behaviour in the editor when not using Firefox
2021-10-15 08:21:35 +02:00
Gnome Ann
fdbe730a1f Fix an incorrect document.queryCommandSupported call 2021-10-13 12:51:31 -04:00
Gnome Ann
99d26c44e0 Handle CRLF newlines properly when pasting 2021-10-13 12:43:45 -04:00
Gnome Ann
718af6f7fa Pasting fallback for browsers with no execCommand support 2021-10-13 12:41:43 -04:00
Gnome Ann
407d8f7419 Also apply the enter patch to pasted text 2021-10-13 12:05:15 -04:00
Gnome Ann
3f5a3102a9 Change application.js version to avoid caching issues 2021-10-13 00:48:21 -04:00
Gnome Ann
aaa0c3374e Fix problems with stories that end in newlines
Today I learned that the editor only works properly when the last
<chunk> tag has a <br> inside it at the end.  This last <br> is
invisible and is automatically created by all major browsers when you
use the enter key to type a newline at the end of a story to "prevent
the element from collapsing".  When there's more than one <br> at the
end of the last <chunk>, only the last of those <br>s is invisible, so
if you have three <br>s, they are rendered as two newlines.  This only
applies to the last <chunk>, so if the second last <chunk> has three
<br>s at the end, they are still rendered as three newlines.  Since
the browser is really insistent on doing this, this commit mostly deals
with dynamically creating and deleting <br> tags at the ends of <chunk>
tags as needed to provide a consistent experience, and making sure
that all <br> tags actually go inside of <chunk> tags to prevent
breaking the editor.  The latter behaviour was exhibited by Chrome and
caused a bug when you added a newline at the end of your story using
the editor.
2021-10-13 00:42:03 -04:00
Gnome Ann
b3d33cc852 Fix enter key behaviour in the editor when not using Firefox 2021-10-12 00:09:02 -04:00
henk717
0b62ed0892
Merge branch 'KoboldAI:main' into united 2021-10-07 03:17:03 +02:00
henk717
436d492b42
Merge pull request #74 from AngryBeeSec/patch-1
Update docker-compose.yml
2021-10-07 03:16:10 +02:00
AngryBeeSec
18d30a1235
Update docker-compose.yml
Should fix GPU issues on Arch based systems.
2021-10-06 20:50:50 -04:00
henk717
c854a62549 Clarified GPU Layers
breakmodel_layers and layers is confusing, changed the new method to breakmodel_gpulayers. The old one should no longer be used by people, but since it works in reverse we leave it in so scripts don't break.
2021-10-06 18:55:01 +02:00
henk717
bd063f7590
Merge pull request #19 from VE-FORBRYDERNE/multi-gpu
Multiple GPU support
2021-10-06 18:50:58 +02:00
Gnome Ann
3649ba9fa4 Breakmodel's CUDA stream should be on primary device 2021-10-06 12:04:56 -04:00
henk717
82c7eaffb5
Merge branch 'KoboldAI:main' into united 2021-10-06 00:26:08 +02:00
henk717
8893916fef
Don't always submit prompt by default
Feedback from users is that its better to not always submit the prompt, this is consistent with the randomly generated stories. You can always toggle it on if you need this for coherency. This change does not override existing user settings.
2021-10-06 00:25:05 +02:00
Gnome Ann
aa59f8b4b2 Fix CPU layers not displaying correctly when using --layers 2021-10-05 11:29:47 -04:00
Gnome Ann
91352ea9f1 Change the command line flags for breakmodel 2021-10-05 11:22:09 -04:00
Gnome Ann
a1e4405aa6 Automatically use breakmodel instead of GPU-only where supported
There's really no reason to use GPU-only mode if breakmodel is supported
because breakmodel can run in GPU-only mode too.
2021-10-05 10:36:51 -04:00
Gnome Ann
fb90a7ed17 Change the help text for breakmodel to be more helpful 2021-10-05 10:31:28 -04:00
Gnome Ann
f9e6a6da17 Slightly increased performance in breakmodel mode
Commit a283d34b2731abfe7f5f1e939117491f0755cedb made breakmodel mode
slower. Performance has been restored to how it was before that commit.
2021-10-05 10:25:06 -04:00
Gnome Ann
231621e7c2 Use AutoModelForCausalLM for custom models with a model_type 2021-10-05 09:45:12 -04:00
Gnome Ann
18fa742b02 Merge branch 'united' into multi-gpu 2021-10-05 09:43:20 -04:00
Gnome Ann
a283d34b27 Multiple GPU support 2021-10-05 09:38:57 -04:00
henk717
6152d390c0
Merge branch 'KoboldAI:main' into united 2021-10-03 04:32:20 +02:00
Gnome Ann
0937bb33e7 Clarify licensing for breakmodel.py 2021-10-02 12:19:37 -04:00
henk717
6d30d30b39
Merge pull request #73 from VE-FORBRYDERNE/webkit-patch
Move using_webkit_patch to earlier in the code
2021-10-02 17:48:58 +02:00
Gnome Ann
a42b580027 Merge branch 'united' into multi-gpu 2021-10-02 11:44:26 -04:00
Gnome Ann
e69e15bec7 Move using_webkit_patch to earlier in the code
I don't know it this or help or not but I hope it does.
2021-09-30 22:58:54 -04:00
henk717
4451edc7a2
Merge branch 'KoboldAI:main' into united 2021-10-01 01:48:23 +02:00
henk717
a6d1546247
Merge pull request #72 from VE-FORBRYDERNE/webkit-patch
Safari support
2021-10-01 01:16:59 +02:00
Gnome Ann
d7893a0b52 Check for WebKit after connecting to the server
For some reason the original way only works in Safari after pressing the
refresh button. It did not work if you typed the URL into the address
bar in Safari without refreshing afterwards.
2021-09-30 19:00:15 -04:00
Gnome Ann
00c1f49ca1 Fix slow typing speed on mobile devices 2021-09-30 17:52:35 -04:00
Gnome Ann
a923b3bfa7 Safari compatibility
Only tested on macOS so far, not iOS.
2021-09-30 14:13:54 -04:00
henk717
e9ff779dc2
Merge branch 'KoboldAI:main' into united 2021-09-30 18:33:02 +02:00