387 Commits

Author SHA1 Message Date
henk717
57344935f6
--model without breakmodel disables bmsupported
Last commit it only did a warning, now it will turn bmsupported off so that the GPU routine is used.
2022-01-30 17:16:35 +01:00
henk717
f0c0a990ea NoBreakmodel variable
Adds a Nobreakmodel var that allows Breakmodel to be turned off. This can be done trough commandline or a model config (In case Neo is used by the models config without it being a true Neo model that is compatible with breakmodel).

In addition I removed the args.colab check for breakmodel support and instead make args.colab activate nobreakmodel. And I have added a new check so that breakmodel is not even attempted if you do not specify the layers but do launch a model from the command line.
2022-01-30 17:06:15 +01:00
henk717
5b5a479f29 Threading + Memory Sizes
Polish effort to suppress a warning and list more accurate VRAM as tested with the full 2048 max tokens.
2022-01-30 13:56:25 +01:00
henk717
fca7f8659f Badwords unification
TPU's no longer use hardcoded badwords but instead use the var
2022-01-29 18:09:53 +01:00
henk717
f9f25c01e4 HTML escape the last commit
</s> didn't work, needed to be HTML escaped (Thanks for the tip VE!)
2022-01-28 19:21:05 +01:00
henk717
be0e57185f Improved Model Support
Changed the model VRAM requirements to what you'd need to comfortably run the model rather than barely (Like with the manual). Will probably revise this in a later commit.

More importantly, it now supports models that use </s> which will be required to support XGLM and Fairseq models.
2022-01-28 18:03:30 +01:00
henk717
987e78f980 More loading fixes
My last attempt at fixing this caused GPT2 to break, since the other fix is an edge case we assume that the GPT2 method should be used, and if that fails we try the other one to catch rare errors with bad model config's.
2022-01-25 06:39:23 +01:00
Gnome Ann
3f18888eec Repetition penalty slope and range 2022-01-24 15:30:38 -05:00
henk717
0846d57368 0.17 polish 2022-01-23 01:05:09 +01:00
henk717
c9999b6388
Merge pull request #70 from VE-FORBRYDERNE/patch
Don't throw an error in `update_story_chunk` if you try to edit a nonexistent chunk
2022-01-22 23:24:34 +01:00
henk717
4e7440804c
Merge pull request #69 from VE-FORBRYDERNE/lua
Lua compatibility enhancements
2022-01-22 23:23:47 +01:00
henk717
f79db7059a Fall back to old json load
Turns out model_config does not work on models that have no model_type defined. In case this happens we now fall back to the old .json loading method. This will not work in --colab mode if its not already a local model, but since almost all modern models define a model type and to my knowledge all models on huggingface do that should not be an issue. If it is we can always ask the model creator to either update it, distribute the model differently or load that model with --remote instead of --colab.
2022-01-22 23:21:19 +01:00
Gnome Ann
bf2b02d366 Don't error in update_story_chunk if chunk index doesn't exist 2022-01-21 21:19:32 -05:00
Gnome Ann
fab0913270 Call setgamesaved(False) in update_story_chunk and remove_story_chunk 2022-01-21 16:39:51 -05:00
Gnome Ann
72a7aac2c7 Sync memory properly after random game request 2022-01-20 15:14:55 -05:00
ebolam
dffd00265b Added autosave feature. When action is submitted it will save if the save setting is on and if the filename is set. 2022-01-20 07:46:34 -05:00
henk717
9532b56cb8 Universal Model Settings
No longer depends on a local config file enabling the configuration to work in --colab mode.
2022-01-20 10:11:11 +01:00
Gnome Ann
c703729f0b Set eventlet threadpool size back to 1 2022-01-20 02:10:57 -05:00
Gnome Ann
f0c39c004a Deleting world info entries should call setgamesaved(False) 2022-01-18 19:36:20 -05:00
henk717
4ca06ebcf3
Merge pull request #65 from VE-FORBRYDERNE/sp
Show author and SP length in soft prompt menu
2022-01-18 23:51:02 +01:00
henk717
1e0f9ada08
Add adventure 2.7B
Its on Huggingface now, so lets add it to the menu!
2022-01-18 23:50:21 +01:00
Gnome Ann
3018322963 Detect and show properly when story is unsaved 2022-01-18 17:20:45 -05:00
Gnome Ann
1951ccd2ce Show author and SP length in soft prompt menu 2022-01-18 16:30:09 -05:00
Gnome Ann
4da1a2d247 Prevent tokenizer from taking extra time the first time it's used 2022-01-17 22:55:25 -05:00
Gnome Ann
703c092577 Fix settings callback, and genout.shape[-1] in tpumtjgenerate() 2022-01-17 14:52:29 -05:00
Gnome Ann
3ba0e3f9d9 Dynamic TPU backend should support dynamic warpers and abort button 2022-01-17 14:10:32 -05:00
Gnome Ann
6502af086f Use vars._actions in tpumtjgenerate and its callbacks 2022-01-17 13:24:11 -05:00
Gnome Ann
45bfde8d5d generated_cols needs to be set properly by TPU static backend 2022-01-17 13:19:57 -05:00
Gnome Ann
9594b2db1c Fix soft prompt length calculation in calcsubmitbudget()
In TPU instances, `vars.sp.shape[0]` is not always the actual number of
tokens in the soft prompt. We have to use `vars.sp_length` to get an
accurate token count.
2022-01-17 13:17:20 -05:00
Gnome Ann
74f79081d1 Use vars.model_type to check for GPT-2 models 2022-01-17 13:13:54 -05:00
Gnome Ann
54a587d6a3 Show confirmation dialog when navigating away from UI window 2022-01-17 12:11:06 -05:00
Gnome Ann
1627afa8c5 Merge branch 'united' into patch 2022-01-17 10:44:34 -05:00
Gnome Ann
33f9f2dc82 Show message when TPU backend is compiling 2022-01-16 21:09:10 -05:00
Gnome Ann
03b16ed920 Merge branch 'united' into patch 2022-01-16 00:36:55 -05:00
Gnome Ann
4f0c8b6552 Merge branch 'united' into xmap 2022-01-15 23:32:12 -05:00
Gnome Ann
f4eb896a69 Use original TPU backend if possible 2022-01-15 23:31:07 -05:00
henk717
9802d041aa Colab Optimizations
Breakmodel is useless on Colab, so for the sake of efficiency if --colab is present we will always assume a model is incompatible. The same applies to the conversion, colab's are discarded so converting the model to a .bin file only wastes time since the HDD isn't fast. Finally we automatically set all the useful variables for Colab, so that in the future this can be removed from ckds and other scripts.

Lastly ckds has been adapted not to copy the examples folder and to add the new --colab parameter.

Local players are much better off running the old --remote command.
2022-01-16 00:56:03 +01:00
henk717
9bcc24c07e
Merge pull request #58 from VE-FORBRYDERNE/xmap
Dynamic TPU backend xmaps
2022-01-15 16:20:58 +01:00
Gnome Ann
877fa39b8a Change TPU regeneration indicator message 2022-01-14 23:21:27 -05:00
Gnome Ann
bdfde33e8a Add an indicator for when dynamic WI scan is triggered in TPU Colabs 2022-01-14 23:13:55 -05:00
Gnome Ann
e0fdce2cc6 Fix TPU generation modifier 2022-01-14 23:00:06 -05:00
Gnome Ann
932c393d6a Add TPU support for dynamic WI scan and generation modifiers 2022-01-14 21:39:02 -05:00
Gnome Ann
cf9a4b7e6b Fix typos in error messages 2022-01-13 22:33:55 -05:00
henk717
53b91c6406 Small changes 2022-01-14 02:03:46 +01:00
Gnome Ann
a3d6dc93e8 xmaps for moving things onto TPU 2022-01-12 21:45:30 -05:00
Gnome Ann
f0b5cc137f Merge branch 'united' into patch 2022-01-12 19:50:01 -05:00
henk717
49e2bcab1a Allow unique chatnames in multiplayer
No longer update the chatname outside of the config, this will not effect singleplayer tab at all, but it will allow people in multiplayer to chat with their own names.
2022-01-11 21:31:44 +01:00
henk717
3f88b4f840 Server clarification
To prevent confusion with users who have not used KoboldAI for a while, or who are following old tutorials I have added a disclaimer that informs people that most Colab links should not be used with this feature and instead opened in the browser.
2022-01-11 00:35:20 +01:00
henk717
d2947bd1cc Small model description update 2022-01-11 00:29:35 +01:00
Gnome Ann
43586c8f60 Fix some of the logic for generation aborting 2022-01-10 17:09:47 -05:00