428 Commits

Author SHA1 Message Date
ebolam
633152ee84 Fixed Retry bug due to redo/pin code 2022-02-10 10:01:07 -05:00
ebolam
586b989582 Redo bug fix 2022-02-06 18:53:24 -05:00
ebolam
98609a8abc Merge branch 'united' of https://github.com/ebolam/KoboldAI into united 2022-02-06 13:48:34 -05:00
ebolam
80ae054cb5
Merge branch 'henk717:united' into united 2022-02-06 13:42:59 -05:00
ebolam
9e17ea9636 Fixed model downloading problem where models were downloaded multiple times 2022-02-06 13:42:46 -05:00
henk717
c38108d818
Merge pull request #73 from VE-FORBRYDERNE/xglm-breakmodel
Breakmodel support for the fairseq models
2022-02-06 18:05:59 +01:00
ebolam
02c7ca3e84
Merge branch 'henk717:united' into united 2022-02-03 08:11:06 -05:00
ebolam
0684a221cd Changed pin icon for re-dos to be a circular arrow that is not clickable to make it clear it is a redo action and cannot be cleared. 2022-02-03 08:08:43 -05:00
henk717
3ee63b28c5 Defaults and Downloads
Default settings for the new repetition penalty settings (Better suggestions very much welcome since broader community testing has not been done).

Updated the Readme with the link to the offline installer.
2022-02-03 13:13:26 +01:00
Gnome Ann
4904af6adc Fix a mistake in the previous commit 2022-02-02 23:04:59 -05:00
Gnome Ann
78f52063c7 Fix XGLM soft prompts 2022-02-02 22:45:16 -05:00
Ben Fox
e2d2ebcae6 upstream merge 2022-02-02 15:04:59 -05:00
Gnome Ann
d847d04605 Fix some typos in XGLM breakmodel 2022-02-01 16:00:46 -05:00
Gnome Ann
8e1169ea61 Enable vars.bmsupported when using XGLM 2022-02-01 15:31:59 -05:00
Gnome Ann
e7f65cee09 XGLM breakmodel 2022-02-01 13:04:35 -05:00
henk717
c14e6fe5d2 Revert parralism
Testing is done, seems to cause issues in the order things happen with the interface.
2022-02-01 18:58:48 +01:00
henk717
d68a91ecd3 Save model values
Without saving these they get lost after someone saves. So saving them is more important than the model being able to override them after the fact.
2022-02-01 18:37:52 +01:00
henk717
b8e08cdd63 Enable Tokenizer Parralism
Has proven to be safe in my internal testing and does help with the interface lag at boot.

Enabling this so it can get wider testing.
2022-02-01 12:00:53 +01:00
henk717
ecd7b328ec Further Polishing
Multiple smaller changes to get 1.17 in shape for its release.
2022-02-01 11:15:44 +01:00
henk717
36b6dcb641 Increase newlinemode compatibility
Ran into issues with other modes like chatmode and adventure, moved it further down the pipeline and converting </s> back to \n before processing additional formatting.

Still has an issue with the html formatting not working, but at least the AI works now.
2022-01-31 19:39:32 +01:00
henk717
90fd67fd16 Update aiserver.py 2022-01-31 19:06:02 +01:00
henk717
b69e3f86e1 Update aiserver.py
Removes a debug line
2022-01-31 18:57:47 +01:00
henk717
8466068267 Don't save newlinemode
On second thought, it is probably better to not save this. Advanced users can add this themselves and that way newer versions of the model can override it if redownloaded.
2022-01-31 18:41:23 +01:00
henk717
729be62821 </s> new line mode
Needed for Fairseq and XGLM models that do not understand the regular \n .
2022-01-31 18:39:34 +01:00
henk717
03433810f1 KML improvements
Don't parse > since that has a different meaning for us, also whitelisting a few more markdown tags so lists work.
2022-01-30 20:07:47 +01:00
henk717
a484244392 Welcome Message API
Allows model creators to customize the welcome message using Markdown and Limited HTML

Existing United users need to run install_requirements..bat again, you can leave the existing dependencies intact.
2022-01-30 19:47:30 +01:00
henk717
ddfa21e6dd Breakmodel Fixes
Multiple old references and one mistake in my last commit fixed
2022-01-30 17:40:43 +01:00
henk717
57344935f6
--model without breakmodel disables bmsupported
Last commit it only did a warning, now it will turn bmsupported off so that the GPU routine is used.
2022-01-30 17:16:35 +01:00
henk717
f0c0a990ea NoBreakmodel variable
Adds a Nobreakmodel var that allows Breakmodel to be turned off. This can be done trough commandline or a model config (In case Neo is used by the models config without it being a true Neo model that is compatible with breakmodel).

In addition I removed the args.colab check for breakmodel support and instead make args.colab activate nobreakmodel. And I have added a new check so that breakmodel is not even attempted if you do not specify the layers but do launch a model from the command line.
2022-01-30 17:06:15 +01:00
henk717
5b5a479f29 Threading + Memory Sizes
Polish effort to suppress a warning and list more accurate VRAM as tested with the full 2048 max tokens.
2022-01-30 13:56:25 +01:00
henk717
fca7f8659f Badwords unification
TPU's no longer use hardcoded badwords but instead use the var
2022-01-29 18:09:53 +01:00
henk717
f9f25c01e4 HTML escape the last commit
</s> didn't work, needed to be HTML escaped (Thanks for the tip VE!)
2022-01-28 19:21:05 +01:00
henk717
be0e57185f Improved Model Support
Changed the model VRAM requirements to what you'd need to comfortably run the model rather than barely (Like with the manual). Will probably revise this in a later commit.

More importantly, it now supports models that use </s> which will be required to support XGLM and Fairseq models.
2022-01-28 18:03:30 +01:00
ebolam
1470b1666d Fixed single gen redo 2022-01-27 20:17:13 -05:00
ebolam
2278b7c103 Changed behavior of redo if there is only 1 option to just select it 2022-01-26 21:07:55 -05:00
ebolam
06bbe429d9 Bug fix for redo/pinning persisting over new game requests 2022-01-26 21:02:36 -05:00
ebolam
b0f1bdf2fd
Merge branch 'henk717:united' into united 2022-01-26 11:27:12 -05:00
henk717
987e78f980 More loading fixes
My last attempt at fixing this caused GPT2 to break, since the other fix is an edge case we assume that the GPT2 method should be used, and if that fails we try the other one to catch rare errors with bad model config's.
2022-01-25 06:39:23 +01:00
Gnome Ann
3f18888eec Repetition penalty slope and range 2022-01-24 15:30:38 -05:00
ebolam
bd0732fbd6 Fix for redo with options.
Added debug menu
2022-01-24 12:54:44 -05:00
ebolam
47ec22873d bug-fix if settings directory is a symlink. 2022-01-22 21:43:32 -05:00
ebolam
f54f46b068 bugfix for metadata saving 2022-01-22 20:30:14 -05:00
henk717
0846d57368 0.17 polish 2022-01-23 01:05:09 +01:00
ebolam
bdd358f40f Merge branch 'united' of https://github.com/henk717/KoboldAI into henk717-united 2022-01-22 17:57:33 -05:00
henk717
c9999b6388
Merge pull request #70 from VE-FORBRYDERNE/patch
Don't throw an error in `update_story_chunk` if you try to edit a nonexistent chunk
2022-01-22 23:24:34 +01:00
henk717
4e7440804c
Merge pull request #69 from VE-FORBRYDERNE/lua
Lua compatibility enhancements
2022-01-22 23:23:47 +01:00
henk717
f79db7059a Fall back to old json load
Turns out model_config does not work on models that have no model_type defined. In case this happens we now fall back to the old .json loading method. This will not work in --colab mode if its not already a local model, but since almost all modern models define a model type and to my knowledge all models on huggingface do that should not be an issue. If it is we can always ask the model creator to either update it, distribute the model differently or load that model with --remote instead of --colab.
2022-01-22 23:21:19 +01:00
ebolam
9df758c1f4 added quiet option to suppress any story text from showing in the console (reduce logs when running in a docker container) 2022-01-22 15:30:56 -05:00
ebolam
12e7b6d10b Added --share command line parameter so we can set host=0.0.0.0 on local instances without editing code
moved save location of downloaded models to models/XXXXXX so we can more easily set this as a volume in docker
2022-01-22 14:47:28 -05:00
Gnome Ann
bf2b02d366 Don't error in update_story_chunk if chunk index doesn't exist 2022-01-21 21:19:32 -05:00