Commit Graph

792 Commits

Author SHA1 Message Date
henk717 932aabc2f3
Merge pull request #103 from henk717/main
Modern ROCm Docker
2022-03-06 19:02:38 +01:00
henk717 4332074c89 Modern ROCm Docker
Brings the ROCm container up to a modern standard in line with the CUDA docker.
2022-03-06 19:01:25 +01:00
henk717 657de72ada
Merge: Better name formatting for chatmode 2022-03-04 18:24:39 +01:00
MrReplikant ff1be78f72
Merge pull request #1 from MrReplikant/MrReplikant-patch-1
Fixed unnecessary spacing in chatmode
2022-03-04 08:46:43 -06:00
MrReplikant ae143e896c
Fixed unnecessary spacing in chatmode
This makes it go from "john :" to "John:", as it's supposed to be. As simple as it is, it can easily throw a chatbot model for a loop.
2022-03-04 08:46:00 -06:00
henk717 749d4a1c48 Update Colab Descriptions (GPU) 2022-03-04 11:33:05 +01:00
henk717 fade5fdd60 Update model descriptions (TPU) 2022-03-04 11:31:03 +01:00
henk717 2aeb2c6607 Add Janeway 6B and Shinen 6B 2022-03-04 09:53:34 +01:00
henk717 f9ac23ba4e Add Janeway and Shinen 2022-03-02 09:51:25 +01:00
henk717 c8ece04b1d
Merge pull request #99 from VE-FORBRYDERNE/mutation-observer
Re-enable the editor mutation observer
2022-03-02 09:39:03 +01:00
Gnome Ann d8e99b12f1 Re-enable the editor mutation observer 2022-02-28 19:00:26 -05:00
henk717 acc5804820
Merge pull request #97 from mrseeker/patch-2
Add description of Janeway
2022-02-19 23:27:13 +01:00
henk717 ba7f0de0d5
Merge pull request #98 from AngryBeeSec/main
Update play-cuda.sh
2022-02-19 23:22:04 +01:00
AngryBeeSec b6d885cf0a
Update play-cuda.sh
Allows the use of newer models
2022-02-19 16:26:20 -05:00
Julius ter Pelkwijk 2b133548be
Add description of Janeway 2022-02-18 14:37:28 +01:00
henk717 90be138ac5 Add Janeway to the GPU Colab 2022-02-18 14:26:29 +01:00
henk717 f06acb59be
Add the Janeway model
New model released by Mr.Seeker
2022-02-18 14:18:41 +01:00
henk717 e571a17f84
Update readme.md 2022-02-15 20:18:21 +01:00
henk717 50a96485a9
Fix dm-haiku
They did a change that breaks compatibility with our other dependencies, forcing version 0.0.5 to fix this.
2022-02-14 18:05:50 +01:00
henk717 223af82083 GPU Link Fix 2022-02-06 12:05:00 +01:00
henk717 03e017bffd
Release of 1.17
Release of 1.17
2022-02-06 11:59:58 +01:00
henk717 040455b100 Enable Colab Mode in Colab
Now that we are ready to upstream we can safely enable the intended colab mode without breaking the old version.
2022-02-06 11:52:55 +01:00
henk717 91f3f25f4d
Update readme.md 2022-02-04 01:00:26 +01:00
henk717 3ee63b28c5 Defaults and Downloads
Default settings for the new repetition penalty settings (Better suggestions very much welcome since broader community testing has not been done).

Updated the Readme with the link to the offline installer.
2022-02-03 13:13:26 +01:00
henk717 c14e6fe5d2 Revert parralism
Testing is done, seems to cause issues in the order things happen with the interface.
2022-02-01 18:58:48 +01:00
henk717 d68a91ecd3 Save model values
Without saving these they get lost after someone saves. So saving them is more important than the model being able to override them after the fact.
2022-02-01 18:37:52 +01:00
henk717 b8e08cdd63 Enable Tokenizer Parralism
Has proven to be safe in my internal testing and does help with the interface lag at boot.

Enabling this so it can get wider testing.
2022-02-01 12:00:53 +01:00
henk717 ecd7b328ec Further Polishing
Multiple smaller changes to get 1.17 in shape for its release.
2022-02-01 11:15:44 +01:00
henk717 36b6dcb641 Increase newlinemode compatibility
Ran into issues with other modes like chatmode and adventure, moved it further down the pipeline and converting </s> back to \n before processing additional formatting.

Still has an issue with the html formatting not working, but at least the AI works now.
2022-01-31 19:39:32 +01:00
henk717 90fd67fd16 Update aiserver.py 2022-01-31 19:06:02 +01:00
henk717 b69e3f86e1 Update aiserver.py
Removes a debug line
2022-01-31 18:57:47 +01:00
henk717 8466068267 Don't save newlinemode
On second thought, it is probably better to not save this. Advanced users can add this themselves and that way newer versions of the model can override it if redownloaded.
2022-01-31 18:41:23 +01:00
henk717 729be62821 </s> new line mode
Needed for Fairseq and XGLM models that do not understand the regular \n .
2022-01-31 18:39:34 +01:00
henk717 44d49ea732 Remove Huggingface Repo
Huggingface's repo is further behind than conda-forge so we will no longer offer it in the installer. The more is loaded from conda-forge the better. The same transformers package will still be installed, but a newer one from conda-forge is now guaranteed.
2022-01-31 16:21:10 +01:00
henk717 03433810f1 KML improvements
Don't parse > since that has a different meaning for us, also whitelisting a few more markdown tags so lists work.
2022-01-30 20:07:47 +01:00
henk717 a484244392 Welcome Message API
Allows model creators to customize the welcome message using Markdown and Limited HTML

Existing United users need to run install_requirements..bat again, you can leave the existing dependencies intact.
2022-01-30 19:47:30 +01:00
henk717 ddfa21e6dd Breakmodel Fixes
Multiple old references and one mistake in my last commit fixed
2022-01-30 17:40:43 +01:00
henk717 57344935f6
--model without breakmodel disables bmsupported
Last commit it only did a warning, now it will turn bmsupported off so that the GPU routine is used.
2022-01-30 17:16:35 +01:00
henk717 f0c0a990ea NoBreakmodel variable
Adds a Nobreakmodel var that allows Breakmodel to be turned off. This can be done trough commandline or a model config (In case Neo is used by the models config without it being a true Neo model that is compatible with breakmodel).

In addition I removed the args.colab check for breakmodel support and instead make args.colab activate nobreakmodel. And I have added a new check so that breakmodel is not even attempted if you do not specify the layers but do launch a model from the command line.
2022-01-30 17:06:15 +01:00
henk717 5b5a479f29 Threading + Memory Sizes
Polish effort to suppress a warning and list more accurate VRAM as tested with the full 2048 max tokens.
2022-01-30 13:56:25 +01:00
henk717 fca7f8659f Badwords unification
TPU's no longer use hardcoded badwords but instead use the var
2022-01-29 18:09:53 +01:00
henk717 4a4fa4ca29 Update readme.md 2022-01-29 12:45:15 +01:00
henk717 f9f25c01e4 HTML escape the last commit
</s> didn't work, needed to be HTML escaped (Thanks for the tip VE!)
2022-01-28 19:21:05 +01:00
henk717 be0e57185f Improved Model Support
Changed the model VRAM requirements to what you'd need to comfortably run the model rather than barely (Like with the manual). Will probably revise this in a later commit.

More importantly, it now supports models that use </s> which will be required to support XGLM and Fairseq models.
2022-01-28 18:03:30 +01:00
henk717 9356573ac9 Merge branch 'united' of https://github.com/henk717/KoboldAI into united 2022-01-25 06:39:54 +01:00
henk717 987e78f980 More loading fixes
My last attempt at fixing this caused GPT2 to break, since the other fix is an edge case we assume that the GPT2 method should be used, and if that fails we try the other one to catch rare errors with bad model config's.
2022-01-25 06:39:23 +01:00
henk717 2d7f39247d TPU descriptions 2022-01-25 06:22:32 +01:00
henk717 2bb263c65d Reordering Settings
More settings reordering so similar settings are on the same rows now that we have more settings for the repetition penalty. Amount to generate is now top left so some muscle memory may be lost with the temp. But the settings that control AI randomness are on the same row now, and repetition related settings are next to each other as well.
2022-01-25 06:10:39 +01:00
henk717 392c59d48b
Merge pull request #72 from VE-FORBRYDERNE/rep-pen
Repetition penalty slope and range
2022-01-25 05:39:09 +01:00
Gnome Ann 3f18888eec Repetition penalty slope and range 2022-01-24 15:30:38 -05:00