kalomaze
2065f95edc
Sampler priority support
2024-02-10 14:57:41 -06:00
Cohee
d8d4732614
Add koboldcpp param aliases
2024-02-09 15:32:40 +02:00
kingbri
530455979f
Textgen: Add tabby to logprobs
...
Tabby now supports logprob returns. Add support to the backend gate.
Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-08 20:59:54 -05:00
kingbri
f372b2ac16
Textgen: Fix dynatemp defaults
...
The default min and max temp values should be 1
Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-08 20:57:40 -05:00
Cohee
f1cadfde2b
Fix llamacpp server
2024-02-04 17:41:46 +02:00
kingbri
3e34123b31
Textgen: Fix sampling param requests
...
1. TabbyAPI was not being included in temperature_last
2. DynaTemp parameters were still being sent even though the checkbox
was off. Cleanup the parameters if dynatemp is off for backends that
don't take in a dynatemp boolean.
Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-04 01:44:07 -05:00
Cohee
6d0d92aaff
Add grammar and dynatemp for aphrodite
2024-02-02 20:06:46 +02:00
Alexander Abushady
da68d35ca5
Remove Noisy Sampling
...
Leaves DynaTemp Exponent and Quadratic Sampling.
2024-01-30 21:50:50 -05:00
Alexander Abushady
92aae0da04
Enter the Kalo Zone
2024-01-28 15:14:21 -05:00
AlpinDale
8af3939f09
fix: logprobs for Aphrodite
...
Since Aphrodite follows the same logprobs style as ooba, we can enable it easily.
2024-01-26 07:15:39 +00:00
khanon
60044c18a4
Implement Token Probabilities UI using logprobs
2024-01-25 18:34:46 -06:00
NWilson
f29f934c6b
Progress
2024-01-24 06:59:27 -06:00
Cohee
570d5a30bd
[skip ci] Lint fix
2024-01-20 20:40:40 +02:00
Cohee
5cb4dbc280
Trim TG types after split
2024-01-15 20:45:21 +02:00
Cohee
22e605168b
Properly reset seed with neutralize samplers
2024-01-15 20:44:07 +02:00
RossAscends
c6487fa9cb
make sampler nuke button work with zensliders
2024-01-14 18:33:40 +09:00
RossAscends
80b39bdac1
add neutralize samplers button for text completion slider panel
2024-01-14 18:07:43 +09:00
RossAscends
44bd57f763
filter ooba-only samplers out of non-ooba text completion slider panels
2024-01-14 17:26:22 +09:00
Cohee
1367642eb4
Fix maxTokens reference for n_predict alias
2024-01-12 12:20:03 +02:00
Cohee
8659bd57b7
Add koboldcpp rep pen aliases
2024-01-12 12:12:53 +02:00
Alexander Abushady
ec63cd8b6d
Dynatemp Range
...
Kobold Dynatemp range set when deactivated, now will work properly
2024-01-09 00:54:20 -05:00
Alexander Abushady
5ad980cf99
Fix for realzies
2024-01-09 00:02:53 -05:00
Alexander Abushady
04a5d8390d
Dynatemp UI v3.1
...
fixes for html positioning as well as api settings.
2024-01-08 23:58:06 -05:00
Alexander Abushady
b334acec23
DynaTemp UI v3
...
Backend bros won't play along now we gotta make them hold hands.
2024-01-08 22:20:27 -05:00
Cohee
2bea90a264
Merge pull request #1633 from AAbushady/new-dynatemp-ui
...
DynaTemp UI v2
2024-01-06 00:33:25 +02:00
Alexander Abushady
224dcc83a1
Update DynaTemp Sliders
...
More inline with changes made that will be implemented in KCPP main.
2024-01-05 14:36:39 -05:00
Cohee
5f93c30a96
#1627 Bypass status check and custom model for textgen type
2024-01-05 19:15:07 +02:00
Cohee
86d715cc16
Improve tokenizer detection
2024-01-05 16:17:06 +02:00
Alexander Abushady
d66be24518
DynaTemp UI v2
...
With DynaTemp moving into main KCPP now ST can have DynaTemp too!
2024-01-05 02:08:44 -05:00
Cohee
47cb017a45
#1589 Add 'cache_prompt' for l.cpp
2023-12-25 02:42:03 +02:00
Cohee
67dd52c21b
#1309 Ollama text completion backend
2023-12-19 16:38:11 +02:00
Cohee
edd737e8bd
#371 Add llama.cpp inference server support
2023-12-18 22:38:28 +02:00
Cohee
6e8104873e
#1569 Add logit bias for text completions
2023-12-18 18:57:10 +02:00
Cohee
b0d9f14534
Re-add Together as a text completion source
2023-12-17 23:38:03 +02:00
valadaptive
5b3c96df50
Rename /textgenerationwebui endpoint
...
I'd like to migrate over to using "textgen" to mean text-generation APIs
in general, so I've renamed the /textgenerationwebui/* endpoints to
/backends/text-completions/*.
2023-12-12 16:40:14 -05:00
Cohee
b0e7b73a32
Fix streaming processor error handler hooks
2023-12-08 02:01:08 +02:00
valadaptive
cdcd913805
Don't stream events if the API returned a 4xx code
2023-12-07 18:00:36 -05:00
valadaptive
5540c165cf
Refactor server-sent events parsing
...
Create one server-sent events stream class which implements the entire
spec (different line endings, chunking, etc) and use it in all the
streaming generators.
2023-12-07 18:00:36 -05:00
Cohee
0a092629ce
Fix undefined variable reference
2023-12-03 23:24:39 +02:00
valadaptive
24ccef3aba
Fix "settings" variable being shadowed
2023-12-03 15:44:36 -05:00
valadaptive
9c33ddbafc
Make textgen settings type checks more concise
2023-12-03 14:56:01 -05:00
valadaptive
047c897ead
Remove is[API] functions
...
Just use an equality comparison. It's a bit longer, but only because
"textgenerationwebui_settings" is a long identifier.
2023-12-03 14:56:01 -05:00
valadaptive
ba54e3dea0
Replaces is_[api] params with api_type param
...
These were 5 mutually-exclusive booleans, which can be replaced with one
param that takes on 5 values, one for each API type.
2023-12-03 14:56:01 -05:00
Cohee
939e938ba2
Disallow multiswipe for quiet gens
2023-12-03 20:56:25 +02:00
Cohee
1786b0d340
#1403 Add Aphrodite multi-swipe
2023-12-03 20:40:09 +02:00
Cohee
64a3564892
lint: Comma dangle
2023-12-02 22:06:57 +02:00
Cohee
c63cd87cc0
lint: Require semicolons
2023-12-02 21:11:06 +02:00
valadaptive
a37f874e38
Require single quotes
2023-12-02 13:04:51 -05:00
valadaptive
60ac1aa1c7
Enable no-empty lint
2023-12-02 10:32:07 -05:00
Cohee
61908935f5
Stop string for user-continue. Trim spaces after name2
2023-11-22 16:16:48 +02:00
Cohee
ddbdceba64
Add sampler order for koboldcpp under text completions
2023-11-21 01:47:57 +02:00
Cohee
1ebfddf07e
Use mistral and yi tokenizers for custom token bans
2023-11-21 01:04:27 +02:00
Cohee
96caddfd71
Add koboldcpp as Text Completion source
2023-11-19 17:14:53 +02:00
kingbri
f31b996cb5
Text Generation: Add TabbyAPI support
...
TabbyAPI is an exllamav2 only API server that aims to provide a simple
experience for loading and chatting with exl2 models.
SillyTavern currently doesn't have the ability to load and unload models,
so only add the OAI compatible completion endpoints.
The repository can be found here:
https://github.com/theroyallab/tabbyAPI
Signed-off-by: kingbri <bdashore3@proton.me>
2023-11-17 00:37:39 -05:00
Cohee
f4630f9808
Add lastMessage macro. Substitute params in CFG negative prompts
2023-11-16 16:58:49 +02:00
Cohee
61764a9a21
Change mancer base URL via debug menu
2023-11-13 11:13:39 +02:00
RossAscends
52c07e0895
setup Aphrodite-specific API flag handling
2023-11-13 16:36:01 +09:00
Cohee
480099ee97
Mancer will work in legacy API mode. Remove Soft Prompt mentions.
2023-11-08 18:16:47 +02:00
Cohee
738c1d82c4
Clarify legacy API notification
2023-11-08 17:54:08 +02:00
Cohee
cbc0b41773
Prevent doubled legacy toast
2023-11-08 10:28:02 +02:00
Cohee
b6d29d2980
Return force /api append for Legacy ooba with relaxed URL off
2023-11-08 10:25:49 +02:00
Cohee
e76c18c104
Legacy ooba API compatibility shim
2023-11-08 10:13:28 +02:00
RossAscends
00c029a65a
only api format for ooba, more visible checkboxes
2023-11-08 14:08:51 +09:00
Cohee
b2629d9718
Refactor status checks and indicators.
2023-11-08 02:52:03 +02:00
Cohee
bda15ef007
Adjust aphro streaming error parser
2023-11-08 00:25:06 +02:00
Cohee
2c7b954a8d
#1328 New API schema for ooba / mancer / aphrodite
2023-11-08 00:17:13 +02:00
RossAscends
a5b5dfdcb6
temperature last for ooba
2023-11-05 23:50:07 +09:00
RossAscends
937c5d7e5f
Min P for Ooba, fix preset loading for ZenSliders
2023-11-05 23:08:06 +09:00
RossAscends
447c43c9d7
placeholders for Aphro samplers
2023-11-04 11:16:12 +09:00
RossAscends
c79bf951b7
minor skill issue with slider overhaul
2023-10-26 20:41:12 +09:00
RossAscends
e9107870c0
Merge branch 'staging' of https://github.com/Cohee1207/SillyTavern into staging
2023-10-26 13:20:49 +09:00
RossAscends
d50124e937
appwide slider overhaul
2023-10-26 13:20:47 +09:00
kingbri
4f67e9f38b
Ooba: Add frequency and presence penalty sliders
...
These options essentially supersede repetition penalty, but since
ooba also gives the option, add these new sliders in conjunction
with the repetition penalty ones.
More testing is needed to find good default values.
Signed-off-by: kingbri <bdashore3@proton.me>
2023-10-25 15:22:47 -04:00
Cohee
c4fbc8373d
Add raw generate function
2023-10-11 17:56:52 +03:00
Cohee
d265179f46
Don't crash ST server on invalid streaming URL
2023-10-08 23:42:28 +03:00
valden80
788bbe969f
Feature, ability to add banned words sequences from anywhere through {{banned "..."}} macro. ( #1202 )
...
* Add files via upload
* Add files via upload
* Add files via upload
* Add files via upload
* Add files via upload
* Fix constant assignment, reformat code
---------
Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
2023-10-05 13:10:41 +03:00
Cohee
bb47712696
Split oobabooga/mancer sources. Add aphrodite support
2023-09-28 19:10:00 +03:00
Cohee
306cf51da4
#1180 Add custom token bans. Return grammar strings
2023-09-27 22:09:09 +03:00
Cohee
773d42b886
Suspend ooba grammar
2023-09-25 14:07:13 +03:00
Cohee
86e9974410
Add grammar for ooba llama.cpp
2023-09-24 17:57:00 +03:00
Cohee
322511caa9
Remove legacy Pygmalion formatting, part 2
2023-09-06 14:19:29 +03:00
Cohee
f48cc0db31
Unify cfgValues parsing between ooba/Novel
2023-08-23 18:26:56 +03:00
Cohee
fad6c164cb
Don't set negative prompt from CFG extension to ooba at guidance scale 1.0
2023-08-23 17:44:38 +03:00
Cohee
6923044a34
Allow CFG on impersonation
2023-08-22 19:35:56 +03:00
kingbri
25c7686462
CFG: Pass entire guidance object
...
If an API doesn't use the negative prompt from generate, allow
the negative prompt to be refetched in the specific API. This
requires the full guidance scale object instead of just the number.
Signed-off-by: kingbri <bdashore3@proton.me>
2023-08-22 10:17:20 -04:00
kingbri
92e6c6a998
CFG: Use ST prompt builder for negatives
...
Make the generate function build a negative prompt in addition to the
normal one. This allows for nonconflicting insertion with other extension
prompts and World Info.
Signed-off-by: kingbri <bdashore3@proton.me>
2023-08-20 00:44:39 -04:00
kingbri
7191f7a8ad
CFG: Add insertion depth and custom separator
...
Insertion depth allows for CFG to variably inject itself into the
negative prompt. This is similar to how Author's note works.
However, this method of insertion depth conflicts with AN and
world info where negatives can be meshed between two lines
of those specific insertions.
A custom separator must be wrapped in quotes, otherwise the default
separator is a newline for negative cascading.
Signed-off-by: kingbri <bdashore3@proton.me>
2023-08-19 01:29:59 -04:00
kingbri
cdbca6d9fd
CFG: Include the entire prompt with negative prompt
...
CFG with LLMs works differently than stable diffusion. The main principle
is prompt mixing and utilizing the differences between the two prompts
rather than a full "negative prompt" of what the user doesn't want.
SillyTavern its own way of formatting a prompt sent to an LLM backend.
Therefore, take that prompt and add negatives to it.
Signed-off-by: kingbri <bdashore3@proton.me>
2023-08-18 22:18:09 -04:00
Cohee
32eb877fdb
Add native presetable CFG scale for ooba
2023-08-14 13:06:20 +03:00
kingbri
5bb6c89868
CFG: Add per-chat CFG and fixes
...
Per-chat CFG applies a CFG setting per-chat only rather than character
or globally. This overrides all other CFG settings (this will be changed).
Also add fixes to remove character CFG entries properly and not to
apply CFG if the scale is 1 as that won't do anything to generation.
Signed-off-by: kingbri <bdashore3@proton.me>
2023-08-11 01:39:15 -04:00
kingbri
0c94740560
Extensions: Add global CFG support
...
CFG is bundled as an extension which is more flexible in terms of
storing settings and consolidating code.
Global CFG is currently supported and the old power user setting
is auto-migrated.
Signed-off-by: kingbri <bdashore3@proton.me>
2023-08-11 01:39:15 -04:00
50h100a
53e41bdda8
Relax URL requirements when Mancer is enabled.
2023-08-08 16:12:03 -04:00
50h100a
c1ab0212e5
update comments
2023-08-07 17:10:05 -04:00
50h100a
128945aaaa
Copy 'relaxed api url' functionality over to webui
2023-08-07 16:46:32 -04:00
50h100a
2fc6813e66
code review adjustments part 1
2023-08-03 06:07:54 -04:00
Cohee
130559d499
#746 Add preset manager for ooba/kobold
2023-07-23 23:52:31 +03:00
Cohee
1b9ca4c9d9
Unhide advanced parameters for Novel
2023-07-23 02:09:03 +03:00
ThisIsPIRI
e2f1f4a57a
Add repetition penalty range for ooba
2023-07-22 07:38:35 +09:00
Cohee
edd41989fd
Initial commit
2023-07-20 20:32:15 +03:00