206 Commits

Author SHA1 Message Date
Cohee
0ccdfe4bb7 Fix duped line 2024-02-22 02:45:35 +02:00
Cohee
0c1cf9ff2e Send sampler priority as array 2024-02-21 00:53:54 +02:00
kalomaze
32ee58e5e6 fix kcpp order reset 2024-02-19 18:12:56 -06:00
NWilson
b5887960b6 Merge branch 'release' into InfermaticAI 2024-02-16 08:53:04 -06:00
kalomaze
9fed7ed742 Make the neutralize option turn off quad sampling 2024-02-10 17:57:00 -06:00
kalomaze
70deb11d27 implement jank js + drag n drop sampler priority 2024-02-10 16:32:46 -06:00
kalomaze
818029288e Remove sending it if it's Aphrodite or TabbyAPI 2024-02-10 15:22:24 -06:00
kalomaze
2065f95edc Sampler priority support 2024-02-10 14:57:41 -06:00
Cohee
d8d4732614 Add koboldcpp param aliases 2024-02-09 15:32:40 +02:00
kingbri
530455979f Textgen: Add tabby to logprobs
Tabby now supports logprob returns. Add support to the backend gate.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-08 20:59:54 -05:00
kingbri
f372b2ac16 Textgen: Fix dynatemp defaults
The default min and max temp values should be 1

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-08 20:57:40 -05:00
Cohee
f1cadfde2b Fix llamacpp server 2024-02-04 17:41:46 +02:00
kingbri
3e34123b31 Textgen: Fix sampling param requests
1. TabbyAPI was not being included in temperature_last
2. DynaTemp parameters were still being sent even though the checkbox
was off. Cleanup the parameters if dynatemp is off for backends that
don't take in a dynatemp boolean.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-04 01:44:07 -05:00
Cohee
6d0d92aaff Add grammar and dynatemp for aphrodite 2024-02-02 20:06:46 +02:00
Alexander Abushady
da68d35ca5 Remove Noisy Sampling
Leaves DynaTemp Exponent and Quadratic Sampling.
2024-01-30 21:50:50 -05:00
Alexander Abushady
92aae0da04 Enter the Kalo Zone 2024-01-28 15:14:21 -05:00
AlpinDale
8af3939f09
fix: logprobs for Aphrodite
Since Aphrodite follows the same logprobs style as ooba, we can enable it easily.
2024-01-26 07:15:39 +00:00
khanon
60044c18a4 Implement Token Probabilities UI using logprobs 2024-01-25 18:34:46 -06:00
NWilson
f29f934c6b Progress 2024-01-24 06:59:27 -06:00
Cohee
570d5a30bd [skip ci] Lint fix 2024-01-20 20:40:40 +02:00
Cohee
5cb4dbc280 Trim TG types after split 2024-01-15 20:45:21 +02:00
Cohee
22e605168b Properly reset seed with neutralize samplers 2024-01-15 20:44:07 +02:00
RossAscends
c6487fa9cb make sampler nuke button work with zensliders 2024-01-14 18:33:40 +09:00
RossAscends
80b39bdac1 add neutralize samplers button for text completion slider panel 2024-01-14 18:07:43 +09:00
RossAscends
44bd57f763 filter ooba-only samplers out of non-ooba text completion slider panels 2024-01-14 17:26:22 +09:00
Cohee
1367642eb4 Fix maxTokens reference for n_predict alias 2024-01-12 12:20:03 +02:00
Cohee
8659bd57b7 Add koboldcpp rep pen aliases 2024-01-12 12:12:53 +02:00
Alexander Abushady
ec63cd8b6d Dynatemp Range
Kobold Dynatemp range set when deactivated, now will work properly
2024-01-09 00:54:20 -05:00
Alexander Abushady
5ad980cf99 Fix for realzies 2024-01-09 00:02:53 -05:00
Alexander Abushady
04a5d8390d Dynatemp UI v3.1
fixes for html positioning as well as api settings.
2024-01-08 23:58:06 -05:00
Alexander Abushady
b334acec23 DynaTemp UI v3
Backend bros won't play along now we gotta make them hold hands.
2024-01-08 22:20:27 -05:00
Cohee
2bea90a264
Merge pull request #1633 from AAbushady/new-dynatemp-ui
DynaTemp UI v2
2024-01-06 00:33:25 +02:00
Alexander Abushady
224dcc83a1 Update DynaTemp Sliders
More inline with changes made that will be implemented in KCPP main.
2024-01-05 14:36:39 -05:00
Cohee
5f93c30a96 #1627 Bypass status check and custom model for textgen type 2024-01-05 19:15:07 +02:00
Cohee
86d715cc16 Improve tokenizer detection 2024-01-05 16:17:06 +02:00
Alexander Abushady
d66be24518 DynaTemp UI v2
With DynaTemp moving into main KCPP now ST can have DynaTemp too!
2024-01-05 02:08:44 -05:00
Cohee
47cb017a45 #1589 Add 'cache_prompt' for l.cpp 2023-12-25 02:42:03 +02:00
Cohee
67dd52c21b #1309 Ollama text completion backend 2023-12-19 16:38:11 +02:00
Cohee
edd737e8bd #371 Add llama.cpp inference server support 2023-12-18 22:38:28 +02:00
Cohee
6e8104873e #1569 Add logit bias for text completions 2023-12-18 18:57:10 +02:00
Cohee
b0d9f14534 Re-add Together as a text completion source 2023-12-17 23:38:03 +02:00
valadaptive
5b3c96df50 Rename /textgenerationwebui endpoint
I'd like to migrate over to using "textgen" to mean text-generation APIs
in general, so I've renamed the /textgenerationwebui/* endpoints to
/backends/text-completions/*.
2023-12-12 16:40:14 -05:00
Cohee
b0e7b73a32 Fix streaming processor error handler hooks 2023-12-08 02:01:08 +02:00
valadaptive
cdcd913805 Don't stream events if the API returned a 4xx code 2023-12-07 18:00:36 -05:00
valadaptive
5540c165cf Refactor server-sent events parsing
Create one server-sent events stream class which implements the entire
spec (different line endings, chunking, etc) and use it in all the
streaming generators.
2023-12-07 18:00:36 -05:00
Cohee
0a092629ce Fix undefined variable reference 2023-12-03 23:24:39 +02:00
valadaptive
24ccef3aba Fix "settings" variable being shadowed 2023-12-03 15:44:36 -05:00
valadaptive
9c33ddbafc Make textgen settings type checks more concise 2023-12-03 14:56:01 -05:00
valadaptive
047c897ead Remove is[API] functions
Just use an equality comparison. It's a bit longer, but only because
"textgenerationwebui_settings" is a long identifier.
2023-12-03 14:56:01 -05:00
valadaptive
ba54e3dea0 Replaces is_[api] params with api_type param
These were 5 mutually-exclusive booleans, which can be replaced with one
param that takes on 5 values, one for each API type.
2023-12-03 14:56:01 -05:00