Commit Graph

119 Commits

Author SHA1 Message Date
Cohee
95c49029f7 Add aphrodite model selector 2024-03-01 23:02:43 +02:00
gabriel dhimoila
76669ff8bb add max_tokens_second 2024-02-29 00:55:25 +01:00
Cohee
f962ad5c02 Add OpenRouter as a text completion source 2024-02-25 22:47:07 +02:00
Cohee
fc289126fa Add event type for text completion generation request settings ready 2024-02-24 21:45:33 +02:00
Cohee
d140b8d5be Parse non-streaming tabby logprobs 2024-02-24 20:10:53 +02:00
Cohee
8848818d67 Fix dynatemp neutralization 2024-02-24 15:32:12 +02:00
Cohee
299bd9d563 Merge branch 'staging' into llamacpp-sampler-order 2024-02-24 15:10:58 +02:00
Cohee
13aebc623a
Merge pull request #1854 from deciare/llamacpp-probs
Request and display token probabilities from llama.cpp backend
2024-02-24 15:06:28 +02:00
Cohee
9287ff18de Fix for non-streaming 2024-02-24 14:50:06 +02:00
Deciare
9eba076ae4 Sampler order for llama.cpp server backend 2024-02-23 23:01:04 -05:00
Deciare
936fbac6c5 Merge remote-tracking branch 'origin/staging' into llamacpp-probs 2024-02-23 17:45:54 -05:00
Deciare
344b9eedbc Request token probabilities from llama.cpp backend
llama.cpp server token probabilities are given as values ranging from
0 to 1 instead of as logarithms.
2024-02-23 14:01:46 -05:00
NWilson
f569424f3e Merge branch 'staging' into InfermaticAI 2024-02-22 08:32:10 -06:00
Cohee
0ccdfe4bb7 Fix duped line 2024-02-22 02:45:35 +02:00
Cohee
0c1cf9ff2e Send sampler priority as array 2024-02-21 00:53:54 +02:00
kalomaze
32ee58e5e6 fix kcpp order reset 2024-02-19 18:12:56 -06:00
NWilson
b5887960b6 Merge branch 'release' into InfermaticAI 2024-02-16 08:53:04 -06:00
kalomaze
9fed7ed742 Make the neutralize option turn off quad sampling 2024-02-10 17:57:00 -06:00
kalomaze
70deb11d27 implement jank js + drag n drop sampler priority 2024-02-10 16:32:46 -06:00
kalomaze
818029288e Remove sending it if it's Aphrodite or TabbyAPI 2024-02-10 15:22:24 -06:00
kalomaze
2065f95edc Sampler priority support 2024-02-10 14:57:41 -06:00
Cohee
d8d4732614 Add koboldcpp param aliases 2024-02-09 15:32:40 +02:00
kingbri
530455979f Textgen: Add tabby to logprobs
Tabby now supports logprob returns. Add support to the backend gate.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-08 20:59:54 -05:00
kingbri
f372b2ac16 Textgen: Fix dynatemp defaults
The default min and max temp values should be 1

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-08 20:57:40 -05:00
Cohee
f1cadfde2b Fix llamacpp server 2024-02-04 17:41:46 +02:00
kingbri
3e34123b31 Textgen: Fix sampling param requests
1. TabbyAPI was not being included in temperature_last
2. DynaTemp parameters were still being sent even though the checkbox
was off. Cleanup the parameters if dynatemp is off for backends that
don't take in a dynatemp boolean.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-04 01:44:07 -05:00
Cohee
6d0d92aaff Add grammar and dynatemp for aphrodite 2024-02-02 20:06:46 +02:00
Alexander Abushady
da68d35ca5 Remove Noisy Sampling
Leaves DynaTemp Exponent and Quadratic Sampling.
2024-01-30 21:50:50 -05:00
Alexander Abushady
92aae0da04 Enter the Kalo Zone 2024-01-28 15:14:21 -05:00
AlpinDale
8af3939f09
fix: logprobs for Aphrodite
Since Aphrodite follows the same logprobs style as ooba, we can enable it easily.
2024-01-26 07:15:39 +00:00
khanon
60044c18a4 Implement Token Probabilities UI using logprobs 2024-01-25 18:34:46 -06:00
NWilson
f29f934c6b Progress 2024-01-24 06:59:27 -06:00
Cohee
570d5a30bd [skip ci] Lint fix 2024-01-20 20:40:40 +02:00
Cohee
5cb4dbc280 Trim TG types after split 2024-01-15 20:45:21 +02:00
Cohee
22e605168b Properly reset seed with neutralize samplers 2024-01-15 20:44:07 +02:00
RossAscends
c6487fa9cb make sampler nuke button work with zensliders 2024-01-14 18:33:40 +09:00
RossAscends
80b39bdac1 add neutralize samplers button for text completion slider panel 2024-01-14 18:07:43 +09:00
RossAscends
44bd57f763 filter ooba-only samplers out of non-ooba text completion slider panels 2024-01-14 17:26:22 +09:00
Cohee
1367642eb4 Fix maxTokens reference for n_predict alias 2024-01-12 12:20:03 +02:00
Cohee
8659bd57b7 Add koboldcpp rep pen aliases 2024-01-12 12:12:53 +02:00
Alexander Abushady
ec63cd8b6d Dynatemp Range
Kobold Dynatemp range set when deactivated, now will work properly
2024-01-09 00:54:20 -05:00
Alexander Abushady
5ad980cf99 Fix for realzies 2024-01-09 00:02:53 -05:00
Alexander Abushady
04a5d8390d Dynatemp UI v3.1
fixes for html positioning as well as api settings.
2024-01-08 23:58:06 -05:00
Alexander Abushady
b334acec23 DynaTemp UI v3
Backend bros won't play along now we gotta make them hold hands.
2024-01-08 22:20:27 -05:00
Cohee
2bea90a264
Merge pull request #1633 from AAbushady/new-dynatemp-ui
DynaTemp UI v2
2024-01-06 00:33:25 +02:00
Alexander Abushady
224dcc83a1 Update DynaTemp Sliders
More inline with changes made that will be implemented in KCPP main.
2024-01-05 14:36:39 -05:00
Cohee
5f93c30a96 #1627 Bypass status check and custom model for textgen type 2024-01-05 19:15:07 +02:00
Cohee
86d715cc16 Improve tokenizer detection 2024-01-05 16:17:06 +02:00
Alexander Abushady
d66be24518 DynaTemp UI v2
With DynaTemp moving into main KCPP now ST can have DynaTemp too!
2024-01-05 02:08:44 -05:00
Cohee
47cb017a45 #1589 Add 'cache_prompt' for l.cpp 2023-12-25 02:42:03 +02:00