Commit Graph

119 Commits

Author SHA1 Message Date
Cohee 95c49029f7 Add aphrodite model selector 2024-03-01 23:02:43 +02:00
gabriel dhimoila 76669ff8bb add max_tokens_second 2024-02-29 00:55:25 +01:00
Cohee f962ad5c02 Add OpenRouter as a text completion source 2024-02-25 22:47:07 +02:00
Cohee fc289126fa Add event type for text completion generation request settings ready 2024-02-24 21:45:33 +02:00
Cohee d140b8d5be Parse non-streaming tabby logprobs 2024-02-24 20:10:53 +02:00
Cohee 8848818d67 Fix dynatemp neutralization 2024-02-24 15:32:12 +02:00
Cohee 299bd9d563 Merge branch 'staging' into llamacpp-sampler-order 2024-02-24 15:10:58 +02:00
Cohee 13aebc623a
Merge pull request #1854 from deciare/llamacpp-probs
Request and display token probabilities from llama.cpp backend
2024-02-24 15:06:28 +02:00
Cohee 9287ff18de Fix for non-streaming 2024-02-24 14:50:06 +02:00
Deciare 9eba076ae4 Sampler order for llama.cpp server backend 2024-02-23 23:01:04 -05:00
Deciare 936fbac6c5 Merge remote-tracking branch 'origin/staging' into llamacpp-probs 2024-02-23 17:45:54 -05:00
Deciare 344b9eedbc Request token probabilities from llama.cpp backend
llama.cpp server token probabilities are given as values ranging from
0 to 1 instead of as logarithms.
2024-02-23 14:01:46 -05:00
NWilson f569424f3e Merge branch 'staging' into InfermaticAI 2024-02-22 08:32:10 -06:00
Cohee 0ccdfe4bb7 Fix duped line 2024-02-22 02:45:35 +02:00
Cohee 0c1cf9ff2e Send sampler priority as array 2024-02-21 00:53:54 +02:00
kalomaze 32ee58e5e6 fix kcpp order reset 2024-02-19 18:12:56 -06:00
NWilson b5887960b6 Merge branch 'release' into InfermaticAI 2024-02-16 08:53:04 -06:00
kalomaze 9fed7ed742 Make the neutralize option turn off quad sampling 2024-02-10 17:57:00 -06:00
kalomaze 70deb11d27 implement jank js + drag n drop sampler priority 2024-02-10 16:32:46 -06:00
kalomaze 818029288e Remove sending it if it's Aphrodite or TabbyAPI 2024-02-10 15:22:24 -06:00
kalomaze 2065f95edc Sampler priority support 2024-02-10 14:57:41 -06:00
Cohee d8d4732614 Add koboldcpp param aliases 2024-02-09 15:32:40 +02:00
kingbri 530455979f Textgen: Add tabby to logprobs
Tabby now supports logprob returns. Add support to the backend gate.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-08 20:59:54 -05:00
kingbri f372b2ac16 Textgen: Fix dynatemp defaults
The default min and max temp values should be 1

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-08 20:57:40 -05:00
Cohee f1cadfde2b Fix llamacpp server 2024-02-04 17:41:46 +02:00
kingbri 3e34123b31 Textgen: Fix sampling param requests
1. TabbyAPI was not being included in temperature_last
2. DynaTemp parameters were still being sent even though the checkbox
was off. Cleanup the parameters if dynatemp is off for backends that
don't take in a dynatemp boolean.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-02-04 01:44:07 -05:00
Cohee 6d0d92aaff Add grammar and dynatemp for aphrodite 2024-02-02 20:06:46 +02:00
Alexander Abushady da68d35ca5 Remove Noisy Sampling
Leaves DynaTemp Exponent and Quadratic Sampling.
2024-01-30 21:50:50 -05:00
Alexander Abushady 92aae0da04 Enter the Kalo Zone 2024-01-28 15:14:21 -05:00
AlpinDale 8af3939f09
fix: logprobs for Aphrodite
Since Aphrodite follows the same logprobs style as ooba, we can enable it easily.
2024-01-26 07:15:39 +00:00
khanon 60044c18a4 Implement Token Probabilities UI using logprobs 2024-01-25 18:34:46 -06:00
NWilson f29f934c6b Progress 2024-01-24 06:59:27 -06:00
Cohee 570d5a30bd [skip ci] Lint fix 2024-01-20 20:40:40 +02:00
Cohee 5cb4dbc280 Trim TG types after split 2024-01-15 20:45:21 +02:00
Cohee 22e605168b Properly reset seed with neutralize samplers 2024-01-15 20:44:07 +02:00
RossAscends c6487fa9cb make sampler nuke button work with zensliders 2024-01-14 18:33:40 +09:00
RossAscends 80b39bdac1 add neutralize samplers button for text completion slider panel 2024-01-14 18:07:43 +09:00
RossAscends 44bd57f763 filter ooba-only samplers out of non-ooba text completion slider panels 2024-01-14 17:26:22 +09:00
Cohee 1367642eb4 Fix maxTokens reference for n_predict alias 2024-01-12 12:20:03 +02:00
Cohee 8659bd57b7 Add koboldcpp rep pen aliases 2024-01-12 12:12:53 +02:00
Alexander Abushady ec63cd8b6d Dynatemp Range
Kobold Dynatemp range set when deactivated, now will work properly
2024-01-09 00:54:20 -05:00
Alexander Abushady 5ad980cf99 Fix for realzies 2024-01-09 00:02:53 -05:00
Alexander Abushady 04a5d8390d Dynatemp UI v3.1
fixes for html positioning as well as api settings.
2024-01-08 23:58:06 -05:00
Alexander Abushady b334acec23 DynaTemp UI v3
Backend bros won't play along now we gotta make them hold hands.
2024-01-08 22:20:27 -05:00
Cohee 2bea90a264
Merge pull request #1633 from AAbushady/new-dynatemp-ui
DynaTemp UI v2
2024-01-06 00:33:25 +02:00
Alexander Abushady 224dcc83a1 Update DynaTemp Sliders
More inline with changes made that will be implemented in KCPP main.
2024-01-05 14:36:39 -05:00
Cohee 5f93c30a96 #1627 Bypass status check and custom model for textgen type 2024-01-05 19:15:07 +02:00
Cohee 86d715cc16 Improve tokenizer detection 2024-01-05 16:17:06 +02:00
Alexander Abushady d66be24518 DynaTemp UI v2
With DynaTemp moving into main KCPP now ST can have DynaTemp too!
2024-01-05 02:08:44 -05:00
Cohee 47cb017a45 #1589 Add 'cache_prompt' for l.cpp 2023-12-25 02:42:03 +02:00