Commit Graph

187 Commits

Author SHA1 Message Date
Cohee 2478e17164 Remove legacy API for Aphrodite 2024-09-15 08:05:53 +03:00
Cohee 50de678980 Hide beam search for vllm. It never worked. 2024-09-14 16:53:21 +03:00
Cohee f0d361bc7a Remove unused beam search 2024-09-14 16:41:22 +03:00
Cohee 28837ff883 Hard code include_stop_str_in_output 2024-09-14 16:32:50 +03:00
AlpinDale 1cc935796f fix early_stopping 2024-09-14 12:45:29 +00:00
AlpinDale fde76069e0 remove beam search 2024-09-14 12:42:21 +00:00
AlpinDale 9c94348491 clean up 2024-09-14 12:38:19 +00:00
AlpinDale efd477da04 chore: slight refactor of aphrodite samplers 2024-09-13 10:34:06 +00:00
Cohee 7534e137ae Parse Tabby streaming error. 2024-09-08 22:24:03 +03:00
Cohee 42fa3c79d7 Add Tabby model selection 2024-09-08 22:23:25 +03:00
Cohee b16915cfb9 Remove truncation_length from textgen settings 2024-09-03 14:06:10 +00:00
Cohee ae2d0f04ed Add XTC for koboldcpp 2024-08-31 20:18:51 +03:00
Cohee 696c83f96d [chore] Fix eslint 2024-08-19 21:36:28 +03:00
Cohee d77363cd7c Merge branch 'staging' into feat/xtc 2024-08-19 21:35:35 +03:00
Vitor e28257096a added xtc parameter for ooba 2024-08-19 01:32:45 -03:00
Cohee 9215dfd0c6 Replace macros in DRY sequence breakers 2024-08-18 13:50:58 +03:00
Wolfsblvt 28a9c45c31 /api-url slash command to get/set server url 2024-08-18 01:05:25 +02:00
Cohee 8ff4a4a36a Don't modify legacy URL path for inappropriate API types 2024-08-01 15:01:38 +03:00
Cohee e6e8d7726b Generate random seed for HF endpoint 2024-07-26 12:50:39 +00:00
Cohee 5f2a73ac9f Expose "Allow fallback providers" for OpenRouter 2024-07-19 23:34:16 +03:00
Cohee b66e589b30 Don't use dynatemp for unsupported backends 2024-07-02 14:17:10 +00:00
Cohee b62cbdeebd Merge branch 'staging' into DarokCx/release 2024-06-28 19:09:12 +03:00
Cohee bbb1a6e578 Add huggingface inference as text completion source 2024-06-28 18:17:27 +03:00
DarokCx 29ff0876a7 Added additional headers 2024-06-28 08:20:15 -04:00
DarokCx bd5592de7b Added featherless, connect button not working 2024-06-27 09:06:11 -04:00
Cohee 41ab90bb8e Support more parameters for Infermatic 2024-06-24 19:16:20 +03:00
Cohee a3dbcf3c2a Fix context and response size not being passed to Ollama 2024-06-24 03:48:34 +03:00
Cohee 1dd21caa66 Adjust number of VLLM logprobs 2024-06-09 00:59:40 +03:00
Cohee ff680f46cc Add rep_pen_slope control for koboldcpp 2024-06-05 22:05:41 +03:00
kingbri 4528655bb7 Textgen: Add multiswipe support for TabbyAPI
Tabby now supports batching and the "n" parameter for both non-streaming
and streaming. Add this into SillyTavern.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-28 00:55:57 -04:00
Cohee e8b96fec02 Merge branch 'staging' into new-samplers 2024-05-22 23:26:47 +03:00
kokansei 75a1ef4304
Add DRY Samplers to ST Staging (#2211)
* Add files via upload

* Add files via upload

* Delete public/index.html

* Add files via upload

* Delete public/scripts/textgen-settings.js

* Add files via upload

* Delete public/scripts/power-user.js

* Add files via upload

* Delete public/scripts/power-user.js

* Add files via upload

* Update power-user.js

* Update index.html

* Fix control attribution

* Fix app loading

* Put sequence breakers under DRY block

* DRY for DRY

* Update public/index.html

Co-authored-by: Philipp Emanuel Weidmann <pew@worldwidemann.com>

* Merge fix

* Add llamacpp control. Add default value for sequence breakers

* Forgot reset

---------

Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
Co-authored-by: Philipp Emanuel Weidmann <pew@worldwidemann.com>
2024-05-22 20:46:52 +03:00
kingbri 74b6ed97c2 Textgen: Add repetition decay for TabbyAPI
Repetition decay softens the drop off for repetition penalty. It's
best paired with rep pen range.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-22 00:09:10 -04:00
kingbri 99d143263d Textgen: Add skew sampling
Adds the option from skew sampling from exllamaV2

Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-21 23:48:33 -04:00
kingbri a12df762a0 Textgen: Add speculative_ngram for TabbyAPI
Speculative ngram allows for a different method of speculative
decoding. Using a draft model is still preferred.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-21 23:37:36 -04:00
Cohee ee913be46b
Merge pull request #2266 from sasha0552/vllm-fixes
vLLM fixes
2024-05-19 14:23:07 +03:00
RossAscends c7232ae23c WIP textgen API custom sampler display 2024-05-19 15:06:29 +09:00
sasha0552 db5e2d95c2
vLLM fixes
* Enable seed field for vLLM

* Enable beam search for vLLM

* Set the default length penalty to 1
(There is validation error from vLLM when beam search is disabled and the value is not equal to 1)
2024-05-19 04:34:11 +00:00
Cohee 4227968dfa Allow using JSON schema with llamacpp server 2024-05-18 18:50:48 +03:00
Cohee c7d75b7789 llamacpp broke 2024-05-12 21:41:07 +03:00
Cohee 27ccc6b090 Minor stylistic changes 2024-05-11 11:38:22 +03:00
kingbri 62faddac8d Textgen: Add banned_strings
TabbyAPI supports the ability to ban the presence of strings during
a generation. Add this support in SillyTavern by handling lines
enclosed in quotes as a special case.

Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-11 00:58:29 -04:00
Cohee c73bfbd7b0 Safari bruh moment 2024-05-06 21:21:03 +03:00
Cohee 7063fce2af Selectable openrouter providers 2024-05-06 19:26:20 +03:00
Cohee 05db2552b3 Fix Top K disabled state for Infermatic.
Also an icon.
2024-05-04 02:37:05 +03:00
Cohee 7bfd666321 Add llama 3 tokenizer 2024-05-03 23:59:39 +03:00
Cohee 7b87f44518 Clean-up API-specific settings 2024-05-03 20:02:13 +03:00
sasha0552 2bd239fe81
Initial vLLM support 2024-05-02 22:40:40 +00:00
Cohee 022c180b62 Lint and clean-up 2024-04-15 00:39:15 +03:00
Cohee 3e60919289 Specify LLM prompt in case JSON schema is not supported 2024-04-14 17:13:54 +03:00