Cohee
725cfdebca
Merge pull request #2993 from theo77186/llamacpp_xtc
...
Enable XTC support for llama.cpp
2024-10-16 20:52:06 +03:00
Cohee
276316ae62
Auto-add missing samplers for llama.cpp
2024-10-16 20:51:32 +03:00
theo77186
37f3a24ac6
Enable XTC support for llama.cpp
2024-10-16 18:30:46 +02:00
Cohee
24884b3206
TextGenWebUI: remove legacy API mode
2024-10-15 00:32:56 +03:00
Cohee
80f9d90918
Merge branch 'staging' into featherless-models
2024-10-09 21:51:59 +03:00
Cohee
151a77cd36
Initialize page on load
2024-10-09 21:51:31 +03:00
Cohee
3335dbf1a7
Add empty tool calls to streaming processors
2024-10-02 01:59:53 +03:00
Cohee
ac1bc95551
Xtra Type Cafety
2024-09-28 13:59:03 +03:00
Cohee
b1bb41313f
Insert missing ooba samplers
2024-09-28 13:56:57 +03:00
Cohee
8bd1c88d15
Enable XTC
2024-09-28 13:52:55 +03:00
Cohee
e59dacc27f
Merge branch 'staging' into xtc-ooba
2024-09-28 13:49:49 +03:00
Cohee
aaf8fa13e5
Remove condition
2024-09-21 13:18:52 +03:00
AlpinDale
a5fea35f12
chore: update aphrodite samplers
2024-09-21 08:18:06 +00:00
Cohee
0207794a2b
Fix Qwen and Command tokenizers not used for logit bias
2024-09-17 13:01:19 +00:00
Alexander Abushady
eb6097b1e8
Fixed Aphrodite Seed
...
Aphrodite's seed value needs to be undefined to be random.
2024-09-16 08:21:56 -04:00
Cohee
2478e17164
Remove legacy API for Aphrodite
2024-09-15 08:05:53 +03:00
Cohee
50de678980
Hide beam search for vllm. It never worked.
2024-09-14 16:53:21 +03:00
Cohee
f0d361bc7a
Remove unused beam search
2024-09-14 16:41:22 +03:00
Cohee
28837ff883
Hard code include_stop_str_in_output
2024-09-14 16:32:50 +03:00
AlpinDale
1cc935796f
fix early_stopping
2024-09-14 12:45:29 +00:00
AlpinDale
fde76069e0
remove beam search
2024-09-14 12:42:21 +00:00
AlpinDale
9c94348491
clean up
2024-09-14 12:38:19 +00:00
AlpinDale
efd477da04
chore: slight refactor of aphrodite samplers
2024-09-13 10:34:06 +00:00
Cohee
7534e137ae
Parse Tabby streaming error.
2024-09-08 22:24:03 +03:00
Cohee
42fa3c79d7
Add Tabby model selection
2024-09-08 22:23:25 +03:00
Cohee
b16915cfb9
Remove truncation_length from textgen settings
2024-09-03 14:06:10 +00:00
Cohee
cd8dd5fc8a
XTC for ooba
2024-08-31 20:31:34 +03:00
Cohee
ae2d0f04ed
Add XTC for koboldcpp
2024-08-31 20:18:51 +03:00
Cohee
696c83f96d
[chore] Fix eslint
2024-08-19 21:36:28 +03:00
Cohee
d77363cd7c
Merge branch 'staging' into feat/xtc
2024-08-19 21:35:35 +03:00
Vitor
e28257096a
added xtc parameter for ooba
2024-08-19 01:32:45 -03:00
Cohee
9215dfd0c6
Replace macros in DRY sequence breakers
2024-08-18 13:50:58 +03:00
Wolfsblvt
28a9c45c31
/api-url slash command to get/set server url
2024-08-18 01:05:25 +02:00
Cohee
8ff4a4a36a
Don't modify legacy URL path for inappropriate API types
2024-08-01 15:01:38 +03:00
Cohee
e6e8d7726b
Generate random seed for HF endpoint
2024-07-26 12:50:39 +00:00
Cohee
5f2a73ac9f
Expose "Allow fallback providers" for OpenRouter
2024-07-19 23:34:16 +03:00
Cohee
b66e589b30
Don't use dynatemp for unsupported backends
2024-07-02 14:17:10 +00:00
Cohee
b62cbdeebd
Merge branch 'staging' into DarokCx/release
2024-06-28 19:09:12 +03:00
Cohee
bbb1a6e578
Add huggingface inference as text completion source
2024-06-28 18:17:27 +03:00
DarokCx
29ff0876a7
Added additional headers
2024-06-28 08:20:15 -04:00
DarokCx
bd5592de7b
Added featherless, connect button not working
2024-06-27 09:06:11 -04:00
Cohee
41ab90bb8e
Support more parameters for Infermatic
2024-06-24 19:16:20 +03:00
Cohee
a3dbcf3c2a
Fix context and response size not being passed to Ollama
2024-06-24 03:48:34 +03:00
Cohee
1dd21caa66
Adjust number of VLLM logprobs
2024-06-09 00:59:40 +03:00
Cohee
ff680f46cc
Add rep_pen_slope control for koboldcpp
2024-06-05 22:05:41 +03:00
kingbri
4528655bb7
Textgen: Add multiswipe support for TabbyAPI
...
Tabby now supports batching and the "n" parameter for both non-streaming
and streaming. Add this into SillyTavern.
Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-28 00:55:57 -04:00
Cohee
e8b96fec02
Merge branch 'staging' into new-samplers
2024-05-22 23:26:47 +03:00
kokansei
75a1ef4304
Add DRY Samplers to ST Staging ( #2211 )
...
* Add files via upload
* Add files via upload
* Delete public/index.html
* Add files via upload
* Delete public/scripts/textgen-settings.js
* Add files via upload
* Delete public/scripts/power-user.js
* Add files via upload
* Delete public/scripts/power-user.js
* Add files via upload
* Update power-user.js
* Update index.html
* Fix control attribution
* Fix app loading
* Put sequence breakers under DRY block
* DRY for DRY
* Update public/index.html
Co-authored-by: Philipp Emanuel Weidmann <pew@worldwidemann.com>
* Merge fix
* Add llamacpp control. Add default value for sequence breakers
* Forgot reset
---------
Co-authored-by: Cohee <18619528+Cohee1207@users.noreply.github.com>
Co-authored-by: Philipp Emanuel Weidmann <pew@worldwidemann.com>
2024-05-22 20:46:52 +03:00
kingbri
74b6ed97c2
Textgen: Add repetition decay for TabbyAPI
...
Repetition decay softens the drop off for repetition penalty. It's
best paired with rep pen range.
Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-22 00:09:10 -04:00
kingbri
99d143263d
Textgen: Add skew sampling
...
Adds the option from skew sampling from exllamaV2
Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-21 23:48:33 -04:00