Cohee
00f0f755fc
Support comma-separated list of llama.cpp sequence breakers #3026
2024-10-28 11:44:26 +00:00
Cohee
542f77aeb8
Safe sequence breakers parse
2024-10-28 11:39:59 +00:00
Beinsezii
ace2902cb8
llama.cpp Enable dry w/ array convert
The new PR that was merged needs an array instead of a str
https://github.com/ggerganov/llama.cpp/pull/9702
2024-10-26 16:07:07 -07:00
Cohee
725cfdebca
Merge pull request #2993 from theo77186/llamacpp_xtc
Enable XTC support for llama.cpp
2024-10-16 20:52:06 +03:00
Cohee
276316ae62
Auto-add missing samplers for llama.cpp
2024-10-16 20:51:32 +03:00
theo77186
37f3a24ac6
Enable XTC support for llama.cpp
2024-10-16 18:30:46 +02:00
Cohee
24884b3206
TextGenWebUI: remove legacy API mode
2024-10-15 00:32:56 +03:00
Cohee
80f9d90918
Merge branch 'staging' into featherless-models
2024-10-09 21:51:59 +03:00
Cohee
151a77cd36
Initialize page on load
2024-10-09 21:51:31 +03:00
Cohee
3335dbf1a7
Add empty tool calls to streaming processors
2024-10-02 01:59:53 +03:00
Cohee
ac1bc95551
Xtra Type Cafety
2024-09-28 13:59:03 +03:00
Cohee
b1bb41313f
Insert missing ooba samplers
2024-09-28 13:56:57 +03:00
Cohee
8bd1c88d15
Enable XTC
2024-09-28 13:52:55 +03:00
Cohee
e59dacc27f
Merge branch 'staging' into xtc-ooba
2024-09-28 13:49:49 +03:00
Cohee
aaf8fa13e5
Remove condition
2024-09-21 13:18:52 +03:00
AlpinDale
a5fea35f12
chore: update aphrodite samplers
2024-09-21 08:18:06 +00:00
Cohee
0207794a2b
Fix Qwen and Command tokenizers not used for logit bias
2024-09-17 13:01:19 +00:00
Alexander Abushady
eb6097b1e8
Fixed Aphrodite Seed
Aphrodite's seed value needs to be undefined to be random.
2024-09-16 08:21:56 -04:00
Cohee
2478e17164
Remove legacy API for Aphrodite
2024-09-15 08:05:53 +03:00
Cohee
50de678980
Hide beam search for vllm. It never worked.
2024-09-14 16:53:21 +03:00
Cohee
f0d361bc7a
Remove unused beam search
2024-09-14 16:41:22 +03:00
Cohee
28837ff883
Hard code include_stop_str_in_output
2024-09-14 16:32:50 +03:00
AlpinDale
1cc935796f
fix early_stopping
2024-09-14 12:45:29 +00:00
AlpinDale
fde76069e0
remove beam search
2024-09-14 12:42:21 +00:00
AlpinDale
9c94348491
clean up
2024-09-14 12:38:19 +00:00
AlpinDale
efd477da04
chore: slight refactor of aphrodite samplers
2024-09-13 10:34:06 +00:00
Cohee
7534e137ae
Parse Tabby streaming error.
2024-09-08 22:24:03 +03:00
Cohee
42fa3c79d7
Add Tabby model selection
2024-09-08 22:23:25 +03:00
Cohee
b16915cfb9
Remove truncation_length from textgen settings
2024-09-03 14:06:10 +00:00
Cohee
cd8dd5fc8a
XTC for ooba
2024-08-31 20:31:34 +03:00
Cohee
ae2d0f04ed
Add XTC for koboldcpp
2024-08-31 20:18:51 +03:00
Cohee
696c83f96d
[chore] Fix eslint
2024-08-19 21:36:28 +03:00
Cohee
d77363cd7c
Merge branch 'staging' into feat/xtc
2024-08-19 21:35:35 +03:00
Vitor
e28257096a
added xtc parameter for ooba
2024-08-19 01:32:45 -03:00
Cohee
9215dfd0c6
Replace macros in DRY sequence breakers
2024-08-18 13:50:58 +03:00
Wolfsblvt
28a9c45c31
/api-url slash command to get/set server url
2024-08-18 01:05:25 +02:00
Cohee
8ff4a4a36a
Don't modify legacy URL path for inappropriate API types
2024-08-01 15:01:38 +03:00
Cohee
e6e8d7726b
Generate random seed for HF endpoint
2024-07-26 12:50:39 +00:00
Cohee
5f2a73ac9f
Expose "Allow fallback providers" for OpenRouter
2024-07-19 23:34:16 +03:00
Cohee
b66e589b30
Don't use dynatemp for unsupported backends
2024-07-02 14:17:10 +00:00
Cohee
b62cbdeebd
Merge branch 'staging' into DarokCx/release
2024-06-28 19:09:12 +03:00
Cohee
bbb1a6e578
Add huggingface inference as text completion source
2024-06-28 18:17:27 +03:00
DarokCx
29ff0876a7
Added additional headers
2024-06-28 08:20:15 -04:00
DarokCx
bd5592de7b
Added featherless, connect button not working
2024-06-27 09:06:11 -04:00
Cohee
41ab90bb8e
Support more parameters for Infermatic
2024-06-24 19:16:20 +03:00
Cohee
a3dbcf3c2a
Fix context and response size not being passed to Ollama
2024-06-24 03:48:34 +03:00
Cohee
1dd21caa66
Adjust number of VLLM logprobs
2024-06-09 00:59:40 +03:00
Cohee
ff680f46cc
Add rep_pen_slope control for koboldcpp
2024-06-05 22:05:41 +03:00
kingbri
4528655bb7
Textgen: Add multiswipe support for TabbyAPI
Tabby now supports batching and the "n" parameter for both non-streaming
and streaming. Add this into SillyTavern.
Signed-off-by: kingbri <bdashore3@proton.me>
2024-05-28 00:55:57 -04:00
Cohee
e8b96fec02
Merge branch 'staging' into new-samplers
2024-05-22 23:26:47 +03:00