Commit Graph

38 Commits

Author SHA1 Message Date
Henk
9e275de5d9 HF 4.35 2023-11-08 18:47:02 +01:00
Henk
829ea147fc Undo broken samplers 2023-10-14 20:38:35 +02:00
henk717
009c48ab10 Merge pull request #475 from StefanGliga/united
Add epsilon and eta sampling
2023-10-14 18:07:37 +02:00
Stefan Gligorijevic
05cc14f561 Add epsilon and eta sampling 2023-09-29 20:43:33 +02:00
ebolam
ac0f627616 Implemented abort function in model classes.
Horde stopper not working properly
2023-09-27 12:43:42 -04:00
somebody
93233068cc Merge branch 'united' of https://github.com/henk717/KoboldAI into wi-fixes 2023-07-30 13:52:20 -05:00
somebody
6b26cbbd0a Backends: Fix ReadOnly
Since somewhere in the pipeline ReadOnly is ignored, the bug wasn't
actually apparent unless using things like the Robot Button in WI cards.
2023-07-22 17:20:40 -05:00
somebody
e5d0a597a1 Generation Mode: UNTIL_EOS
This mode enables the EOS token and will generate infinitely until
hitting it.
2023-07-21 15:36:32 -05:00
somebody
8d5ae38b45 Context Menu: Show if gen mode is supported
- adds callback support to `enabledOn` in context menu items
- adds `supported_gen_modes` variable for frontend to check if a gen
  mode is supported
- adds `get_supported_gen_modes` to `InferenceModel` to get supported
  gen modes
- takes advantage of cool enum features for less enum-handling code
2023-07-21 14:29:41 -05:00
somebody
3a43b254b8 Add basic support for some of the quick stoppers 2023-07-21 13:27:30 -05:00
ebolam
13db6821fe Fix for default inference model is_valid and requested_parameters having vram as a required parameter. 2023-07-15 11:11:29 -04:00
somebody
bce1a907e5 Update aux device to depend on primary device 2023-07-03 19:36:31 -05:00
ebolam
06f59a7b7b Moved model backends to separate folders
added some model backend settings save/load
2023-05-18 20:14:33 -04:00
ebolam
182ecff202 Added in model backend to the command line arguments 2023-05-18 16:01:17 -04:00
ebolam
c6b17889d0 Updated to latest united 2023-05-12 07:53:27 -04:00
ebolam
4605d10c37 Next iteration. Model Loading is broken completely now :) 2023-05-11 12:08:35 -04:00
Henk
edd9c7d782 Warning polish 2023-05-11 15:13:59 +02:00
ebolam
71aee4dbd8 First concept of model plugins with a conceptual UI.
Completely breaks UI2 model loading.
2023-05-10 16:30:46 -04:00
henk717
724ba43dc1 Merge pull request #342 from one-some/model-structure-and-maybe-rwkv
Move overrides to better places
2023-05-03 03:34:17 +02:00
somebody
efe268df60 Move overrides to better places 2023-05-02 20:18:33 -05:00
Henk
de7b760048 Typo Fix 2023-05-03 01:02:50 +02:00
somebody
111028642e Fix tokenizer fallback for llama 2023-05-01 19:42:52 -05:00
onesome
9579298df7 Better fallback 2023-04-25 22:28:07 -05:00
somebody
4dcf570407 Fix legacy model loading 2023-04-15 12:57:35 -05:00
somebody
9d70646e4d Lazyload: Safetensors 2023-04-02 15:40:34 -05:00
somebody
ffe85ce8a1 Modeling: Fix logits processors (probs, biasing, lua) 2023-03-17 16:56:47 -05:00
somebody
8d0bc404a5 Model: More Jax import fixes and formatting 2023-03-17 15:36:44 -05:00
somebody
b93c339145 Model: Lazyload backends 2023-03-13 20:29:29 -05:00
somebody
bf8b60ac2d Model: Add GenericTokenizer
Because Hugging Face doesnt have a consistant API across their own
libraries
2023-03-13 17:36:58 -05:00
somebody
cd8ccf0a5e Modeling: Add seed parameter to raw_generate
Yahooo, decoupling from koboldai_vars. This makes the generation test
pass in `test_generation.py`, and makes full determinism outside of
core_generate work.
2023-03-12 21:49:10 -05:00
somebody
8c8bdfaf6a Model: Fix assorted bugs
and ignore warnings in pytest
2023-03-10 20:00:39 -06:00
somebody
52095054a3 Model: Reformat and clean up 2023-03-10 20:00:39 -06:00
somebody
fb0b2f0467 Model: Ditch awful current_model hack
thanks to whjms for spotting that this could be zapped
2023-03-10 20:00:38 -06:00
somebody
3b05359e4b Model: Refuse to serve certain models over the API 2023-03-07 16:54:08 -06:00
somebody
5d9bd96ad8 Model: Fix API support 2023-03-04 21:15:12 -06:00
somebody
70cddc46e2 Model: Small cleanup 2023-03-04 19:02:10 -06:00
somebody
27b7635c95 Model: Fix TPU 2023-03-04 19:02:00 -06:00
somebody
54cecd4d5d Model: And another refactor 2023-03-01 19:16:35 -06:00