Commit Graph

28 Commits

Author SHA1 Message Date
ebolam
13db6821fe Fix for default inference model is_valid and requested_parameters having vram as a required parameter. 2023-07-15 11:11:29 -04:00
somebody
bce1a907e5 Update aux device to depend on primary device 2023-07-03 19:36:31 -05:00
ebolam
06f59a7b7b Moved model backends to separate folders
added some model backend settings save/load
2023-05-18 20:14:33 -04:00
ebolam
182ecff202 Added in model backend to the command line arguments 2023-05-18 16:01:17 -04:00
ebolam
c6b17889d0 Updated to latest united 2023-05-12 07:53:27 -04:00
ebolam
4605d10c37 Next iteration. Model Loading is broken completely now :) 2023-05-11 12:08:35 -04:00
Henk
edd9c7d782 Warning polish 2023-05-11 15:13:59 +02:00
ebolam
71aee4dbd8 First concept of model plugins with a conceptual UI.
Completely breaks UI2 model loading.
2023-05-10 16:30:46 -04:00
henk717
724ba43dc1 Merge pull request #342 from one-some/model-structure-and-maybe-rwkv
Move overrides to better places
2023-05-03 03:34:17 +02:00
somebody
efe268df60 Move overrides to better places 2023-05-02 20:18:33 -05:00
Henk
de7b760048 Typo Fix 2023-05-03 01:02:50 +02:00
somebody
111028642e Fix tokenizer fallback for llama 2023-05-01 19:42:52 -05:00
onesome
9579298df7 Better fallback 2023-04-25 22:28:07 -05:00
somebody
4dcf570407 Fix legacy model loading 2023-04-15 12:57:35 -05:00
somebody
9d70646e4d Lazyload: Safetensors 2023-04-02 15:40:34 -05:00
somebody
ffe85ce8a1 Modeling: Fix logits processors (probs, biasing, lua) 2023-03-17 16:56:47 -05:00
somebody
8d0bc404a5 Model: More Jax import fixes and formatting 2023-03-17 15:36:44 -05:00
somebody
b93c339145 Model: Lazyload backends 2023-03-13 20:29:29 -05:00
somebody
bf8b60ac2d Model: Add GenericTokenizer
Because Hugging Face doesnt have a consistant API across their own
libraries
2023-03-13 17:36:58 -05:00
somebody
cd8ccf0a5e Modeling: Add seed parameter to raw_generate
Yahooo, decoupling from koboldai_vars. This makes the generation test
pass in `test_generation.py`, and makes full determinism outside of
core_generate work.
2023-03-12 21:49:10 -05:00
somebody
8c8bdfaf6a Model: Fix assorted bugs
and ignore warnings in pytest
2023-03-10 20:00:39 -06:00
somebody
52095054a3 Model: Reformat and clean up 2023-03-10 20:00:39 -06:00
somebody
fb0b2f0467 Model: Ditch awful current_model hack
thanks to whjms for spotting that this could be zapped
2023-03-10 20:00:38 -06:00
somebody
3b05359e4b Model: Refuse to serve certain models over the API 2023-03-07 16:54:08 -06:00
somebody
5d9bd96ad8 Model: Fix API support 2023-03-04 21:15:12 -06:00
somebody
70cddc46e2 Model: Small cleanup 2023-03-04 19:02:10 -06:00
somebody
27b7635c95 Model: Fix TPU 2023-03-04 19:02:00 -06:00
somebody
54cecd4d5d Model: And another refactor 2023-03-01 19:16:35 -06:00