ebolam
13db6821fe
Fix for default inference model is_valid and requested_parameters having vram as a required parameter.
2023-07-15 11:11:29 -04:00
somebody
bce1a907e5
Update aux device to depend on primary device
2023-07-03 19:36:31 -05:00
ebolam
06f59a7b7b
Moved model backends to separate folders
...
added some model backend settings save/load
2023-05-18 20:14:33 -04:00
ebolam
182ecff202
Added in model backend to the command line arguments
2023-05-18 16:01:17 -04:00
ebolam
c6b17889d0
Updated to latest united
2023-05-12 07:53:27 -04:00
ebolam
4605d10c37
Next iteration. Model Loading is broken completely now :)
2023-05-11 12:08:35 -04:00
Henk
edd9c7d782
Warning polish
2023-05-11 15:13:59 +02:00
ebolam
71aee4dbd8
First concept of model plugins with a conceptual UI.
...
Completely breaks UI2 model loading.
2023-05-10 16:30:46 -04:00
henk717
724ba43dc1
Merge pull request #342 from one-some/model-structure-and-maybe-rwkv
...
Move overrides to better places
2023-05-03 03:34:17 +02:00
somebody
efe268df60
Move overrides to better places
2023-05-02 20:18:33 -05:00
Henk
de7b760048
Typo Fix
2023-05-03 01:02:50 +02:00
somebody
111028642e
Fix tokenizer fallback for llama
2023-05-01 19:42:52 -05:00
onesome
9579298df7
Better fallback
2023-04-25 22:28:07 -05:00
somebody
4dcf570407
Fix legacy model loading
2023-04-15 12:57:35 -05:00
somebody
9d70646e4d
Lazyload: Safetensors
2023-04-02 15:40:34 -05:00
somebody
ffe85ce8a1
Modeling: Fix logits processors (probs, biasing, lua)
2023-03-17 16:56:47 -05:00
somebody
8d0bc404a5
Model: More Jax import fixes and formatting
2023-03-17 15:36:44 -05:00
somebody
b93c339145
Model: Lazyload backends
2023-03-13 20:29:29 -05:00
somebody
bf8b60ac2d
Model: Add GenericTokenizer
...
Because Hugging Face doesnt have a consistant API across their own
libraries
2023-03-13 17:36:58 -05:00
somebody
cd8ccf0a5e
Modeling: Add seed parameter to raw_generate
...
Yahooo, decoupling from koboldai_vars. This makes the generation test
pass in `test_generation.py`, and makes full determinism outside of
core_generate work.
2023-03-12 21:49:10 -05:00
somebody
8c8bdfaf6a
Model: Fix assorted bugs
...
and ignore warnings in pytest
2023-03-10 20:00:39 -06:00
somebody
52095054a3
Model: Reformat and clean up
2023-03-10 20:00:39 -06:00
somebody
fb0b2f0467
Model: Ditch awful current_model hack
...
thanks to whjms for spotting that this could be zapped
2023-03-10 20:00:38 -06:00
somebody
3b05359e4b
Model: Refuse to serve certain models over the API
2023-03-07 16:54:08 -06:00
somebody
5d9bd96ad8
Model: Fix API support
2023-03-04 21:15:12 -06:00
somebody
70cddc46e2
Model: Small cleanup
2023-03-04 19:02:10 -06:00
somebody
27b7635c95
Model: Fix TPU
2023-03-04 19:02:00 -06:00
somebody
54cecd4d5d
Model: And another refactor
2023-03-01 19:16:35 -06:00