Commit Graph

4221 Commits

Author SHA1 Message Date
0cc4m
ecd065a881 Overhaul 4-bit support to load with a toggle 2023-03-21 21:40:59 +00:00
0cc4m
4cfc1219d4 Add gptq as submodule 2023-03-20 19:13:46 +00:00
0cc4m
3b7505dc28 Merge remote-tracking branch 'united/united' into 4bit 2023-03-20 19:06:40 +00:00
0cc4m
858657f669 Fix zipfile folder identification fix for Windows 2023-03-20 09:16:30 +01:00
somebody
91bb433b5f GenericTokenizer: Fall back to defined tokenizer
Shouldn't be relied on for model-agnostic code, but for loading
processes where you know the tokenizer class used it should be okie
dokie
2023-03-19 19:03:20 -05:00
0cc4m
60acf59316 Improve 4-bit llama support, add 4-bit gptj and gptneox support 2023-03-19 21:20:13 +00:00
henk717
94eb8ff825 TPU Message 2023-03-19 14:52:14 +01:00
Llama
5b8db52abb Merge pull request #26 from henk717/united
Merge united
2023-03-17 23:32:27 -07:00
somebody
864f9ed8c3 Add small debug for TPU 2023-03-17 17:06:48 -05:00
somebody
ffe85ce8a1 Modeling: Fix logits processors (probs, biasing, lua) 2023-03-17 16:56:47 -05:00
somebody
692dbfeb37 Merge branch 'united' of https://github.com/henk717/KoboldAI into model-structure-and-maybe-rwkv 2023-03-17 16:20:13 -05:00
somebody
8d0bc404a5 Model: More Jax import fixes and formatting 2023-03-17 15:36:44 -05:00
Henk
90a7eb6153 LLama tokenizer settings 2023-03-17 12:40:08 +01:00
Henk
1235b71bb5 Merge branch 'main' into united 2023-03-17 01:48:10 +01:00
Henk
219b824b9b SocketIO Requirements Pin 2023-03-17 01:28:59 +01:00
henk717
86c87b23c0 Merge branch 'KoboldAI:main' into united 2023-03-17 01:17:58 +01:00
0cc4m
5d17692c79 Remove except Exception so that errors actually show up 2023-03-16 05:24:58 +00:00
YellowRoseCx
b3b454bbe4 Update huggingface.yml 2023-03-15 00:03:43 -05:00
YellowRoseCx
bf677a32f6 Merge remote-tracking branch 'catboxanon/test/4bit' into yr4bit 2023-03-14 17:09:06 -05:00
YellowRoseCx
2909910bcc Merge branch 'henk717:united' into dev-yr 2023-03-14 17:04:33 -05:00
jojorne
8378ff9e26 Fix typo 2023-03-14 04:20:32 -03:00
jojorne
8d5a581d5d Enable renaming deleting wi root folder by creating a new one 2023-03-14 04:04:01 -03:00
somebody
03af06638c Modeling: Maybe fix samplers 2023-03-13 20:42:35 -05:00
somebody
65b60085e3 Undo debug 2023-03-13 20:30:46 -05:00
somebody
b93c339145 Model: Lazyload backends 2023-03-13 20:29:29 -05:00
somebody
adc11fdbc9 TPUMTJ: Fix loading bar
I don't know why it works but I know it works
2023-03-13 20:13:05 -05:00
somebody
938c97b75a RWKV: Fix yet another typo 2023-03-13 19:39:19 -05:00
somebody
3adc67c7a4 RWKV: Move import right before usage
So we don't needlessly compile the cuda kernel
2023-03-13 19:37:45 -05:00
somebody
14b2543c7c RWKV: Fix typo 2023-03-13 19:36:58 -05:00
somebody
b10b201701 Model: Add basic RWKV implementation 2023-03-13 19:34:38 -05:00
henk717
db7b53f52d Merge pull request #310 from nkpz/united
Fix out of range error after editing actions
2023-03-14 01:22:14 +01:00
catboxanon
5f3770bb58 Merge branch 'henk717:united' into test/4bit 2023-03-13 19:34:17 -04:00
somebody
bf8b60ac2d Model: Add GenericTokenizer
Because Hugging Face doesn't have a consistent API across their own
libraries
2023-03-13 17:36:58 -05:00
henk717
5249045c35 Merge pull request #304 from YellowRoseCx/united-yr
added local rng_states variable and fixed minor typo
2023-03-13 22:46:00 +01:00
somebody
60793eb121 Make modellist easier to work with 2023-03-13 15:40:24 -05:00
henk717
c96e96f95e Merge pull request #307 from jojorne/jojorne-patch-fix-save-loading-with-wi-features
Fix save loading between v1 and v2 to v3 with wi features
2023-03-13 21:40:11 +01:00
somebody
0320678b27 Model: WIP horde and API tests 2023-03-13 14:11:06 -05:00
Henk
8da04a98a4 Better Runtime Isolation 2023-03-13 18:41:25 +01:00
Nick Perez
0dce4c700f Just reverse the range 2023-03-13 07:00:51 -04:00
Nick Perez
b4b24f1389 Fix out of range after deletion in for loop 2023-03-13 06:21:25 -04:00
somebody
cd8ccf0a5e Modeling: Add seed parameter to raw_generate
Yahooo, decoupling from koboldai_vars. This makes the generation test
pass in `test_generation.py`, and makes full determinism outside of
core_generate work.
2023-03-12 21:49:10 -05:00
jojorne
4b8d4cde7d fix spacing 2023-03-12 20:41:34 -03:00
jojorne
4219e3e8d3 Removing the root folder is not supported 2023-03-12 20:38:58 -03:00
jojorne
e5c1b0506a Renaming the root folder is not supported 2023-03-12 20:05:40 -03:00
catboxanon
bde31217f1 improve model None check 2023-03-11 12:15:58 -05:00
catboxanon
1808b0d2ec Another safety check for if model is not loaded 2023-03-11 12:13:22 -05:00
jojorne
53f06903c2 revert more unrelated code 2023-03-11 13:54:01 -03:00
jojorne
c87ef60db1 revert more unrelated code 2023-03-11 13:48:41 -03:00
jojorne
e4ad8547a7 revert unrelated code 2023-03-11 13:39:19 -03:00
jojorne
47242e9abe remove debug code 2023-03-11 13:21:22 -03:00