Commit Graph

4401 Commits

Author SHA1 Message Date
somebody
ebe478458f Just use string sort on key
Doesn't seem to affect cache hit number
2023-07-03 19:59:53 -05:00
somebody
bce1a907e5 Update aux device to depend on primary device 2023-07-03 19:36:31 -05:00
somebody
6f7e6422ef Actually get correct primary device 2023-07-03 19:04:48 -05:00
somebody
59c731f805 Fix static primary_device
and some small cleanup
2023-07-03 18:37:48 -05:00
somebody
32917fd651 Remove unused file open 2023-07-03 17:51:54 -05:00
somebody
7f869a54d8 Just use accelerate on tpu 2023-07-03 17:18:48 -05:00
somebody
1bb2d2621c Make TPU in line with new lazyload behavior 2023-07-03 17:12:07 -05:00
somebody
31a3046a18 Load empty modules without accelerate 2023-07-03 17:07:18 -05:00
somebody
686c3d1592 Don't patch lazyload on TPU 2023-07-03 16:52:18 -05:00
one-some
39049e8a46 Merge pull request #20 from henk7171/accelerate-offloading
Accelerate offloading fixes
2023-07-03 16:06:20 -04:00
Henk
b603690e4c Link message fix 2023-07-03 14:28:33 +02:00
Henk
a0e21ad4e6 Restore finished loading messages 2023-07-03 14:16:21 +02:00
Henk
062e3ed27c Torch bump 2023-07-02 23:19:20 +02:00
Henk
4a45a0e551 Fix bar spam 2023-07-02 23:02:02 +02:00
Henk
51c4622694 Hide all Nvidia GPU's in CPU mode 2023-07-02 22:27:26 +02:00
Henk
81e72329af CPU fixes 2023-07-02 21:50:23 +02:00
0cc4m
0e4b6571d5 Fix non-tuple return from gptq function 2023-06-28 22:50:04 +02:00
henk717
c66f995c97 Merge pull request #382 from jojorne/jojorne-fix-move-delete-wi-item-for-ui1
Fix move and delete WI item for UI1
2023-06-28 00:30:29 +02:00
jojorne
edcd065980 Fix move and delete wi item for UI1 2023-06-27 17:48:30 -03:00
henk717
df70f92c9e Merge pull request #378 from jojorne/jojorne-patch-fix-wi-for-ui1
Fix WI for UI1
2023-06-27 14:04:20 +02:00
0cc4m
c753671ac1 Add exllama superhot positional embeddings compression support 2023-06-27 07:39:37 +02:00
henk717
b5bdb1d380 Merge pull request #377 from ebolam/Model_Plugins
Fix for model backends that use toggles always returning true
2023-06-26 01:54:25 +02:00
jojorne
f8962d0636 Fix WI for UI1 2023-06-22 18:04:43 -03:00
one-some
e62e3560bf Merge pull request #19 from henk7171/accelerate-offloading
Remove wrong usegpu behavior
2023-06-22 15:05:03 -05:00
Henk
1da4580e8b Remove wrong usegpu behavior 2023-06-22 07:07:02 +02:00
somebody
5ee20bd7d6 Fix for CPU loading 2023-06-21 21:18:43 -05:00
somebody
b81f61b820 Clean debug 2023-06-21 18:35:56 -05:00
somebody
e319d383f6 Merge branch 'Model_Plugins' of https://github.com/ebolam/KoboldAI into accelerate-offloading 2023-06-21 18:25:22 -05:00
ebolam
03a0542f71 Fix for model backends that use toggles always returning true 2023-06-21 19:19:12 -04:00
somebody
d4b923a054 Remove debug 2023-06-21 17:41:15 -05:00
somebody
5278174a62 Materialize on cpu 2023-06-21 17:40:47 -05:00
somebody
947bcc58e4 Experiments 2023-06-21 17:33:14 -05:00
somebody
0012158eac Remove old 2023-06-21 16:58:59 -05:00
somebody
6bdcf2645e Merge branch 'united' of https://github.com/henk717/KoboldAI into accelerate-offloading 2023-06-21 16:58:39 -05:00
somebody
c40649a74e Probably fix f32 2023-06-21 16:54:41 -05:00
somebody
70f113141c Fix Transformers 4.30 2023-06-21 16:40:12 -05:00
somebody
c56214c275 Fix loading bar 2023-06-21 16:27:22 -05:00
Henk
adf108ecd6 TPU link fix 2023-06-21 22:21:22 +02:00
Henk
b41b868528 Remove duplicate links 2023-06-21 22:13:55 +02:00
Henk
a13c7d0f40 New link messages 2023-06-21 21:55:18 +02:00
Henk
fc4d659e13 Merge branch 'main' into united 2023-06-21 21:32:55 +02:00
somebody
aca2b532d7 Remove debug 2023-06-21 14:15:38 -05:00
somebody
5f224e1366 Restore choice of lazyload or not 2023-06-21 14:13:14 -05:00
somebody
0052ad401a Basic breakmodel ui support
Seems to work
2023-06-21 13:57:32 -05:00
Henk
0c19855587 HF_Hub bump 2023-06-21 19:40:07 +02:00
Henk
bbecdaeedb Silently disable MTJ when Jax is not installed 2023-06-21 17:08:45 +02:00
0cc4m
adad81639d Remove rocm gptq install from environments file 2023-06-21 15:47:46 +02:00
0cc4m
e8741a1b57 Disable scaled_dot_product_attention if torch version < 2 2023-06-20 09:19:43 +02:00
0cc4m
a191855b37 Track token generation progress 2023-06-19 19:14:26 +02:00
0cc4m
e874f0c1c2 Add token streaming support for exllama 2023-06-19 19:14:26 +02:00