Commit Graph

4558 Commits

Author SHA1 Message Date
ebolam
869bcadd03 Fix for toggles showing as check boxes in model loading
Fix for resubmit_model_info loosing selected model backend
2023-07-15 15:48:31 -04:00
ebolam
aaec472f73 Fix for toggles not showing up right in model backend settings 2023-07-15 15:20:18 -04:00
ebolam
bc337bf090 Merge branch 'henk717:united' into Model_Plugins 2023-07-15 15:02:00 -04:00
Henk
0622810bc4 Better way of doing the if statement 2023-07-15 20:00:29 +02:00
Henk
521d6ff227 Clean-up accidental uploads 2023-07-15 19:55:37 +02:00
Henk
23a104a4fe Only show 4-bit toggle on valid model 2023-07-15 19:42:26 +02:00
Henk
71b6e8d6d4 Fix accidental parameters overwrite 2023-07-15 19:35:40 +02:00
Henk
c43d60772b BnB dependency check 2023-07-15 18:56:13 +02:00
Henk
160effb9ea Add 4-bit BnB toggle 2023-07-15 18:20:10 +02:00
ebolam
13db6821fe Fix for default inference model is_valid and requested_parameters having vram as a required parameter. 2023-07-15 11:11:29 -04:00
Henk
c833ab23e4 Add scipy 2023-07-15 14:57:42 +02:00
Henk
2c50d5d092 Don't ruin breakmodel 2023-07-15 14:14:06 +02:00
Henk
003105ae8b Bump BnB 2023-07-15 04:26:58 +02:00
Henk
1f045110a4 Basic 4-bit backend 2023-07-15 02:49:31 +02:00
henk717
6dd73287c9 Merge pull request #391 from one-some/basic-hf-backend
Basic HF backend
2023-07-15 02:20:38 +02:00
onesome
7e2e75070b Fix prioritization in load from dir
In the future the module/backend list should probably contain objects
that would make this whole deal a lot less hackier.
2023-07-14 18:47:52 -05:00
onesome
afa8766ea6 Add is_valid 2023-07-14 18:01:18 -05:00
somebody
8549c7c896 Basic backend module prioritization
not secure; we're loading these modules so they can obviously execute
code that manipulates the prioritization
2023-07-12 19:03:49 -05:00
somebody
f67cb7fa05 Make basic hf independant from hf 2023-07-12 18:36:30 -05:00
somebody
d17ce8461d Use device_map="auto" 2023-07-12 17:27:48 -05:00
somebody
60473d4c23 Fix and add some documentation to basic hf backend 2023-07-12 17:16:05 -05:00
Henk
1fed1b0524 New dependencies 2023-07-12 22:00:17 +02:00
onesome
8077d6c3f9 Self-contained sampler patch (Don't merge)
Completely untested 3:00 AM code; beware! I will test and add more
documentation tomorrow.
2023-07-12 03:22:43 -05:00
henk717
922fcdd6ac Merge pull request #392 from ebolam/Model_Plugins
Fix for UI2 model loading not showing progress
2023-07-11 19:02:17 +02:00
ebolam
56e1c5df3a Fix for UI2 model loading not showing progress 2023-07-10 20:59:16 -04:00
henk717
95913aebb8 Merge pull request #390 from one-some/accelerate-offloading
Fix bleeding edge model loads and add lazyload fallback
2023-07-09 01:02:12 +02:00
somebody
20b4b4bcef Add basic hf backend 2023-07-08 17:12:16 -05:00
somebody
f9c38acea8 Merge branch 'accelerate-offloading' into basic-hf-backend 2023-07-08 17:04:23 -05:00
somebody
3928d86339 Fall back to unpatched HF 2023-07-08 14:36:45 -05:00
somebody
c2ee30af32 Add --panic to raise when loading fails 2023-07-08 14:04:46 -05:00
somebody
fd6f66a98d Patch _rebuild_from_type_v2 to not try converting LazyTensors to Tensors 2023-07-08 13:57:05 -05:00
henk717
60965b7b0c Merge pull request #389 from one-some/accelerate-offloading
Stub seek_offset for cache sorting in load loop
2023-07-07 22:40:08 +02:00
somebody
802929f5f2 Patch safetensors again 2023-07-07 14:54:40 -05:00
somebody
35f3687667 Merge branch 'united' of https://github.com/henk717/KoboldAI into accelerate-offloading 2023-07-07 14:54:12 -05:00
somebody
cfe1f5b514 Stub seek_offset for cache sorting in load loop
The way Safetensors individual weight loading is implemented doesn't
take full advantage of the cache ordering system thing, so this can just
be left at zero for now.
2023-07-07 14:49:46 -05:00
Henk
76d21bb142 Universal ziproot for lazy_loader 2023-07-07 13:37:32 +02:00
henk717
f0d161b9c6 Merge pull request #387 from LostRuins/concedo_united
Updated Kobold Lite to v46
2023-07-06 15:51:04 +02:00
Concedo
e3d8443edf Updated Kobold Lite to v46 2023-07-06 21:41:51 +08:00
Henk
548db92df5 No longer pin United 2023-07-06 14:55:54 +02:00
henk717
58c2dc08ae Merge pull request #386 from one-some/accelerate-offloading
Use VE's patched load_from_state_dict on TPU for loading empty weights
2023-07-06 14:54:54 +02:00
somebody
6b83944e9b Use VE's patched load_from_state_dict on TPU for loading empty weights 2023-07-05 18:36:57 -05:00
Henk
c72c0d0052 Pin United branch properly 2023-07-05 03:17:00 +02:00
Henk
22cf752411 Revert United colab pin since it doesn't work 2023-07-05 02:38:04 +02:00
Henk
1769691eda Possible fix 2 2023-07-05 02:35:16 +02:00
Henk
a192c74058 Possible colab URL fix 2023-07-05 02:28:48 +02:00
Henk
def39f4118 Typo fix 2023-07-05 00:02:29 +02:00
Henk
041d4135c8 Route TPU to stable commit while we fix the loader 2023-07-05 00:01:44 +02:00
Henk
16240878bc Restore --peft support 2023-07-04 20:42:29 +02:00
henk717
94c62a4f90 Merge pull request #385 from one-some/accelerate-offloading
Accelerate offloading
2023-07-04 03:14:51 +02:00
somebody
ebe478458f Just use string sort on key
Doesn't seem to affect cache hit number
2023-07-03 19:59:53 -05:00