mirror of
https://github.com/SillyTavern/SillyTavern.git
synced 2025-03-03 19:37:46 +01:00
Speculative ngram allows for a different method of speculative decoding. Using a draft model is still preferred. Signed-off-by: kingbri <bdashore3@proton.me>