mirror of
https://github.com/SillyTavern/SillyTavern.git
synced 2025-02-20 22:20:39 +01:00
Speculative ngram allows for a different method of speculative decoding. Using a draft model is still preferred. Signed-off-by: kingbri <bdashore3@proton.me>