From a92951f47e3391d786ca39bd5bddb1773e781bf0 Mon Sep 17 00:00:00 2001 From: anhad Date: Tue, 24 Oct 2023 10:08:34 +0530 Subject: [PATCH 1/2] Updated Readme.md --- README.md | 1 - 1 file changed, 1 deletion(-) diff --git a/README.md b/README.md index 789b78d1..f754e60b 100644 --- a/README.md +++ b/README.md @@ -68,7 +68,6 @@ Each edition features different models and requires different hardware to run, t | Model | Style | Description | | --- | --- | --- | -| [Pygmalion-6b](https://huggingface.co/PygmalionAI/pygmalion-6b) by Pygmalion AI | NSFW/Chat | Pymalion 6B is a proof-of-concept dialogue model based on EleutherAI's GPT-J-6B. Warning: This model is NOT suitable for use by minors. It will output X-rated content under certain circumstances. The fine-tuning dataset consisted of 56MB of dialogue data gathered from multiple sources, which includes both real and partially machine-generated conversations. | | [Nerys-6b](https://huggingface.co/KoboldAI/OPT-6B-nerys-v2) by Mr Seeker | Novel/Adventure | Nerys is a hybrid model based on Pike (A newer Janeway), on top of the Pike dataset you also get some Light Novels, Adventure mode support and a little bit of Shinen thrown in the mix. The end result is a very diverse model that is heavily biased towards SFW novel writing, but one that can go beyond its novel training and make for an excellent adventure model to. Adventure mode is best played from a second person perspective, but can be played in first or third person as well. Novel writing can be done best from the first or third person. | | [Erebus-6.7b](https://huggingface.co/KoboldAI/OPT-6.7B-Erebus) by Mr Seeker | NSFW | Erebus is our community's flagship NSFW model, being a combination of multiple large datasets that include Literotica, Shinen and erotic novels from Nerys and featuring thourough tagging support it covers the vast majority of erotic writing styles. This model is capable of replacing both the Lit and Shinen models in terms of content and style and has been well received as (one of) the best NSFW models out there. If you wish to use this model for commercial or non research usage we recommend choosing the 20B version as that one is not subject to the restrictive OPT license. | | [Skein-6b](https://huggingface.co/KoboldAI/GPT-J-6B-Skein) by Mr Seeker | Adventure | This model is designed for creative story generation. It can understand both free-form text and text written in interactive fiction style with actions starting with "> You". Trained with light novels and assorted interactive fiction. | From 3287328fe4f490d9509a1c6111026101161c01ef Mon Sep 17 00:00:00 2001 From: anhad Date: Wed, 25 Oct 2023 14:53:00 +0530 Subject: [PATCH 2/2] Update the model list in both Read.md and Colab markdown --- README.md | 25 ++++++++------ colab/GPU.ipynb | 91 +++++++++++++++++++++++++++---------------------- 2 files changed, 65 insertions(+), 51 deletions(-) diff --git a/README.md b/README.md index f754e60b..a5556754 100644 --- a/README.md +++ b/README.md @@ -68,17 +68,22 @@ Each edition features different models and requires different hardware to run, t | Model | Style | Description | | --- | --- | --- | -| [Nerys-6b](https://huggingface.co/KoboldAI/OPT-6B-nerys-v2) by Mr Seeker | Novel/Adventure | Nerys is a hybrid model based on Pike (A newer Janeway), on top of the Pike dataset you also get some Light Novels, Adventure mode support and a little bit of Shinen thrown in the mix. The end result is a very diverse model that is heavily biased towards SFW novel writing, but one that can go beyond its novel training and make for an excellent adventure model to. Adventure mode is best played from a second person perspective, but can be played in first or third person as well. Novel writing can be done best from the first or third person. | -| [Erebus-6.7b](https://huggingface.co/KoboldAI/OPT-6.7B-Erebus) by Mr Seeker | NSFW | Erebus is our community's flagship NSFW model, being a combination of multiple large datasets that include Literotica, Shinen and erotic novels from Nerys and featuring thourough tagging support it covers the vast majority of erotic writing styles. This model is capable of replacing both the Lit and Shinen models in terms of content and style and has been well received as (one of) the best NSFW models out there. If you wish to use this model for commercial or non research usage we recommend choosing the 20B version as that one is not subject to the restrictive OPT license. | -| [Skein-6b](https://huggingface.co/KoboldAI/GPT-J-6B-Skein) by Mr Seeker | Adventure | This model is designed for creative story generation. It can understand both free-form text and text written in interactive fiction style with actions starting with "> You". Trained with light novels and assorted interactive fiction. | -| [Adventure 6b](https://huggingface.co/KoboldAI/GPT-J-6B-Adventure) by Mr Seeker | Adventure | This is a clone of the AI Dungeon Classic model and is best known for the epic wackey adventures that AI Dungeon Classic players love. | -| [PPO-Pygway-6b](https://huggingface.co/KoboldAI/PPO_Pygway-6b-Mix) by TeH_Venom | Instruct-tuned Chat | PPO-Pygway is a model that merges together KoboldAI/GPT-J-6B-Janeway, reciprocate/ppo_hh_gpt-j, and Pygmalion/Pygmalion-6b; all three models were blended in a two step process using a simple weighted parameter method. This model may generate NSFW content. | -| [Janeway-6b](https://huggingface.co/KoboldAI/GPT-J-6B-Janeway) by Mr Seeker | Novel | GPT-J 6B-Janeway is a finetune created using EleutherAI's GPT-J 6B model. The training data contains around 2210 ebooks, mostly in the sci-fi and fantasy genres. The dataset is based on the same dataset used by GPT-Neo-2.7B-Picard, with 20% more data in various genres.| -| [Lit-6b](https://huggingface.co/hakurei/lit-6B) by Hakurei | Novel | Lit-6B is a GPT-J 6B model fine-tuned on 2GB of a diverse range of light novels, erotica, and annotated literature for the purpose of generating novel-like fictional text. The model used for fine-tuning is GPT-J, which is a 6 billion parameter auto-regressive language model trained on The Pile. | -| [Nerybus-6.7b](https://huggingface.co/KoboldAI/OPT-6.7B-Nerybus-Mix) by Concedo | Novel/NSFW | This model is based on OPT-6.7b-Erebus and Merged with Nerys retaining its NSFW knowledge, but was then further biased towards SFW novel stories. If you seek a balance between a SFW Novel model and a NSFW model this model should be a good choice. | -| [Shinen-6b](https://huggingface.co/KoboldAI/GPT-J-6B-Shinen) by Mr Seeker | Novel/NSFW | Shinen is an alternative to the OPT-license based Erebus model. While it is a Novel model it is unsuitable for SFW stories due to its heavy NSFW bias. Shinen will not hold back. It is meant to be used in KoboldAI's regular mode. | +| [Nerys](https://huggingface.co/KoboldAI/fairseq-dense-2.7B-Nerys) by Mr Seeker | Novel/Adventure | Nerys is a hybrid model based on Pike (A newer Janeway), on top of the Pike dataset you also get some Light Novels, Adventure mode support and a little bit of Shinen thrown in the mix. The end result is a very diverse model that is heavily biased towards SFW novel writing, but one that can go beyond its novel training and make for an excellent adventure model to. Adventure mode is best played from a second person perspective, but can be played in first or third person as well. Novel writing can be done best from the first or third person. | +| [Tiefighter 13B by KoboldAI](https://huggingface.co/KoboldAI/LLaMA2-13B-Tiefighter) | Hybrid | Tiefighter 13B is a versatile language model that merges the well-established base model, Undi95/Xwin-MLewd-13B-V0.2, incorporating models like Hermes, Beluga, Airoboros, and more. It's known for its creative capabilities. To enhance its functionality, two Loras are applied:1. [PocketDoc/Dans-RetroRodeo-13b (Lora 1)] introduces an adventure bias, controllable at 5% when merged with the base model.2. [Blackroot/Llama-2-13BStorywriter-LORA (Lora 2)] improves storywriting abilities and complements the adventure bias when merged at 10%.Tiefighter 13B is proficient in generating stories, chat, and following instructions.| +| [Janeway](https://huggingface.co/KoboldAI/GPT-Neo-2.7B-Janeway) by Mr Seeker | Novel | Janeway is a model created from Picard's dataset combined with a brand new collection of ebooks. This model is trained on 20% more content than Picard and has been trained on literature from various genres. Although the model is mainly focussed on SFW, romantic scenes might involve a degree of nudity. | +| [Picard](https://huggingface.co/KoboldAI/GPT-Neo-2.7B-Picard) by Mr Seeker | Novel | Picard is a model trained for SFW Novels based on Neo 2.7B. It is focused on Novel style writing without the NSFW bias. While the name suggests a sci-fi model this model is designed for Novels of a variety of genre's. It is meant to be used in KoboldAI's regular mode. | +| [AID](https://huggingface.co/KoboldAI/GPT-Neo-2.7B-AID) by melastacho | Adventure | Also know as Adventure 2.7B this is a clone of the AI Dungeon Classic model and is best known for the epic wackey adventures that AI Dungeon Classic players love. | +| [OPT](https://huggingface.co/facebook/opt-2.7b) by Metaseq | Generic | OPT is considered one of the best base models as far as content goes, its behavior has the strengths of both GPT-Neo and Fairseq Dense. Compared to Neo duplicate and unnecessary content has been left out, while additional literature was added in similar to the Fairseq Dense model. The Fairseq Dense model however lacks the broader data that OPT does have. The biggest downfall of OPT is its license, which prohibits any commercial usage, or usage beyond research purposes. | +| [Fairseq Dense](https://huggingface.co/KoboldAI/fairseq-dense-2.7B) | Generic | Trained by Facebook Researchers this model stems from the MOE research project within Fairseq. This particular version has been converted by us for use in KoboldAI. It is known to be on par with the larger models from EleutherAI and considered as better for pop culture and language tasks. Because the model has never seen a new line (enter) it may perform worse on formatting and paragraphing. Compared to other models the dataset focuses primarily on literature and contains little else. | +| [MythoMax 13B](https://huggingface.co/TheBloke/MythoMax-L2-13B-GPTQ) by Gryphe | Roleplay | An improved, potentially even perfected variant of MythoMix, my MythoLogic-L2 and Huginn merge using a highly experimental tensor type merge technique¹. | +| [Holomax 13B by KoboldAI](https://huggingface.co/KoboldAI/LLaMA2-13B-Holomax) | Adventure | This is an expansion merge to the well-praised MythoMax model from Gryphe (60%) using MrSeeker's KoboldAI Holodeck model (40%). The goal of this model is to enhance story-writing capabilities while preserving the desirable traits of the MythoMax model as much as possible (It does limit chat reply length). | +| [Airoboros 13B](https://huggingface.co/jondurbin/airoboros-13b) by Jon Durbin | Generic | This is an instruction fine-tuned llama-2 model, using synthetic instructions generated by airoboros⁵. | +| [Emerhyst 13B](https://huggingface.co/Undi95/Emerhyst-13B) by Undi | Roleplay | An attempt using BlockMerge_Gradient to get better result. In addition, LimaRP v3 was used⁷. | +| [Chronos 13B](https://huggingface.co/elinas/chronos-13b) by Elinas | Generic | This model is primarily focused on chat, roleplay, and storywriting, but can accomplish other tasks such as simple reasoning and coding. Chronos generates very long outputs with coherent text, largely due to the human inputs it was trained on. | +| [Spring Dragon by Henk717](https://huggingface.co/Henk717/spring-dragon) | Adventure | This model is a recreation attempt of the AI Dungeon 2 Dragon model. To achieve this, the "text_adventures.txt" dataset was used, which was bundled with the original AI Dungeon 2 GitHub release prior to the online service. It is worth noting that the same dataset file was used to create the Dragon model, where Dragon is a GPT-3 175B Davinci model from 2020. | +| [Holodeck By KoboldAI](https://huggingface.co/KoboldAI/LLAMA2-13B-Holodeck-1) | Adventure |LLAMA2 13B-Holodeck is a finetune created using Meta's llama 2 model.The training data contains around 3000 ebooks in various genres. Most parts of the dataset have been prepended using the following text: [Genre: , | +| [Neo](https://huggingface.co/EleutherAI/gpt-neo-2.7B) by EleutherAI | Generic | This is the base model for all the other 2.7B models, it is best used when you have a use case that we have no other models available for, such as writing blog articles or programming. It can also be a good basis for the experience of some of the softprompts if your softprompt is not about a subject the other models cover. | | [Various 2.7b models]() by various | Various smaller models are also possible to load in GPU colab. | | - ### Styles | Type | Description | diff --git a/colab/GPU.ipynb b/colab/GPU.ipynb index 533a5ce8..39af7d19 100644 --- a/colab/GPU.ipynb +++ b/colab/GPU.ipynb @@ -1,28 +1,10 @@ { - "nbformat": 4, - "nbformat_minor": 0, - "metadata": { - "colab": { - "name": "ColabKobold GPU", - "private_outputs": true, - "provenance": [], - "include_colab_link": true - }, - "kernelspec": { - "display_name": "Python 3", - "name": "python3" - }, - "language_info": { - "name": "python" - }, - "accelerator": "GPU" - }, "cells": [ { "cell_type": "markdown", "metadata": { - "id": "view-in-github", - "colab_type": "text" + "colab_type": "text", + "id": "view-in-github" }, "source": [ "\"Open" @@ -58,24 +40,26 @@ }, { "cell_type": "code", + "execution_count": null, "metadata": { "id": "ewkXkyiFP2Hq" }, + "outputs": [], "source": [ "#@title <-- Tap this if you play on Mobile { display-mode: \"form\" }\n", "%%html\n", "Press play on the music player to keep the tab alive, then start KoboldAI below (Uses only 13MB of data)
\n", "