Skip to Main Content

Reddit Crushes Post Fine-tune

Online Experience

Reddit Crush Online

Results

Untitled

Untitled2

Untitled3

Data Description

The training data comes from the Reddit Crushes subreddit

All posts under the Crushes topic were collected.

Brief Introduction

Collect post data from Reddit Crush, convert it to plain text data, and fine-tune training on the TheBloke/Wizard-Vicuna-7B-Uncensored-HF model.

Tutorial

Train Fine-Tune Model

  1. Install text-generation-webui

    Follow the installation guide of text-generation-webui to install text-generation-webui.

  2. Start text-generation-webui, select Model from the top tab Untitled

  3. In the Model tab, enter TheBloke/Wizard-Vicuna-7B-Uncensored-HF, and then click Download to download the base model (you can also manually download the model and put it in the models directory of the text-generation-webui installation directory). Untitled

  4. Download the prepared reddit crush dataset and place the dataset in the training/datasets directory of the text-generation-webui installation directory.

    Untitled

  5. Switch to the training tab of text-generation-webui

    Untitled

  6. Select the reddit crush dataset

    Switch to Raw text file and select the reddit crush dataset

    Untitled

  7. Start training

    Use the default parameters for training. If you want to increase the context length, you can increase the cutoff parameter.

    Click Start LoRA Training to start training.

    After training starts, you can see the training progress in text-generation-webui.

    Untitled

    Wait for the model training to complete. Usually, this will take 1-8 hours.

    The product of the training process is saved in the lora directory. You can also interrupt the training midway and use the existing checkpoint model in the lora directory.

    Untitled

  8. Use the model

    Manually create a folder named reddit_crush in the lora directory.

    Copy the latest checkpoint model from the folder to the lora directory.

    Untitled

Use Fine-Tune Model

  1. Switch to the Model tab, and first select Model Untitled

  2. Select LoRA. If the choice of Lora does not appear, please click the refresh button on the right or check whether the LoRA model has been placed in the LoRA directory of text-generation-webui.

    Untitled

  3. Apply LoRA, click Apply the selected LoRAs Untitled

  4. After the application is successful, you can see the prompt Untitled

  5. Use the model

    Switch to the Text generation tab, then output the beginning of the story, click Generate, and the continuation of the story will be automatically generated on the right side. Untitled