Running An LLM Locally Is Easy: Install Your Own Mini ChatGPT

Summary

This article provides a guide on how to install and run a local AI chatbot on a Windows PC using the text-generation-webui interface and the dolphin-2.1-mistral model.

Abstract

The article discusses the limitations of popular AI models like ChatGPT and Gemini, which are only available online and lack customization options. It then introduces an alternative solution, which is to install a local AI chatbot on a personal computer. The author shares their experience of setting up a private AI chatbot on their Windows PC using the text-generation-webui interface and the dolphin-2.1-mistral model. The article provides step-by-step instructions on how to install the UI, download the weights, and start the chat. The author also recommends using the biggest model that the GPU can support to get decent results.

Bullet points

ChatGPT and Gemini are impressive AI models, but they are only available online and lack customization options.
An alternative solution is to install a local AI chatbot on a personal computer.
The author shares their experience of setting up a private AI chatbot on their Windows PC using the text-generation-webui interface and the dolphin-2.1-mistral model.
Step-by-step instructions are provided on how to install the UI, download the weights, and start the chat.
It is recommended to use the biggest model that the GPU can support to get decent results.
The author recommends using the dolphin-2.1-mistral-7b.Q6_K.gguf model for good results on a 3060 with 12 GB of VRAM.
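
To make "the biggest model that the GPU can support" a little more concrete, here is a back-of-the-envelope sketch. The figures are assumptions that do not come from the article: roughly 7.24 billion parameters for Mistral 7B, about 6.56 bits per weight for Q6_K quantization, and a hypothetical 1.5 GB of headroom for context and buffers. Treat the result as an estimate, not a guarantee.

```python
# Rough estimate of whether a quantized model file fits in GPU memory.
# All numeric inputs below are assumptions, not values from the article.

def gguf_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate file size in GB (1 GB = 1e9 bytes) for a quantized model."""
    return n_params * bits_per_weight / 8 / 1e9


def fits_in_vram(size_gb: float, vram_gb: float, headroom_gb: float = 1.5) -> bool:
    """True if the weights plus a hypothetical safety margin fit in VRAM."""
    return size_gb + headroom_gb <= vram_gb


# Assumed: ~7.24e9 parameters, ~6.5625 bits per weight for Q6_K.
q6k_size = gguf_size_gb(7.24e9, 6.5625)
print(f"Estimated Q6_K size: {q6k_size:.2f} GB")
print("Fits on a 12 GB card:", fits_in_vram(q6k_size, vram_gb=12))
```

With these assumptions the Q6_K file comes out a little under 6 GB, which leaves room to spare on a 12 GB card.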

Running An LLM Locally Is Easy: Install Your Own Mini ChatGPT

ChatGPT is an impressive tool, and with the introduction of GPT-4 it remains the top model on the market. Google is making strides with Gemini and Bard, which have improved significantly in recent months.

However, what do these models have in common? They are available online only, and they operate under the restrictions imposed by the companies that run them. For those who want full control and customization, other companies such as Mistral AI and Meta have gone a step further by releasing their model weights. This lets users deploy the models on their own setups, either locally or on a personal server.

I tried to set up a local installation of my own private AI chatbot on my Windows PC, and it was much easier than I expected. Here is how I did it:

Install the UI

In order to use these powerful language models, we need an interface to interact with them. For this, I chose the text-generation-webui, which is easy to install and has a clean UI. It also allows you to manage and load different models with ease.

Clone the repository, or download it as a zip file and extract it, then run the start script that matches your operating system: start_linux.sh, start_windows.bat, start_macos.sh, or start_wsl.bat.
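
If you are not sure which script applies to your machine, the mapping can be sketched in a few lines of Python. The script names come from the list above. The helper name and the use_wsl flag are hypothetical, added only for illustration, and the WSL case has to be chosen by hand because it is not detected automatically here.

```python
import platform

# Start scripts named in the article, keyed by the value platform.system() returns.
START_SCRIPTS = {
    "Linux": "start_linux.sh",
    "Windows": "start_windows.bat",
    "Darwin": "start_macos.sh",  # macOS reports itself as "Darwin"
}


def start_script_for(system: str, use_wsl: bool = False) -> str:
    """Return the start script for an OS name (hypothetical helper)."""
    if use_wsl:
        # Assumption: you are on Windows and want the install inside WSL.
        return "start_wsl.bat"
    if system not in START_SCRIPTS:
        raise ValueError(f"No start script known for {system!r}")
    return START_SCRIPTS[system]


if __name__ == "__main__":
    print("Run this file:", start_script_for(platform.system()))
```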

A terminal will open, and you'll need to answer a few prompts to complete the installation. Once the server is running, open http://localhost:7860 in your browser.
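
If the page does not load, it helps to check whether anything is answering on that address at all. The snippet below is a minimal sketch using only the Python standard library. The function name and the two-second timeout are arbitrary choices, and the URL is the one given above.

```python
from urllib.error import HTTPError, URLError
from urllib.request import urlopen


def ui_is_up(url: str = "http://localhost:7860", timeout: float = 2.0) -> bool:
    """Return True if an HTTP server answers at the given URL."""
    try:
        with urlopen(url, timeout=timeout):
            return True
    except HTTPError:
        # The server replied, just with an error status, so it is running.
        return True
    except (URLError, OSError):
        # Connection refused, timed out, or host not reachable.
        return False
```

Calling ui_is_up() right after the start script finishes loading should return True; if it returns False, the terminal output of the start script is the first place to look.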

Download the weights

I decided to try a model named dolphin-2.1-mistral, a fine-tuned model made by Eric Hartford that builds on two components:

Dolphin 2.1, an open-source, uncensored, and commercially licensed dataset and series of instruct-tuned language models based on Microsoft’s Orca paper.

The Mistral-7B Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters. Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks we tested.

From TheBloke on HuggingFace

You can download the weights from Hugging Face and put them in the models/ folder of the UI.
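
The download can also be scripted. The sketch below uses only the Python standard library and the direct-download URL pattern that Hugging Face serves for files on a repository's main branch. The repository id is an assumption and should be checked on the model page before running, the file name is the Q6_K variant mentioned in the summary, and webui_dir is a hypothetical path to wherever you unpacked the UI.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Assumption: repository id on Hugging Face; verify it on the model page.
REPO_ID = "TheBloke/dolphin-2.1-mistral-7B-GGUF"
# File name taken from the article's summary.
FILENAME = "dolphin-2.1-mistral-7b.Q6_K.gguf"


def weights_url(repo_id: str, filename: str) -> str:
    """Build the direct-download URL for a file on the repo's main branch."""
    return f"https://huggingface.co/{repo_id}/resolve/main/{filename}"


def download_weights(webui_dir: str, repo_id: str = REPO_ID,
                     filename: str = FILENAME) -> Path:
    """Download the weights into <webui_dir>/models/ and return the file path."""
    models_dir = Path(webui_dir) / "models"
    models_dir.mkdir(parents=True, exist_ok=True)
    dest = models_dir / filename
    if not dest.exists():  # skip the download if the file is already there
        urlretrieve(weights_url(repo_id, filename), str(dest))
    return dest
```

For example, download_weights("text-generation-webui") would place the file in text-generation-webui/models/. The file is several gigabytes, so expect the call to take a while.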
