AI models

Oct 8, 2023

1. Text generation

UIs:

ollama (CLI)
GPT4All (GUI)
serge (web)
text-generation-webui (web)

Best models:

TheBloke/Mistral-7B-OpenOrca-GGUF: Runs very fast on Mac M1 via text-generation-webui
wizardlm-13b-v1.1-superhot-8k.ggmlv3.q4_0.bin (GPT4All): very good, supports LocalDocs
Mistral (ollama): https://github.com/jmorganca/ollama, very fast and good on Mac M1, supports local files with some coding
WizardLM-Uncensored-7B (serge): On Intel NUC

GitHub project: https://github.com/andreas-mausch/dockerize-ollama-mistral-7b-openorca

2. Image generation

https://github.com/AUTOMATIC1111/stable-diffusion-webui

3. Text-to-Speech

https://github.com/rsxdalv/tts-generation-webui

Best model: Bark

With audio/music generation included.

4. Speech-to-text

GitHub project: https://github.com/andreas-mausch/dockerize-whisper

5. Image Captioning

BLIP2: Requires a short python code

Model: Salesforce/blip2-opt-2.7b

GitHub project: https://github.com/andreas-mausch/dockerize-clip-interrogator

6. Image object detection and classification

GitHub project: https://github.com/andreas-mausch/dockerize-detr-resnet-50

undefined

7. Glossary

https://www.reddit.com/r/LargeLanguageModels/comments/13jvi7r/whats_the_difference_between_ggml_and_gptq_models/

GPQT: best for CUDA
GGML: best for CPU, but can also offload some layers on the GPU
GGUF: Newer version of GGML, replacing it