Oobabooga – Musings on AI

Llama 3 on Web UI

Dracones April 22, 2024

When doing inference with Llama 3 Instruct on Text Generation Web UI, up front you can get pretty decent inference speeds on a the M1 Mac Ultra, even with a…

Mac Oobabooga

Text Generation Web UI(llama.cpp)

Dracones April 21, 2024

In this post I’ll be walking through setting up Text Generation Web UI for inference on GGUF models using llama.cpp for Mac. Future posts will go deeper into optimizing Text…