llama-server

Here are 39 public repositories matching this topic...

Enterprise-grade backend for a local RAG assistant API, running on traditional CPU-based Azure infrastructure. It serves a quantized Llama 3.2 GGUF model via llama-server, with offline ChromaDB ingestion and stateful MySQL transaction logging.

  • Updated: May 31, 2026
  • Language: Python