---
layout: default
title: LocalAI Tutorial
nav_order: 92
has_children: true
---
Run LLMs, image generation, and audio models locally with an OpenAI-compatible API.
LocalAI is a free, open-source alternative to OpenAI that runs locally. It provides an OpenAI-compatible API for LLMs, image generation, audio transcription, and text-to-speech, all running on consumer hardware.
| Feature | Description |
|---|---|
| OpenAI Compatible | Drop-in replacement for OpenAI API |
| Multi-Modal | Text, images, audio, embeddings |
| No GPU Required | Runs on CPU (GPU optional) |
| Model Gallery | Easy model installation |
| Docker Ready | Simple deployment |
| Privacy | 100% local, no data leaves your machine |
```mermaid
flowchart TD
    A[OpenAI SDK/API Calls] --> B[LocalAI Server]
    B --> C[LLM Backend]
    B --> D[Image Generation]
    B --> E[Audio Processing]
    B --> F[Embeddings]
    C --> G[llama.cpp]
    C --> H[GPT4All]
    D --> I[Stable Diffusion]
    D --> J[SDXL]
    E --> K[Whisper]
    E --> L[TTS]
    F --> M[Sentence Transformers]
    classDef api fill:#e1f5fe,stroke:#01579b
    classDef server fill:#f3e5f5,stroke:#4a148c
    classDef backend fill:#fff3e0,stroke:#ef6c00
    classDef model fill:#e8f5e8,stroke:#1b5e20
    class A api
    class B server
    class C,D,E,F backend
    class G,H,I,J,K,L,M model
```
- Repository: mudler/LocalAI (about 43.7k stars)
- Latest release: v4.0.0 (published 2026-03-14)
- Chapter 1: Getting Started - Installation and first model
- Chapter 2: Model Gallery - Installing and managing models
- Chapter 3: Text Generation - Chat and completions
- Chapter 4: Image Generation - Stable Diffusion locally
- Chapter 5: Audio - Whisper transcription and TTS
- Chapter 6: Embeddings - Vector embeddings for RAG
- Chapter 7: Configuration - Advanced settings and tuning
- Chapter 8: Integrations - Production integrations and optimization
- Deploy LocalAI with Docker or from source
- Install Models from the gallery
- Use OpenAI SDK with local models
- Generate Images with Stable Diffusion
- Transcribe Audio with Whisper
- Create Embeddings for RAG applications
- Scale for Production use
- Docker (recommended)
- 8GB+ RAM (more for larger models)
- Optional: NVIDIA GPU with CUDA
```bash
# Run LocalAI (CPU)
docker run -p 8080:8080 \
  -v localai-models:/models \
  localai/localai:latest-cpu

# Open http://localhost:8080
```

```bash
# Run LocalAI with an NVIDIA GPU
docker run -p 8080:8080 \
  --gpus all \
  -v localai-models:/models \
  localai/localai:latest-gpu-nvidia-cuda-12
```

```yaml
version: '3.8'
services:
  localai:
    image: localai/localai:latest-cpu
    ports:
      - "8080:8080"
    volumes:
      - ./models:/models
    environment:
      - DEBUG=true
      - THREADS=4
```

```bash
# Install a model via the API
curl http://localhost:8080/models/apply \
  -H "Content-Type: application/json" \
  -d '{"id": "phi-2"}'

# List available models
curl http://localhost:8080/models/available
```

```python
from openai import OpenAI

# Point to LocalAI
client = OpenAI(
    base_url="http://localhost:8080/v1",
    api_key="not-needed"  # LocalAI doesn't require an API key
)

# Chat completion (same as OpenAI!)
response = client.chat.completions.create(
    model="phi-2",
    messages=[
        {"role": "user", "content": "Hello!"}
    ]
)
print(response.choices[0].message.content)
```

```python
# Generate an image with Stable Diffusion
response = client.images.generate(
    model="stablediffusion",
    prompt="A beautiful sunset over mountains",
    size="512x512"
)

# Save the image
import base64
image_data = base64.b64decode(response.data[0].b64_json)
with open("sunset.png", "wb") as f:
    f.write(image_data)
```

```python
# Transcribe audio with Whisper
with open("audio.mp3", "rb") as f:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",
        file=f
    )
print(transcript.text)
```

```python
# Generate speech
response = client.audio.speech.create(
    model="tts-1",
    voice="alloy",
    input="Hello, this is LocalAI speaking!"
)

# Save the audio
with open("speech.mp3", "wb") as f:
    f.write(response.content)
```

```python
# Generate embeddings for RAG
response = client.embeddings.create(
    model="text-embedding-ada-002",
    input="Hello, world!"
)
embedding = response.data[0].embedding
print(f"Embedding dimension: {len(embedding)}")
```

| Category | Models |
|---|---|
| LLM | Phi-2, LLaMA, Mistral, GPT4All |
| Image | Stable Diffusion, SDXL |
| Audio | Whisper (all sizes) |
| TTS | Piper, Coqui |
| Embedding | all-MiniLM, BGE |
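The embeddings endpoint shown earlier returns plain float vectors, so retrieval for RAG boils down to a similarity search over those vectors. A minimal sketch using cosine similarity in pure Python (the toy 3-dimensional vectors stand in for real model output, which would come from the embeddings API):

```python
import math

def cosine_similarity(a, b):
    # Dot product divided by the product of the vector norms
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def top_k(query_vec, doc_vecs, k=2):
    # Rank stored document embeddings by similarity to the query embedding
    scored = [(i, cosine_similarity(query_vec, v)) for i, v in enumerate(doc_vecs)]
    return sorted(scored, key=lambda s: s[1], reverse=True)[:k]

# Toy 3-dimensional "embeddings" standing in for real model output
docs = [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0], [0.0, 1.0, 0.0]]
query = [1.0, 0.05, 0.0]
print(top_k(query, docs))
```

In a real RAG pipeline you would embed your document chunks once, store the vectors (in memory or a vector database), and run this ranking step per query before passing the top chunks to the chat endpoint.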
| Model Size | RAM (CPU) | VRAM (GPU) |
|---|---|---|
| 3B | 4GB | 4GB |
| 7B | 8GB | 6GB |
| 13B | 16GB | 10GB |
| 70B | 64GB+ | 40GB+ |
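These figures roughly track parameter count times bytes per weight, plus runtime overhead for the KV cache and the server itself. A back-of-the-envelope sketch (the one-byte-per-weight and 1.2x overhead assumptions are illustrative approximations, not LocalAI's actual accounting):

```python
def estimate_ram_gb(params_billion, bytes_per_param=1.0, overhead=1.2):
    # params (billions) * bytes per weight (1.0 ~ 8-bit quantization),
    # scaled by a fudge factor for KV cache and runtime overhead
    return params_billion * bytes_per_param * overhead

for size in (3, 7, 13, 70):
    print(f"{size}B -> ~{estimate_ram_gb(size):.1f} GB")
```

Lowering `bytes_per_param` (e.g. 0.5 for 4-bit quantization) shows why quantized models fit in far less memory than their full-precision counterparts.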
- Chapters 1-3: Setup and text generation
- Run your first local LLM
- Chapters 4-6: Images, audio, and embeddings
- Build multi-modal applications
- Chapters 7-8: Configuration and production
- Scale local AI infrastructure
Ready to run AI locally? Let's begin with Chapter 1: Getting Started!
Generated for Awesome Code Docs
- Start Here: Chapter 1: Getting Started with LocalAI
- Back to Main Catalog
- Browse A-Z Tutorial Directory
- Search by Intent
- Explore Category Hubs
- Chapter 1: Getting Started with LocalAI
- Chapter 2: Model Gallery and Management
- Chapter 3: Text Generation and Chat Completions
- Chapter 4: Image Generation with Stable Diffusion
- Chapter 5: Audio Processing - Whisper & TTS
- Chapter 6: Vector Embeddings for RAG
- Chapter 7: Advanced Configuration and Tuning
- Chapter 8: Production Integration and Applications
Generated by AI Codebase Knowledge Builder