Ollama

Run open-source LLMs locally with a single command.

Ollama is profiled here as a LLM tool for engineering teams. Read about features, pricing, and how it compares to related options in the tools directory.

LLM Backend Embeddings DeploymentOpen Source

Visit Website GitHub

Description

Short Intro: Ollama is a local LLM runtime built by Jeffrey Morgan and Michael Chiang, the same team that created Kitematic, which Docker acquired and turned into Docker Desktop. Founded through Y Combinator's Winter 2021 batch and publicly released in July 2023, Ollama applies the same accessibility philosophy: one command to pull a model, one to run it, one to manage it. The tool has 172,000+ GitHub stars as of May 2026 and 52 million monthly model pulls, making it the most widely adopted local inference runtime in the category, built on a minimal pre-seed rather than a large VC round.

Key Capabilities:

One-command model pull and run for Llama 4, DeepSeek, Qwen, Gemma, Mistral, Phi, and more
OpenAI-compatible local REST API at localhost for drop-in migration from cloud APIs
Modelfile system for customizing models with system prompts, parameters, and adapter layers
GPU acceleration for NVIDIA CUDA, AMD ROCm, and Apple Metal with CPU fallback
Multimodal support and tool calling for agentic workflows
Cross-platform installation on macOS, Linux, and Windows
Official Docker image for containerized server deployments
First-party Python and JavaScript client libraries
Quantized model support for consumer GPUs and laptops
Desktop app for macOS and Windows released August 2025
ollama launch <app> command for spinning up integrated local applications
Go backend with llama.cpp inference engine via CGo for efficient local serving

See Ollama pricing details →

Alternative tools

Kong AI Gateway
AI traffic governance built on the Kong gateway
Eden AI
One API for many AI models across providers and modalities
Reka
Multimodal models that reason across text, image, and video
Moonshot AI
Open-weight Kimi models tuned for agentic and coding tasks
xAI Grok
Grok model family with real-time search through one API
LLM Guard
Open-source security toolkit for LLM interactions

Used in Stacks

No saved stacks include this tool yet.

Browse more in LLM