Milvus

Open-source vector database built for billion-scale search

Milvus is profiled here as a RAG Framework tool for engineering teams. Read about features, pricing, and how it compares to related options in the tools directory.

RAG Framework Embeddings Vector Database StorageOpen Source

Visit Website GitHub

Description

Milvus is an open-source vector database created by Zilliz in 2019 and now a graduated project of the LF AI and Data Foundation. Its architecture separates compute from storage so indexing and querying scale independently to billions of vectors across distributed nodes. Multiple index types let teams trade recall against speed and memory, and Zilliz Cloud offers the same engine as a managed service. Backed by Zilliz and a large contributor base, it serves vector workloads at scale across recommendation, search, and retrieval systems. The lightweight Milvus Lite build lets developers prototype locally before moving to a distributed cluster.

Key Capabilities:

Vector similarity search scaling to billions of embeddings
Multiple index types including HNSW, IVF, and DiskANN
Hybrid search over dense and sparse vectors with scalar filtering
Distributed architecture with separated compute and storage
Multi-tenancy through collections, partitions, and databases
Apache 2.0 license with the lightweight Milvus Lite for local use

Alternative tools

LanceDB
Embedded multimodal vector database on the Lance format
Sentence Transformers
Python framework for dense text and image embeddings
R2R
Production retrieval system with ingestion and an API
Mem0
Long-term memory layer for AI agents and assistants
Docling
Open-source document conversion built for RAG pipelines
Unstructured
Turn raw documents into LLM-ready structured data

Used in Stacks

No saved stacks include this tool yet.

Browse more in RAG Framework