Vector Databases

Fast Take: Vector databases store embeddings so AI can search and retrieve information by meaning instead of keywords.

Layer: Retrieval Status: Mature Last Updated: 2026-01-06

Decision Box

Use this when:

  • You’re building RAG
  • You need semantic search
  • You store embeddings at scale
  • You need fast similarity retrieval

❌ Ignore this when:

  • Keyword search is enough
  • You have very small datasets
  • Exact matches matter more than similarity
  • You don’t need ranking or relevance scoring

⚠️ Risk if misused:

  • Poor indexing kills performance
  • Wrong distance metric gives bad results
  • No filtering = irrelevant retrieval
  • Scaling costs creep silently

Simple Explanation

⚠️ What it is:

A vector database is a specialized database designed to store and search embeddings efficiently.

Analogy:
It’s a library where books are shelved by meaning, not by title or author.

Why it matters:

  • Confusing RAG with fine-tuning (they solve different problems)
  • Assuming RAG guarantees accuracy without good data prep

Technical Breakdown

Key Concepts:

  • Vector indexing
  • Distance metrics (cosine, dot product, Euclidean)
  • Top-K retrieval
  • Metadata filtering
  • Hybrid search

Implementation Snapshot:

Embeddings → Vector Database → Similarity Search → Retrieved Chunks → LLM

Common Failure Modes:

  • No metadata filters
  • Over-retrieving (too many chunks)
  • Wrong distance metric
  • No re-ranking layer

Cost Reality:


Cost profile: Medium

  • Cost profile: Medium
  • Drivers: data volume, query rate, replication

Top Players

Company / Tool – why it matters here:

  • Pinecone
  • Weaviate
  • Qdrant
  • Milvus
  • Chroma
  • Elasticsearch (vector mode)

Go Deeper

Appears in:

AI Foundations for Builders — Module 2: The Library

This concept is covered in Module 2 – The Library (RAG & Vector Databases)

Term Flow

Prerequisites:

  • Embeddings

Next Concepts:

  • Semantic Search
  • Top-K Retrieval
  • Re-ranking
  • Hybrid Search

Often Confused With:

  • Traditional databases
  • Search engines
  • Full-text indexing
Scroll to Top