AIDB v7
AIDB is an EDB-maintained PostgreSQL extension that brings AI data workflows directly into your Postgres database. Define pipelines in SQL to parse, chunk, embed, and index your data — then query it with semantic search, hybrid search, and LLM-powered retrieval. Common use cases include Retrieval-Augmented Generation (RAG), semantic search over documents and images, and automated document ingestion pipelines from S3 or local storage.
Where to start
If you're new to AIDB, follow this end-to-end workflow:
Overview — Learn the core concepts: what pipelines and knowledge bases are, and how they fit together.
Install — Install the
aidbpackage, configureshared_preload_libraries, and enable the extension in your database.Integrating models — Register the AI model that will power your pipeline. Every pipeline step that generates embeddings or processes text requires a model — either a built-in local model (no setup needed) or an external API model that you register with
aidb.create_model(). Do this before creating pipelines.External storage (optional) — If your source data lives in S3 or a local file system rather than a Postgres table, configure a volume using PGFS so pipelines can read from it.
Data pipelines — Define a pipeline that reads from your data source, transforms it (chunk, parse, embed), and writes the output to a knowledge base. This is where the model you registered in step 3 is put to work.
Knowledge bases — Query the vector embeddings your pipeline produced. Use
aidb.retrieve_text()oraidb.retrieve_key()for semantic search, or combine vector and keyword search with hybrid search.
Documentation map
| Section | What it covers |
|---|---|
| Release notes | What changed in each version |
| Compatibility | Supported platforms and Postgres versions for AIDB and PGFS |
| Install | How to install and configure the AIDB extension |
| Integrating models | Local models, external APIs (OpenAI, NIM, Gemini, OpenRouter), and supported model variants |
| External storage | Using S3-compatible object stores and local file systems as pipeline data sources |
| Data pipelines | Defining pipelines, pipeline steps, orchestration, and examples |
| Knowledge bases | Semantic search, hybrid search, and vector index options |
| SQL functions | Standalone SQL functions for chunking, parsing, OCR, summarization, and embedding |
| Reference | Full API reference for all AIDB functions and views |
- On this page
- Where to start
- Documentation map