The EDB Embeddings component connects a flow to an embedding model running on a Hybrid Manager (HM) model server cluster, and produces an Embeddings model object that downstream components (vector stores, retrievers) can use to encode text.
Choosing the right component
Use the EDB Embeddings component when:
- You want vector embeddings produced by a model running on an HM-hosted model server cluster (OpenAI-compatible or NVIDIA NIM).
- You're feeding a vector store, retriever, or similarity-search component that expects an Embeddings model.
Use a different component if:
- You want CPU-based embeddings produced locally on the database cluster using
aidb. Use EDB Embedded Models. - You want generative responses from a model, not embeddings. Use EDB Model Server.
Prerequisites
An HM model server cluster with an embedding model deployed. See Models for how to create one. A GPU node is typically required.
An HM machine-user access key saved in Langflow as a Global Variable (default name
HM_API_KEY).
Inputs
Connection
| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| Hybrid Manager URL | Text | No | Advanced. Override the default HM URL. | |
| HM Machine User Key | Secret | Yes | HM_API_KEY | Defaults to the global variable named HM_API_KEY. |
| Hybrid Manager Model Server Cluster Instance | Dropdown | Yes | The model server cluster to call. Populated from your HM model clusters. | |
| External Ingress | Boolean | No | Advanced. Route through the external ingress instead of the in-cluster service. |
Model
| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| API Client | Dropdown | No | OpenAI | Advanced. OpenAI for the EDB Model Server's OpenAI-compatible API. NVIDIA only when connecting to a native NIM endpoint that needs NVIDIA-specific embedding features. |
| Model | Dropdown | Yes | Populated from the selected model server cluster. Use the refresh button if the list is empty. |
API Client = OpenAI (default)
| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| Dimensions | Integer | No | Advanced. Number of dimensions for the output vectors. Only supported by certain models. Leave blank to use the model's default. | |
| Max Retries | Integer | No | 3 | Advanced. Maximum retries on a failed embedding call. |
| Chunk Size | Integer | No | 1000 | Advanced. Maximum texts to send in a single batch. |
| Model Kwargs | Dict | No | Advanced. Additional keyword arguments passed through to the OpenAI embeddings client. |
API Client = NVIDIA
| Field | Type | Required | Default | Notes |
|---|---|---|---|---|
| Model Temperature | Float | No | 0.1 | Advanced. Model temperature. |
The OpenAI-specific fields hide when API Client is NVIDIA, and vice versa. Field values are preserved when you switch between clients, so a round-trip doesn't wipe your settings.
Outputs
| Output | Type | Carries |
|---|---|---|
| Embedding Model | Embeddings | A LangChain Embeddings instance pointing at the selected HM model server cluster. Pass this into a vector store or retriever. |