Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Docs/aidb/testlayout #6309

Closed
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Original file line number Diff line number Diff line change
@@ -1,18 +1,18 @@
---
title: "EDB Postgres AI - AI Accelerator - Overview"
navTitle: "Overview"
navTitle: "AI Accelerator Overview"
description: "Overview of the EDB Postgres AI AI Accelerator suite of tools including EDB PGvector and Pipelines."
---

## What is the AI Accelerator?

AI Accelerator simplifies AI data management by combining vector search from open source pgvector with Pipelines, an extension for AI data pipeline automation. It handles vector embedding operations for structured and unstructured data, simplifying similarity searches and natural language querying, and streamlines development of AI-powered applications by providing retrieval of AI-ready data in a familiar Postgres environment.
AI Accelerator simplifies AI data management by combining vector search from open source PGvector with Pipelines, an extension for AI data pipeline automation. It handles vector embedding operations for structured and unstructured data, simplifying similarity searches and natural language querying, and streamlines development of AI-powered applications by providing retrieval of AI-ready data in a familiar Postgres environment.

## Features

### Vector Toolkit

The EDB Pipelines extension enables automated vector embedding, storage, and retrieval workflows, and it comes preloaded with pgvector for seamless management of vector data in Postgres. This enables developers to build complex GenAI functionality using SQL commands in the familiar Postgres environment — with just 5 lines of code instead of 130+.
The EDB Pipelines extension enables automated vector embedding, storage, and retrieval workflows, and it comes preloaded with PGvector for seamless management of vector data in Postgres. This enables developers to build complex GenAI functionality using SQL commands in the familiar Postgres environment — with just 5 lines of code instead of 130+.

### Swappable Configurations

Expand Down
File renamed without changes.
18 changes: 15 additions & 3 deletions advocacy_docs/edb-postgres-ai/ai-accelerator/index.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -5,10 +5,22 @@ directoryDefaults:
product: "EDB Postgres AI"
iconName: BrainCircuit
indexCards: simple
description: "All about the EDB Postgres AI - AI Accelerator suite of tools including Pipelines and pgvector."
description: "All about the EDB Postgres AI - AI Accelerator suite of tools including Pipelines and PGvector."
navigation:
- overview
- pipelines
- ai-accelerator-overview
- "#Pipelines"
- pipelines-overview
- installing
- gettingstarted
- "#Components"
- models
- retrievers
- pgfs
- "#Resources"
- reference
- rel_notes
- licenses
- pgvector
redirects:
- /edb-postgres-ai/ai-ml/
---
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ Pipelines has a "Postgres File System" (PGFS) that allows you access external st

PGFS is a key component of Pipelines, allowing you to work with data which is not stored in the database, either because it is too large or because it is not structured in a way that is suitable for a relational database.

pgfs is implemented as a foreign data wrapper (FDW) and is installed as an extension. The extension provides a set of functions to interact with the file system from within the database.
PGFS is implemented as a foreign data wrapper (FDW) and is installed as an extension. The extension provides a set of functions to interact with the file system from within the database.

## Installing the PGFS extension

Expand Down
Original file line number Diff line number Diff line change
@@ -1,38 +1,38 @@
---
title: "AI Accelerator - PGVector"
navTitle: "PGVector"
description: "PGVector is a Postgres extension that provides vector data types and functions to store and manipulate vector data."
title: "AI Accelerator - PGvector"
navTitle: "PGvector"
description: "PGvector is a Postgres extension that provides vector data types and functions to store and manipulate vector data."
deepToC: true
---

## What is PGVector
## What is PGvector

PGvector is an extension for Postgres that enables efficient storage and similarity search of high-dimensional vector data, commonly used for machine learning models, recommendation systems, and natural language processing applications.

### Part of the EDB Postgres AI platform

EDB Postgres AI with pgvector delivers a flexible solution for enterprise AI, integrating seamlessly with existing Postgres environments. It outperforms standalone vector databases with 4.22X faster queries and eliminates data silos via integrations with 18X cost-efficient object storage. This unified platform accelerates AI deployment, simplifies management, and ensures up to 99.999% availability, enabling businesses to innovate rapidly and future-proof their data infrastructure without disrupting current operations.
EDB Postgres AI with PGvector delivers a flexible solution for enterprise AI, integrating seamlessly with existing Postgres environments. It outperforms standalone vector databases with 4.22X faster queries and eliminates data silos via integrations with 18X cost-efficient object storage. This unified platform accelerates AI deployment, simplifies management, and ensures up to 99.999% availability, enabling businesses to innovate rapidly and future-proof their data infrastructure without disrupting current operations.

### Native Vector Data Type Support

pgvector on EDB Postgres AI enables storage of AI/ML model embeddings as first-class data types, allowing efficient indexing and querying of large volumes of AI data stored in object storage, seamlessly integrated with traditional relational data.
PGvector on EDB Postgres AI enables storage of AI/ML model embeddings as first-class data types, allowing efficient indexing and querying of large volumes of AI data stored in object storage, seamlessly integrated with traditional relational data.

### Advanced 4.22X Vector Query

Extends standard SQL with vector-specific operators and functions, enabling complex queries that combine vector operations, relational data, and full SQL capabilities, going far beyond simple similarity searches to support sophisticated AI-driven applications.

### High-performance Indexing

With real-time indexing, storage, and querying of AI data, pgvector enables efficient vector similarity search on embeddings from various LLMs, while leveraging Postgres transactionality for consistent handling of mixed workloads.
With real-time indexing, storage, and querying of AI data, PGvector enables efficient vector similarity search on embeddings from various LLMs, while leveraging Postgres transactionality for consistent handling of mixed workloads.

### Integrated Vector Data Platform

pgvector unifies vector database capabilities with EDB Postgres AI's mature enterprise features, ensuring high availability, robust backup/recovery, strong security, and ACID data integrity, all within a single vendor solution for comprehensive data management and AI workloads.
PGvector unifies vector database capabilities with EDB Postgres AI's mature enterprise features, ensuring high availability, robust backup/recovery, strong security, and ACID data integrity, all within a single vendor solution for comprehensive data management and AI workloads.

## Installation

The extension is included with AI Accelerator's Pipelines and installed automatically when Pipelines is installed.

## Further information

For more information on the pgvector extension, see the [pgvector repository](https://github.com/pgvector/pgvector).
For more information on the PGvector extension, see the [pgvector repository](https://github.com/pgvector/pgvector).
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ description: "Where to start with AI Accelerator Pipelines."
prevNext: True
---

As part of the EDB Postgres AI's AI Accelerator Pipelines abstracts away the complexity of working with AI data. It transforms Postgres into a powerful platform for AI data management, combining vector search from pgvector with automation for complex AI workflows.
As part of the EDB Postgres AI's AI Accelerator Pipelines abstracts away the complexity of working with AI data. It transforms Postgres into a powerful platform for AI data management, combining vector search from PGvector with automation for complex AI workflows.

## Pipelines, Auto Embedding, and Intelligent Retriever

Expand Down
23 changes: 0 additions & 23 deletions advocacy_docs/edb-postgres-ai/ai-accelerator/pipelines/index.mdx

This file was deleted.

Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@ description: "Reference documentation for EDB Postgres AI - AI Accelerator - Pip
deepToC: true
---

This section provides reference documentation for EDB Postgres AI AI Accelerator Pipelines PGFS. It includes information on the functions and views available in the [pgfs](../pgfs) extension that gives aidb access to S3-compatible file systems and local file systems.
This section provides reference documentation for EDB Postgres AI AI Accelerator Pipelines PGFS. It includes information on the functions and views available in the [PGFS](../pgfs) extension that gives aidb access to S3-compatible file systems and local file systems.

## Functions

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@ This is the GA release of EDB Postgres AI - AI Accelerator - Pipelines. Since th
- Support for external models with OpenAI API.
- Secure management of API credentials vie postgres "user mapping" feature.
- Model registry that allows configuring internal/external models.
- pgfs/volumes allows working with external data from S3 object stores and file systems.
- PGFS/volumes allows working with external data from S3 object stores and file systems.
- Low-level primitives for models: encode, transform for image and text data.
- Enhanced retriever concept for data retrieval.
- Support image and text data, including: Postgres tables, external storage, and ad-hoc queries.
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -8,7 +8,7 @@ highlights: |
- Support for external models with OpenAI API.
- Secure management of API credentials vie postgres "user mapping" feature.
- Model registry that allows configuring internal/external models.
- pgfs/volumes allows working with external data from S3 object stores and file systems.
- PGFS/volumes allows working with external data from S3 object stores and file systems.
- Low-level primitives for models: encode, transform for image and text data.
- Enhanced retriever concept for data retrieval.
- Support image and text data, including: Postgres tables, external storage, and ad-hoc queries.
Expand Down Expand Up @@ -44,7 +44,7 @@ relnotes:
type: Enhancement
severity: High
impact: High
- relnote: pgfs/volumes allows working with external data from S3 object stores and file systems
- relnote: PGFS/volumes allows working with external data from S3 object stores and file systems
details: |
The tech preview could only work with S3 in retrievers. Now, the volume concept allows working with external data in general.
jira: AID-77
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -32,7 +32,7 @@ You can combine any data source with any data type:

Retrievers always need a "vector table" to store the vector embeddings that belong to the source data. We use a model with "encoder" capabilities to generate the vectors for the source data.

Each model has a different "dimensionality" i.e. number of dimensions of the vector (the vector [1, 2, 2] has 3 dimensions). We use pgvector "vector" column types to store the embeddings. And those have a fixed number of dimensions. For this reason, the encoder interface on the models returns how many dimensions the vectors will have.
Each model has a different "dimensionality" i.e. number of dimensions of the vector (the vector [1, 2, 2] has 3 dimensions). We use PGvector "vector" column types to store the embeddings. And those have a fixed number of dimensions. For this reason, the encoder interface on the models returns how many dimensions the vectors will have.

We create the vector table, or support using an existing one. The vector column dimensionality must fit the model.

Expand Down
Loading