EDB POSTGRES AI ANALYTICS ACCELERATOR

WarehousePG:
The Sovereign Data Warehouse

Take back control of your analytical database with WarehousePG—the open source, petabyte-scale data warehouse available on EDB Postgres® AI.

 


Find out how WarehousePG can transform your analytics platform

Select your current data platform scenario.

  • Reclaim control from cloud vendor lock-in

    Escape cloud-only costs and lock-in

    Eliminate 23% cost unpredictability from cloud data warehouses. WarehousePG delivers predictable performance and pricing for daily driver analytics—no multi-cluster penalties, serverless surprises, or hidden overages.

    Get complete control at petabyte scale

    Achieve total data sovereignty and global compliance by running your data wherever you need it—on-prem, cloud, or hybrid. WarehousePG’s massively parallel processing (MPP) architecture delivers petabyte-scale performance, ensuring consistent scalability and control over growing volumes.

    Protect business continuity with world-leading Postgres expertise

    Minimize operational risk and maximize uptime with real-time observability. You can operate with complete confidence—knowing your platform is backed by 24x7, award-winning support from the world's top Postgres experts, including the core committers and builders of WarehousePG.

  • Adopt a modern analytics platform with hybrid flexibility

    Build your future-proof analytics platform

    WarehousePG combines the open foundation, flexible deployment, and enterprise scale you’ve been looking for. Accelerate time-to-value for advanced and agentic analytics with native vector capabilities and in-database machine learning.

    Get complete control at petabyte scale

    Achieve total data sovereignty and global compliance by running your data wherever you need it—on-prem, cloud, or hybrid. WarehousePG’s massively parallel processing (MPP) architecture delivers petabyte-scale performance, ensuring consistent scalability and control over growing volumes.

    Protect business continuity with world-leading Postgres expertise

    Minimize operational risk and maximize uptime with real-time observability. You can operate with complete confidence—knowing your platform is backed by 24x7, award-winning support from the world's top Postgres experts, including the core committers and builders of WarehousePG.

  • Migrate from proprietary or end-of-life Greenplum

    Modernize your Greenplum investment

    WarehousePG is the drop-in Greenplum replacement that lets you eliminate tech debt and vendor lock-in while upgrading to an AI-ready data foundation and award-winning EDB support. Learn more about Support for Greenplum Workloads.

    Get complete control at petabyte scale

    Achieve total data sovereignty and global compliance by running your data wherever you need it—on-prem, cloud, or hybrid. WarehousePG’s massively parallel processing (MPP) architecture delivers petabyte-scale performance, ensuring consistent scalability and control over growing volumes.

    Protect business continuity with world-leading Postgres expertise

    Minimize operational risk and maximize uptime with real-time observability. You can operate with complete confidence—knowing your platform is backed by 24x7, award-winning support from the world's top Postgres experts, including the core committers and builders of WarehousePG.

Trusted by enterprise data leaders

EDB PG AI for WarehousePG provides stability, performance, and control for mission-critical workloads all over the world.

  • MNTN

    MNTN, a leading connected TV ad-tech platform, required a solution that guaranteed operational uptime and responsive support for mission-critical performance marketing. WarehousePG provided MNTN with the necessary petabyte-scale stability and performance, backed by EDB's 24x7 expert support — crucial for turning insights into competitive advantage. Get the full MNTN story.

    "The performance is there, the stability is there, the support is responsive as they should be. I’m just happy that there’s somebody there that can be with me in the middle of the night and I’m not, quite literally, hacking open source code trying to get the database recovered."

    – Greg Spiegelberg, Head of Data, MNTN
  • Kyobo Book Centre

    Kyobo Book Centre, the largest bookstore chain in South Korea, was seeking a strategic escape from unpredictable and soaring compute costs on their 50TB cloud data warehouse. Kyobo adopted WarehousePG to establish cost control, gain superior performance, and meet strict data residency mandates.

    "We have been plagued by runaway costs for querying our 50TB cloud data warehouse. EDB Postgres AI for WarehousePG will give us a way to rein in costs with superior performance — and we can do it with total data sovereignty."

    – Mr. Jung, Heung Sik, Head of IT Support, Kyobo Book Centre
  • Euronext FX

    Euronext FX, a leading pan-European market infrastructure, needed to eliminate vendor lock-in and technical debt from their existing Greenplum system. WarehousePG delivered a zero-migration binary swap that immediately provided superior enterprise support and open source control across their four global data centers. This seamless, lift-and-shift path ensured future-proof stability for their high-volume workloads.

    "We're excited to be working with EDB Postgres AI. Its Support for Greenplum Workloads is helping us maintain control of where and how we deploy open source software."

    – Grigoriy Zeleniy, Global CTO, Euronext FX

Architecture spotlight

EDB PG AI: Securing and scaling the open source WarehousePG core


Use cases and features

WarehousePG extends EDB PG AI with the sovereign, petabyte-scale capabilities required to accelerate your most demanding analytics workloads, eliminate data fragmentation, and ensure complete compliance.

Enterprise data platform modernization

Reduce costs for high-concurrency workloads by up to 62% and future-proof your analytics investments by rapidly replacing legacy data platforms and proprietary cloud infrastructure with open source, massively scalable WarehousePG.

Key features

  • Postgres and Greenplum compatibility: Modernize in hours, not months, with a simple binary swap for Greenplum workloads and high SQL parity for other legacy system migrations.
  • 24x7 support: Transition and scale confidently with award-winning support from the world's top Postgres experts — including Postgres committers and the builders of WarehousePG.
  • Massively parallel processing (MPP): Achieve petabyte-scale analytics with a Postgres core. The coordinator optimizes queries and manages parallel distribution across multiple Postgres instances.
  • Deployment flexibility: Achieve compliance and sovereignty by deploying across any environment (cloud, on-prem, or hybrid) without proprietary vendor lock-in.

High-concurrency business intelligence

Innovate faster by enabling more teams to query all your data as much as they need to. With up to 63% better concurrency handling than cloud data warehouses, external data integration, and tiered storage, WarehousePG empowers analytics users to run highly complex, concurrent queries for business intelligence and reporting — without compromising system performance or increasing operational overhead.

Key features

  • Comprehensive external data access (PXF): Break down data silos by querying and joining data in external sources like data lakes (Amazon S3, HDFS) using standard SQL.
  • Tiered storage: Reduce costs and improve performance by keeping warmer data in WarehousePG while storing cold data in more cost-effective external storage — without creating new data silos or limiting analyst access.
  • Predictable workload management: Ensure stable performance and SLAs for diverse user groups by isolating resources and prioritizing critical workloads using Linux cgroups v2.

Real-time streaming and high-volume log analysis

Turn live events and massive, continuous data streams into immediate insights and action. Leverage streaming ingest and search capabilities to gain low-latency visibility into operational health, IoT device status, security threats, performance issues — whatever your business requires.

Key features

  • Real-time streaming ingestion (flow server): Enable rapid innovation and up-to-the-second monitoring by handling high-volume event data from sources like Kafka and RabbitMQ.
  • High availability and fault tolerance: Protect data integrity and operational uptime with full ACID compliance and automatic failure detection using the Standby Coordinator and Mirror Segments.
  • Integrated security and auditability: Meet strict compliance mandates with features like Row-Level Security (RLS), column-level access rights, and support for the pgAudit extension.

Advanced and agentic analytics

Accelerate time-to-value for strategic initiatives with a single platform that maximizes data scientist and analyst productivity. EDB Postgres AI supports everything from in-database machine learning and advanced analytics directly where data resides in WarehousePG, to building conversational analytics agents with AI Factory’s low- and no-code tooling. Even as AI agents increase concurrent load on the system, WarehousePG scales to meet this demand 63% more efficiently than leading cloud data warehouses.

Key features

  • In-database machine learning: Train sophisticated models for operations like fraud detection and churn prediction directly on massive datasets, supporting diverse skill sets with MADlib for SQL users and robust in-database Python ML frameworks.
  • Native vector capabilities (pgvector): Power RAG applications and semantic search across unstructured data like text and images, unlocking new revenue streams.
  • Model and agent interoperability: Connect WarehousePG with GenAI applications and agents through plug-and-play integration with AI Factory knowledge bases.
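
To make the vector search feature above more concrete, here is a minimal Python sketch of the kind of ranking a semantic search performs. It is an illustration only, not WarehousePG or pgvector code: the document names and three-dimensional embeddings are made up, and the helper names `cosine_distance` and `nearest` are hypothetical. In pgvector itself, cosine distance is exposed through the `<=>` operator, so a query ordered by that operator should produce a comparable ranking inside the database.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def nearest(query, documents, k=2):
    """Return the names of the k documents closest to the query embedding."""
    ranked = sorted(documents, key=lambda item: cosine_distance(query, item[1]))
    return [name for name, _ in ranked[:k]]


# Hypothetical documents with toy 3-dimensional embeddings.
documents = [
    ("refund policy", [0.9, 0.1, 0.0]),
    ("shipping times", [0.1, 0.9, 0.1]),
    ("return an item", [0.8, 0.2, 0.1]),
]

# Hypothetical embedding of a user question about refunds.
top = nearest([1.0, 0.0, 0.0], documents)
print(top)
```

Real embeddings typically have hundreds or thousands of dimensions and come from an embedding model; the ranking logic stays the same.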

Resources

EDB Postgres AI for WarehousePG Solution Brief


WarehousePG Documentation


WarehousePG OSS Project Website


EDB Postgres AI Support for Greenplum Workloads


The data warehouse of the future is open

Upgrade to WarehousePG for the petabyte-scale performance you expect, the open source flexibility you want, and the sovereign control you need.

 

*Greenplum® is a registered trademark of Broadcom Inc. EDB and EDB Postgres AI are not affiliated with, endorsed by, or sponsored by Broadcom Inc. Any references to Greenplum are for comparative, educational, and interoperability purposes only.

What is WarehousePG?

WarehousePG is an open source, Postgres-based data warehouse built on a massively parallel processing (MPP) architecture, enabling petabyte-scale analytics. It preserves the familiar SQL experience of Postgres and scales horizontally by distributing data and queries across many Postgres nodes that work together in parallel.

What is massively parallel processing (MPP) and why is it important for analytics?

MPP is an architecture purpose-built to execute complex transformations on petabyte-scale datasets with high efficiency. It distributes data and query execution across multiple, independent compute segments (nodes) that work simultaneously. This design removes the performance limits of a single server, delivering consistent scalability as datasets and workloads expand.
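
The idea can be sketched in a few lines of Python. This is a toy model rather than WarehousePG code: the segment count, the CRC32 hash, and the function names are all assumptions chosen for illustration. Rows are placed on segments by hashing a distribution key, each segment computes a partial aggregate on its own rows, and a coordinator merges the partial results.

```python
import zlib
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

NUM_SEGMENTS = 4  # hypothetical number of segments


def segment_for(key):
    """Pick a segment by hashing the distribution key (CRC32 is an assumption)."""
    return zlib.crc32(key.encode("utf-8")) % NUM_SEGMENTS


def distribute(rows):
    """Place each (customer, amount) row on exactly one segment."""
    segments = [[] for _ in range(NUM_SEGMENTS)]
    for customer, amount in rows:
        segments[segment_for(customer)].append((customer, amount))
    return segments


def local_sum(segment_rows):
    """Work done independently on one segment: a partial SUM grouped by customer."""
    totals = defaultdict(int)
    for customer, amount in segment_rows:
        totals[customer] += amount
    return dict(totals)


def coordinator_sum(rows):
    """Fan the aggregation out to all segments, then merge the partial results."""
    segments = distribute(rows)
    with ThreadPoolExecutor(max_workers=NUM_SEGMENTS) as pool:
        partials = list(pool.map(local_sum, segments))
    merged = defaultdict(int)
    for partial in partials:
        for customer, total in partial.items():
            merged[customer] += total
    return dict(merged)


rows = [("acme", 10), ("globex", 5), ("acme", 7), ("initech", 3), ("globex", 1)]
result = coordinator_sum(rows)
print(result)
```

Because every row for a given customer hashes to the same segment, each segment can aggregate its own data without talking to the others. Adding segments spreads both the storage and the work, which is the scaling property the answer above describes.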

What is WarehousePG's relationship with Greenplum?

WarehousePG has a direct lineage from the open source Greenplum Database project, which was originally a fork of Postgres. The need for a stable, open source alternative became acute when Greenplum went closed-source in May 2024. EDB forked the project from its last open source version, demonstrating our commitment to carrying this open source, Postgres-based data warehouse forward. For existing Greenplum users, WarehousePG is a frictionless, zero-migration binary swap — backed by enterprise support and stability from EDB PG AI.

What is the core advantage of WarehousePG over cloud big data providers like Snowflake or Databricks?

The core advantage is complete sovereignty and predictable cost control. Unlike cloud-only models, WarehousePG can be deployed on-prem, in the cloud, or in a hybrid configuration, which supports full data residency compliance. EDB PG AI is also priced per core, eliminating the unpredictable, escalating consumption costs (credits/DBUs) associated with proprietary cloud platforms.

How does WarehousePG accelerate our advanced analytics and AI/ML initiatives?

WarehousePG accelerates time-to-value by providing the data foundation for advanced analytics and AI. It features native vector capabilities (via the pgvector extension), in-database machine learning, and comprehensive external data access. When these capabilities combine with the pipeline automation and agent building capabilities of EDB PG AI Factory, customers have a complete solution — from data foundation and model training to agent orchestration — to power cutting-edge analytics agents.

How does WarehousePG simplify migration from existing legacy data warehouses?

WarehousePG simplifies migration by leveraging its foundation in Postgres and high SQL parity. For existing Greenplum users on versions 6.x and 7.x, transitioning to WarehousePG is a zero-migration binary swap that can be done in hours, not months. Migration is low-risk for other SQL-based systems thanks to WarehousePG’s high SQL parity and 24x7 support from EDB's world-class team of WarehousePG and Postgres committers.

What if we still need to query data stored in our existing data lake or lakehouse (e.g. S3, HDFS)?

WarehousePG supports comprehensive external data access through the Platform Extension Framework (PXF). This allows you to query and join data stored outside the warehouse (formats like JSON, Parquet, and Avro in data lakes like Amazon S3, HDFS, or MinIO) using standard SQL, breaking down data silos and eliminating complex ETL pipelines.
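
As a rough illustration of what this looks like in practice, the Python sketch below assembles an external table definition in the style documented for Greenplum PXF (a `pxf://` location with `PROFILE` and `SERVER` options, and the `pxfwritable_import` formatter). It assumes WarehousePG accepts the same syntax; check the WarehousePG documentation before relying on it. The helper name `pxf_external_table_ddl`, the table and column names, the bucket path, and the server name `s3_example` are all hypothetical.

```python
def pxf_external_table_ddl(table, columns, path, profile, server):
    """Build a readable external table statement in Greenplum PXF style.

    Illustrative only: values are interpolated without validation, so this
    helper must not be used with untrusted input.
    """
    column_sql = ", ".join(f"{name} {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE {table} ({column_sql})\n"
        f"LOCATION ('pxf://{path}?PROFILE={profile}&SERVER={server}')\n"
        "FORMAT 'CUSTOM' (FORMATTER='pxfwritable_import');"
    )


# Hypothetical Parquet files in an S3 bucket, exposed as a queryable table.
ddl = pxf_external_table_ddl(
    table="ext_orders",
    columns=[("order_id", "bigint"), ("amount", "numeric")],
    path="example-bucket/orders/",
    profile="s3:parquet",
    server="s3_example",
)
print(ddl)
```

Once a table like this exists, it can be joined to regular warehouse tables with ordinary SQL, which is how the data lake becomes queryable without a separate ETL step.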