paimon

Query Apache Paimon tables directly from DuckDB

Maintainer(s): mrdrivingduck

Installing and Loading

INSTALL paimon FROM community;
LOAD paimon;

Example

-- Query local Paimon tables
SELECT * FROM paimon_scan('./data/testdb.db/testtbl');

-- Configure OSS credentials for remote tables
CREATE SECRET my_oss (
    TYPE paimon,
    key_id 'your-access-key-id',
    secret 'your-access-key-secret',
    endpoint 'oss-cn-hangzhou.aliyuncs.com'
);

-- Query Paimon tables on OSS
SELECT * FROM paimon_scan('oss://your-bucket/warehouse', 'your_db', 'your_table');

-- Attach as catalog
ATTACH 'oss://my-bucket/warehouse' AS paimon_lake (TYPE paimon);
SHOW ALL TABLES IN paimon_lake;

About paimon

The Paimon extension enables DuckDB to read and query Apache Paimon format data directly. Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.

Key features:

  • Zero JVM dependency: pure C++ implementation
  • Apache Arrow data exchange for zero-copy transfers
  • Parallel scan architecture utilizing multi-core CPUs
  • Secure credential management via DuckDB's Secret Manager
  • Support for local storage and remote OSS (Alibaba Cloud Object Storage Service) storage
  • Projection pushdown optimization
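
As a rough sketch of how projection pushdown and the parallel scan might be exercised, the queries below reuse the local table path from the example above. The column names id and name are hypothetical placeholders, and it is an assumption that the extension's scan follows DuckDB's standard threads setting.

```sql
-- Hypothetical columns (id, name): replace them with columns from your own table.
-- Naming only the columns you need gives projection pushdown something to prune.
SELECT id, name
FROM paimon_scan('./data/testdb.db/testtbl');

-- Print the query plan to check which columns the scan operator requests.
EXPLAIN
SELECT id, name
FROM paimon_scan('./data/testdb.db/testtbl');

-- Standard DuckDB setting that caps the number of worker threads.
-- Assumption: the Paimon scan parallelism is bounded by this value.
SET threads = 4;
```

Comparing the EXPLAIN output of the query above with that of a SELECT * query is one way to confirm that fewer columns are being read.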

Built on top of paimon-cpp, this extension brings DuckDB's powerful local analytics to the Paimon data lake ecosystem.

For more information, visit the extension repository.