Query Apache Paimon tables directly from DuckDB
Maintainer(s): mrdrivingduck
Installing and Loading
INSTALL paimon FROM community;
LOAD paimon;
Example
-- Query a local Paimon table by its table path
SELECT * FROM paimon_scan('./data/testdb.db/testtbl');
-- Configure OSS credentials for remote tables
CREATE SECRET my_oss (
TYPE paimon,
key_id 'your-access-key-id',
secret 'your-access-key-secret',
endpoint 'oss-cn-hangzhou.aliyuncs.com'
);
-- Query a Paimon table on OSS by warehouse path, database name, and table name
SELECT * FROM paimon_scan('oss://your-bucket/warehouse', 'your_db', 'your_table');
-- Attach a Paimon warehouse as a catalog and list its tables
ATTACH 'oss://my-bucket/warehouse' AS paimon_lake (TYPE paimon);
SHOW ALL TABLES IN paimon_lake;
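Once a warehouse is attached, its tables can presumably be queried with DuckDB's usual catalog.schema.table naming. The sketch below assumes that each Paimon database appears as a schema under the attached catalog, which this page does not confirm. your_db and your_table are the same placeholders used in the paimon_scan example, not real objects.

```sql
-- Assumption: a Paimon database maps to a DuckDB schema inside the attached catalog.
SELECT *
FROM paimon_lake.your_db.your_table
LIMIT 10;

-- Detach the catalog when it is no longer needed.
DETACH paimon_lake;
```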
About paimon
The Paimon extension lets DuckDB read and query data stored in the Apache Paimon format directly. Apache Paimon is a lake format for building a real-time lakehouse architecture with Flink and Spark, covering both streaming and batch operations.
Key features:
- Zero JVM dependency: pure C++ implementation
- Apache Arrow data exchange for zero-copy transfers
- Parallel scan architecture utilizing multi-core CPUs
- Secure credential management via DuckDB's Secret Manager
- Support for local storage and remote OSS (Alibaba Cloud Object Storage Service) storage
- Projection pushdown optimization
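As a rough illustration of the projection pushdown item above, naming only the columns you need should let the scan skip the rest instead of reading every column. This is a sketch under stated assumptions: id and name are hypothetical column names, and the table path is the placeholder from the Example section.

```sql
-- Sketch only: id and name are hypothetical columns, not taken from a real table.
-- Listing columns explicitly (instead of SELECT *) gives projection pushdown
-- the chance to read only those columns from the Paimon data files.
SELECT id, name
FROM paimon_scan('./data/testdb.db/testtbl');

-- EXPLAIN prints the query plan without running the query, so the scan node
-- can be inspected. The exact plan text depends on the DuckDB and extension versions.
EXPLAIN
SELECT id, name
FROM paimon_scan('./data/testdb.db/testtbl');
```

The benefit is likely to be largest on wide tables where a query touches only a few columns.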
Built on top of paimon-cpp, this extension brings DuckDB's powerful local analytics to the Paimon data lake ecosystem.
For more information, visit the extension repository.