Query Apache Paimon tables directly from DuckDB
Maintainer(s):
mrdrivingduck
Installing and Loading
INSTALL paimon FROM community;
LOAD paimon;
Example
-- Query local Paimon tables
SELECT * FROM paimon_scan('./data/testdb.db/testtbl');
-- Configure OSS credentials for remote tables
CREATE SECRET my_oss (
TYPE paimon,
key_id 'your-access-key-id',
secret 'your-access-key-secret',
endpoint 'oss-cn-hangzhou.aliyuncs.com'
);
-- Query Paimon tables on OSS
SELECT * FROM paimon_scan('oss://your-bucket/warehouse', 'your_db', 'your_table');
-- Attach as catalog
ATTACH 'oss://my-bucket/warehouse' AS paimon_lake (TYPE paimon);
SHOW ALL TABLES IN paimon_lake;
About paimon
The Paimon extension enables DuckDB to read and query Apache Paimon format data directly. Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch operations.
Key features:
- Zero JVM dependency — Pure C++ implementation
- Apache Arrow data exchange for zero-copy transfers
- Parallel scan architecture utilizing multi-core CPUs
- Secure credential management via DuckDB's Secret Manager
- Support for local and remote OSS storage
- Projection pushdown optimization
Built on top of paimon-cpp, this extension brings DuckDB's powerful local analytics to the Paimon data lake ecosystem.
For more information, visit the extension repository.
Added Functions
This extension does not add any functions.
Overloaded Functions
This extension does not add any function overloads.
Added Types
This extension does not add any types.
Added Settings
This extension does not add any settings.