Estimate Parquet scan bytes for SELECT queries without executing them
Maintainer(s):
aleda145
Installing and Loading
INSTALL dryrun FROM community;
LOAD dryrun;
Example
SELECT *
FROM dryrun(
'SELECT * FROM ''https://dryrun-data.dahl.dev/gaia-5m.parquet'''
);
About dryrun
Dryrun adds a table function for estimating Parquet scan bytes before running a query.
It parses a read-only SELECT statement, finds explicit Parquet scan sources, determines required columns, uses Parquet metadata for compressed column-chunk sizes, and applies simple row-group pruning from min/max statistics when possible.
The result reports estimated data bytes, estimated Parquet metadata bytes, files, row groups used versus total row groups, confidence, and notes (if any).
Added Functions
| function_name | function_type | description | comment | examples |
|---|---|---|---|---|
| dryrun | table | NULL | NULL |
Overloaded Functions
This extension does not add any function overloads.
Added Types
This extension does not add any types.
Added Settings
This extension does not add any settings.