dryrun

Estimate Parquet scan bytes for SELECT queries without executing them

Maintainer(s): aleda145

Installing and Loading

INSTALL dryrun FROM community;
LOAD dryrun;

Example

SELECT *
FROM dryrun(
  'SELECT * FROM ''https://dryrun-data.dahl.dev/gaia-5m.parquet'''
);

About dryrun

Dryrun adds a table function for estimating Parquet scan bytes before running a query.

It parses a read-only SELECT statement, finds the explicit Parquet scan sources, and determines which columns the query requires. It then reads compressed column-chunk sizes from the Parquet metadata and, when possible, applies simple row-group pruning based on min/max statistics.
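The estimation idea described above can be illustrated with a small, self-contained Python sketch. This is not the extension's implementation: the names `ColumnChunk`, `may_match`, and `estimate_scan_bytes`, the single-comparison predicate format, and the metadata values are all hypothetical and chosen only for illustration. The sketch assumes each row group exposes a compressed size and min/max statistics per column, sums the compressed sizes of the required columns, and skips a row group only when its statistics prove that no row can match.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass
class ColumnChunk:
    """Hypothetical per-column metadata for one row group."""
    compressed_bytes: int  # compressed size of the column chunk
    min_value: float       # min statistic for the chunk
    max_value: float       # max statistic for the chunk


# One row group maps column name -> chunk metadata.
RowGroup = Dict[str, ColumnChunk]
# Hypothetical predicate format: (column, operator, literal).
Predicate = Tuple[str, str, float]


def may_match(chunk: ColumnChunk, op: str, literal: float) -> bool:
    """Return False only when min/max prove no row satisfies `column op literal`."""
    if op == ">":
        return chunk.max_value > literal
    if op == ">=":
        return chunk.max_value >= literal
    if op == "<":
        return chunk.min_value < literal
    if op == "<=":
        return chunk.min_value <= literal
    if op == "=":
        return chunk.min_value <= literal <= chunk.max_value
    return True  # unknown operator: be conservative and keep the row group


def estimate_scan_bytes(
    row_groups: List[RowGroup],
    required_columns: List[str],
    predicate: Optional[Predicate] = None,
) -> Tuple[int, int, int]:
    """Return (estimated bytes, row groups used, total row groups)."""
    used = 0
    total_bytes = 0
    for rg in row_groups:
        if predicate is not None:
            column, op, literal = predicate
            if not may_match(rg[column], op, literal):
                continue  # pruned: statistics rule out every row
        used += 1
        total_bytes += sum(rg[name].compressed_bytes for name in required_columns)
    return total_bytes, used, len(row_groups)


# Made-up metadata for a file with three row groups.
row_groups = [
    {"ra": ColumnChunk(1000, 0, 120), "dec": ColumnChunk(900, -90, 0),
     "parallax": ColumnChunk(400, 0.0, 2.0)},
    {"ra": ColumnChunk(1100, 120, 240), "dec": ColumnChunk(950, -10, 40),
     "parallax": ColumnChunk(420, 1.0, 8.0)},
    {"ra": ColumnChunk(1050, 240, 360), "dec": ColumnChunk(920, 30, 90),
     "parallax": ColumnChunk(410, 6.0, 20.0)},
]

# Roughly "SELECT ra FROM t WHERE parallax > 5": needs ra and parallax.
filtered = estimate_scan_bytes(row_groups, ["ra", "parallax"], ("parallax", ">", 5.0))
# Roughly "SELECT ra FROM t": needs only ra, nothing can be pruned.
unfiltered = estimate_scan_bytes(row_groups, ["ra"])
print(filtered, unfiltered)
```

In this made-up data, the first row group is skipped for the filtered query because its maximum `parallax` is below the literal, so only two of three row groups contribute bytes. Note that the column used in the filter counts as a required column even though it is not in the select list.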

The result reports estimated data bytes, estimated Parquet metadata bytes, files, row groups used versus total row groups, confidence, and notes (if any).

Added Functions

function_name | function_type | description | comment | examples
dryrun        | table         | NULL        | NULL    |

Overloaded Functions

This extension does not add any function overloads.

Added Types

This extension does not add any types.

Added Settings

This extension does not add any settings.