Data Inlining in DuckLake: Unlocking Streaming for Data Lakes
TL;DR: DuckLake’s data inlining stores small updates directly in the catalog, eliminating the “small files problem” and making continuous streaming into data lakes practical. In our benchmark, queries ran 926× faster and ingestion ran 105× faster than with Iceberg.