PrestoDB Blog - PrestoDB

Time Travel for Iceberg Tables in Presto

By Ajaykumar Gupte April 16, 2024April 16, 2024

Introduction Presto, an open-source distributed SQL query engine, excels at querying large data sets distributed across diverse data sources. It has maintained its position as a high-performance data analytics tool for over a decade. As data collection capabilities expand, businesses increasingly recognize the importance of historical data alongside current data. With the recent release of…

5 design choices—and 1 weird trick — to get 2x efficiency gains in Presto repartitioning

By Ying Su, Orri Erling, Tim Meehan, Sahar Massachi, Bhavani Hari & Maria Basmanova December 20, 2019September 21, 2023

We like Presto. We like it a lot — so much we want to make it better in every way. Here’s an example: we just optimized the PartitionedOutputOperator. It’s now 2-3x more CPU efficient, which, when measured against Facebook’s production workload, translates to 6% gains overall. That’s huge. The optimized repartitioning is in use on…