PrestoDB Blog - PrestoDB

Hudi tables via Presto-Hive connector: A Deep Dive

By Pratyaksh Sharma May 30, 2023September 14, 2023

With the growing popularity of the lakehouse approach, it has become increasingly important for query engines to support these new formats such as Hudi. A previous blog discusses the evolution of presto-hudi integration via hive connector at a high level. With the latest community developments, a separate presto-hudi connector has come up but it is…

Customer-Facing Presto at Rippling – Andy Li, Rippling

By Ali LeClerc January 9, 2023September 14, 2023

Last month we hosted PrestoCon, a return to in-person events that showcased the community development of Presto. In this blog we’ll detail Rippling’s presentation on their Presto use case, including their architecture, key optimizations, and hard earned lessons. You can also check out their full presentation here. Background Rippling is a popular HR and payroll…

PrestoDB and Apache Hudi

By Bhavani Sudha Saktheeswaran August 4, 2020September 21, 2023

Apache Hudi is a fast growing data lake storage system that helps organizations build and manage petabyte-scale data lakes. Hudi brings stream style processing to batch-like big data by introducing primitives such as upserts, deletes and incremental queries. These features help surface faster, fresher data on a unified serving layer. Hudi tables can be stored…

Running Presto in a Hybrid Cloud Architecture

By Adit Madan July 17, 2020September 21, 2023

Migrating SQL workloads from a fully on-premise environment to cloud infrastructure has numerous benefits, including alleviating resource contention and reducing costs by paying for computation resources on an on-demand basis. In the case of Presto running on data stored in HDFS, the separation of compute in the cloud and storage on-premises is apparent since Presto’s…