Author: Orri Erling

Software Engineer at Facebook

Even Faster Unnest
By Ying Su, Maria Basmanova & Orri Erling, August 20, 2020 (updated September 21, 2023)
Unnest is a common operation in Facebook’s daily Presto workload. It converts an ARRAY, MAP, or ROW into a flat relation. Its original implementation always made deep copies, which was very inefficient. In Unnest Operator Performance Enhancement with Dictionary Blocks, the author improved the Unnest operator by up to 10x in CPU and…
5 design choices—and 1 weird trick — to get 2x efficiency gains in Presto repartitioning
By Ying Su, Orri Erling, Tim Meehan, Sahar Massachi, Bhavani Hari & Maria Basmanova, December 20, 2019 (updated September 21, 2023)
We like Presto. We like it a lot — so much we want to make it better in every way. Here’s an example: we just optimized the PartitionedOutputOperator. It’s now 2-3x more CPU efficient, which, when measured against Facebook’s production workload, translates to 6% gains overall. That’s huge. The optimized repartitioning is in use on…
Table Scan: Doing The Right Thing With Structured Types
By Orri Erling, September 26, 2019 (updated September 21, 2023)
In the previous article we saw what gains are possible when filtering early and in the right order. In this article we look at how we do this with nested and structured types. We use the 100G TPC-H dataset, but now we group top level columns into structs or maps. Maps, lists and structs are…
Complete Table Scan: A Quantitative Assessment
By Orri Erling, July 29, 2019 (updated September 21, 2023)
In the previous article we looked at the abstract problem statement and possibilities inherent in scanning tables. In this piece we look at the quantitative upside with Presto. We look at a number of queries and explain the findings. The initial impulse motivating this work is the observation that table scan is by far the…
Everything You Always Wanted To Do in Table Scan
By Orri Erling, Maria Basmanova, Ying Su, Tim Meehan & Elon Azoulay, June 29, 2019 (updated September 21, 2023)
Table scan, on the face of it, sounds trivial and boring. What’s there in just reading a long bunch of records from first to last? Aren’t indexing and other kinds of physical design more interesting? As data has gotten bigger, the columnar table scan has only gotten more prominent. The columnar scan is a fairly…
Introducing the Presto blog
By Orri Erling, June 28, 2019 (updated September 21, 2023)
Presto is a key piece of data infrastructure at many companies. The community has many ongoing projects for taking it to new levels of performance and functionality, as well as unique experience and insight into the challenges of scale. We are opening this blog as an informal channel for discussing our work as well as technology trends and…