Common Sub Expression Optimization at Facebook

In complex analytics queries, we often see repeated expressions: for example, parsing the same JSON column to extract different fields, or elaborate CASE statements that share some predicates and differ in others. Previously, Presto computed the same expression as many times as it appeared in the query. With common sub-expression optimization, each such expression is evaluated only once within the same project or filter operator. In our workload, we’ve seen 3x improvements on certain queries with expensive common sub-expressions like JSON_PARSE. Microbenchmarks also show a consistent ~10% performance improvement for simple common sub-expressions like x + y. In this talk, we will explain how this is implemented.
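
As a rough illustration of the idea only, and not Presto's actual implementation, the following Python sketch evaluates several projections over one row while sharing a per-row cache, so a repeated sub-expression such as a JSON parse runs once. The tuple-based expression format and the names `evaluate`, `project`, and `stats` are assumptions made up for this example.

```python
import json

# Hypothetical counter so we can observe how often the expensive call runs.
stats = {"parse_calls": 0}


def json_parse(text):
    """Stand-in for an expensive function such as JSON_PARSE."""
    stats["parse_calls"] += 1
    return json.loads(text)


def evaluate(expr, row, cache):
    """Evaluate a nested-tuple expression, reusing cached sub-results.

    Supported forms (all hypothetical, for illustration):
      ("col", name)          read a column from the row
      ("json_parse", expr)   parse the result of expr as JSON
      ("get", expr, key)     look up key in the result of expr
    """
    if expr in cache:
        return cache[expr]
    op = expr[0]
    if op == "col":
        value = row[expr[1]]
    elif op == "json_parse":
        value = json_parse(evaluate(expr[1], row, cache))
    elif op == "get":
        value = evaluate(expr[1], row, cache)[expr[2]]
    else:
        raise ValueError(f"unknown operator: {op}")
    cache[expr] = value
    return value


def project(projections, row):
    """Evaluate all projections for one row with a shared cache."""
    cache = {}  # fresh per row; shared by every projection in this operator
    return [evaluate(p, row, cache) for p in projections]
```

With a shared sub-expression `parsed = ("json_parse", ("col", "payload"))`, projecting both `("get", parsed, "a")` and `("get", parsed, "b")` over a row calls `json_parse` once instead of twice, because structurally equal tuples hash to the same cache key.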

Presto Connector for DataCTRL – Mario Ceste, Jr., SAP NS2

DataCTRL is a data management platform for ingesting large quantities of disparate data sets. We’ve written a connector for Presto that lets our users query the data they’ve ingested with SQL. Integrating Presto with our platform has given our customers a quick and effective way to query their data while also building additional data products.