Presto on Kafka at Scale – Yang Yang & Yupeng Fu, Uber

Presto is a popular distributed SQL query engine for running interactive analytic queries. Its Connector API lets plugins connect to dozens of data sources, which positions Presto as a single point of access to a wide variety of data. At Uber, we significantly improved Presto’s Kafka connector to meet Uber’s scale. For example, the new connector supports dynamic Kafka cluster and topic discovery, so users can query existing Kafka topics directly without any registration or onboarding process; dynamic schema discovery, which fetches the latest schema without a Presto restart or deployment; and smart time-range suggestions, based on Kafka metadata analysis, that help users avoid large-range scans and keep queries interactive.

Top 10 Reasons to Use & Contribute to Presto – Steven Mih, Ahana

Presto is complex, with many intricacies. Ahana Cloud is the only managed service for Presto on AWS that simplifies Presto, bringing its power to platform teams of any size or skill set. In this session we’ll give you a quick overview of Ahana Cloud, including how to manage multiple Presto clusters seamlessly and query a range of data sources, as well as its just-released capabilities.