Scalable Feature Engineering with Tecton on Athena – Derek Salama, Tecton

Scalable Feature Engineering with Tecton on Athena – Derek Salama, Tecton

Tecton is the leading feature platform for real-time machine learning. Rather than build new SQL engines from scratch, Tecton connects to your existing engine to transform raw data into features for machine learning. This talk will cover Tecton’s new integration with Athena for feature engineering. Derek will demonstrate how Tecton with Athena is the fastest way to build feature pipelines and put new models in production.

Querying streaming data with Presto, Amazon Athena and Upsolver

Querying streaming data with Presto, Amazon Athena and Upsolver

In this session, Yoni will present on querying streaming data with Presto and Amazon Athena including performance, data partitioning and compaction. In addition, we will demo using the Upsolver platform with Amazon Athena. In addition, he will share what they are working on with Prestodb.

Presto on Kafka at Scale – Yang Yang & Yupeng Fu, Uber

Presto on Kafka at Scale – Yang Yang & Yupeng Fu, Uber

Presto is a popular distributed SQL query engine for running interactive analytic queries. Presto provides a Connector API that allows plugins to dozens of data sources, and thus positions itself as a single point of access to a wide variety of data. At Uber, we significantly improved Presto’s Kafka connector to meet Uber’s scale. For example, the new connector allows dynamic Kafka cluster and topic discovery so users can directly query existing Kafka topics without any registration and onboarding process; dynamic schema discovery allows fetching the latest schema without any Presto restart or deployment; smart time range suggestions to users based on Kafka metadata analysis to avoid large-range scans and thus keep the query interactive.