GPU-Accelerated Presto C++ Is Here: Nightly Images for NVIDIA GPUs
TL;DR: GPUs can run analytical SQL far faster than CPUs. Published numbers show single-node Presto C++ with GPU-accelerated operators completing a TPC-H-style benchmark in ~100 seconds versus ~1,200 seconds on a high-end CPU, roughly 12× faster, and a UCX/NVLink exchange running more than 6× faster with multi-node Presto. Together…