Presto 8.8 Tutorial
Since Presto 8.8 is a historical release (from the 8.x line, before the Presto/Trino split), this guide focuses on the key features available in Presto 8.8 and how to use them.
Step 2: Create Configuration Directories
mkdir -p etc/catalog
6.2 Running a Vectorized Query (Velox in action)
Let's simulate a large aggregation. Presto 8.8 will automatically use the vectorized engine if enabled.
-- Switch to the Iceberg catalog (assuming you have a 'sales' table)
USE iceberg.default;
-- Sample query leveraging 8.8's vectorization
SELECT region, COUNT(*) AS total_sales, SUM(amount) AS revenue
FROM sales
WHERE sale_date BETWEEN DATE '2024-01-01' AND DATE '2024-12-31'
GROUP BY region
ORDER BY revenue DESC;
Check the Query Execution Timeline in the Web UI. You should see "Vectorized Operators" lit up. Since Presto 8
Part 1: What’s New in Presto 8.8?
Before diving into commands, let’s highlight the key improvements in this version. Understanding these will help you leverage the tutorial effectively.
- Improved Fault-Tolerant Execution: Version 8.8 introduces better spill-to-disk strategies for memory-intensive queries.
- Dynamic Filtering Enhancements: Faster joins between large tables due to smarter runtime filters.
- Geospatial Functions: Upgraded GIS support with new ST_ functions for location-based analytics.
- Connector Upgrades: Native improvements to the Delta Lake, Iceberg, and Cassandra connectors.
- Security: Role-based access control (RBAC) now fully integrated with LDAP and OAuth 2.0.
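To make the dynamic filtering bullet above more concrete, here is a small conceptual sketch in Python. It is not Presto's implementation: the function name, the table contents, and the column names are all made up for illustration. It only shows the general idea of collecting join keys from the smaller (build) side at runtime and using them to skip probe-side rows before the join does any work on them.

```python
# Conceptual sketch of dynamic filtering; this is NOT Presto's implementation.
# The function name, rows, and column names are invented for illustration.

def hash_join_with_dynamic_filter(build_rows, probe_rows, key):
    """Join two lists of dicts on `key`, pruning the probe side first."""
    # 1. Build phase: index the smaller (build) side by join key.
    build_index = {}
    for row in build_rows:
        build_index.setdefault(row[key], []).append(row)

    # 2. The runtime filter is the set of keys seen on the build side.
    runtime_filter = set(build_index)

    # 3. Probe phase: rows whose key is not in the filter are skipped
    #    before any join work is done on them.
    scanned = [row for row in probe_rows if row[key] in runtime_filter]

    joined = []
    for probe in scanned:
        for build in build_index[probe[key]]:
            joined.append({**build, **probe})
    return joined, len(scanned)


customers = [{"customer_id": 1, "name": "Ada"}, {"customer_id": 3, "name": "Lin"}]
orders = [
    {"customer_id": 1, "total": 40.0},
    {"customer_id": 2, "total": 15.0},
    {"customer_id": 3, "total": 22.5},
    {"customer_id": 2, "total": 9.0},
]

rows, probe_rows_kept = hash_join_with_dynamic_filter(customers, orders, "customer_id")
print(probe_rows_kept, "of", len(orders), "probe rows survived the filter")
for r in rows:
    print(r)
```

In a real engine the benefit comes from applying the filter at the table scan, so the skipped rows are never read or shipped across the network; the list comprehension here only stands in for that step.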
Problem 2: "Failed to open iceberg table: missing manifest file"
Solution: Initialize the Iceberg catalog:
CALL iceberg.system.create_namespace('default', 'analytics');
Example Query Patterns
- Aggregation: SELECT region, COUNT(*) AS orders, SUM(amount) AS total FROM orders WHERE order_date BETWEEN DATE '2026-01-01' AND DATE '2026-03-31' GROUP BY region;
- Join across systems: SELECT c.customer_id, c.name, o.total FROM mysql.customers c JOIN hive.orders o ON c.customer_id = o.customer_id WHERE o.order_date >= DATE '2026-04-01';
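If you want to check what the aggregation pattern above returns before pointing it at a cluster, one option is to run the same shape of query against an in-memory SQLite table. This is only a sketch under assumptions: the sample rows are invented, SQLite's dialect is not Presto's (it has no DATE '...' literal, so ISO date strings are compared instead), and the count alias is renamed to order_count here to keep it distinct from the table name.

```python
import sqlite3

# In-memory stand-in for the `orders` table; the rows are invented sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (region TEXT, amount REAL, order_date TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        ("EMEA", 100.0, "2026-01-15"),
        ("EMEA", 50.0, "2026-03-31"),
        ("APAC", 75.0, "2026-02-10"),
        ("APAC", 20.0, "2026-04-01"),  # outside the date range, excluded
    ],
)

# Same shape as the aggregation pattern. ISO-8601 date strings sort
# chronologically, so BETWEEN on text behaves like a date range here.
query = """
    SELECT region, COUNT(*) AS order_count, SUM(amount) AS total
    FROM orders
    WHERE order_date BETWEEN '2026-01-01' AND '2026-03-31'
    GROUP BY region
    ORDER BY region
"""
result = conn.execute(query).fetchall()
for region, order_count, total in result:
    print(region, order_count, total)
conn.close()
```

Note that BETWEEN is inclusive on both ends, which is why the 2026-03-31 row is counted and the 2026-04-01 row is not.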
Conclusion
Presto 8.8 remains a functional tool for specific legacy environments. While it lacks the sleek UI of more modern tools, its stability and configuration depth make it a favorite for
Prerequisites
- Java 11 or 17 (OpenJDK recommended)
- Linux/macOS (or WSL2 for Windows)
- At least 4GB RAM allocated for Presto (8GB+ for heavier workloads)
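The Java requirement above can be checked with a short script. The sketch below parses the text that java -version prints and compares the major version against the two releases listed in the prerequisites. The helper names are hypothetical, and the sample string is just an example of the usual output format.

```python
import re

SUPPORTED_MAJORS = {11, 17}  # from the prerequisites list


def java_major_version(version_output):
    """Extract the major version from `java -version` output, or None."""
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    major = int(match.group(1))
    # Releases before Java 9 report themselves as 1.x (e.g. "1.8.0_292").
    if major == 1 and match.group(2) is not None:
        return int(match.group(2))
    return major


def is_supported(version_output):
    """True when the reported major version is one of SUPPORTED_MAJORS."""
    return java_major_version(version_output) in SUPPORTED_MAJORS


sample = 'openjdk version "17.0.8" 2023-07-18'
print(java_major_version(sample), is_supported(sample))
```

To use it against a real installation, pass in the stderr of subprocess.run(["java", "-version"], capture_output=True, text=True), since java -version writes to stderr rather than stdout.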
Step 5: Check the Optimized Plan
Use EXPLAIN to see CBO decisions:
EXPLAIN (TYPE DISTRIBUTED, FORMAT JSON)
SELECT c.customer_name, sum(s.amount)
FROM hive.default.sales s
JOIN hive.default.customers c ON s.customer_id = c.id
WHERE s.order_date = DATE '2024-01-15'
GROUP BY c.customer_name;
Look for:
- "distribution" : "REPLICATED" (broadcast join)
- "estimates" object showing row count estimates
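JSON plans get long quickly, so it can help to search them with a script instead of by eye. The sketch below walks a nested JSON document and collects every node that carries a "distribution" or "estimates" key. The sample plan is hypothetical and heavily trimmed: the node names, the "children" layout, and the outputRowCount field are assumptions for illustration, not the actual EXPLAIN output format.

```python
import json

# Hypothetical, heavily trimmed plan fragment. Real EXPLAIN output is larger
# and may be laid out differently; only the two keys to look for matter here.
sample_plan = json.loads("""
{
  "name": "Aggregate",
  "children": [
    {
      "name": "InnerJoin",
      "distribution": "REPLICATED",
      "estimates": {"outputRowCount": 1200},
      "children": [
        {"name": "TableScan", "estimates": {"outputRowCount": 50000}},
        {"name": "TableScan", "estimates": {"outputRowCount": 300}}
      ]
    }
  ]
}
""")


def find_key(node, key):
    """Yield every dict in a nested JSON structure that contains `key`."""
    if isinstance(node, dict):
        if key in node:
            yield node
        for value in node.values():
            yield from find_key(value, key)
    elif isinstance(node, list):
        for item in node:
            yield from find_key(item, key)


joins = [(n.get("name"), n["distribution"]) for n in find_key(sample_plan, "distribution")]
estimates = [n["estimates"] for n in find_key(sample_plan, "estimates")]
print(joins)
print(estimates)
```

Because find_key does not depend on any particular nesting, the same function should work on a plan saved from a real query with json.load, whatever the surrounding structure turns out to be.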