Presto for Data Analysts
This course is designed to help you understand how Presto integrates with Qubole to provide your team with fast, inexpensive and scalable data processing.
Presto is an open source distributed SQL query engine designed for running interactive analytics against data of all sizes stored in cloud repositories. In this course you'll learn about Presto Commands and Tuning, including syntax best practices as well as the causes of job lag and job failure and options to help prevent these scenarios.
Estimated time to complete this course: 30 mins.
In this section, you’ll learn about Presto’s integration with Qubole to provide efficient Ad-Hoc querying of your data. Topics include:
- Presto Overview
- Ad Hoc Analysis
- RDMS & Hive Comparisons
- Hive Metadata
- Qubole Partioning
- The Presto Execution Model
- Presto Notebooks
In this lesson you’ll learn about a few tricks available for optimizing your Presto query performance. Topics include:
- Approximate Agg
- Limiting ORDER BY
- GROUP BY
- Join Behavior
- Columnar Formats & Partitions
- Job Lag, Failure and Tuning
Recommended Follow Up:
Course Version & Product Release
This course is based on Release 50 - to see the latest updates to Qubole please refer to the release notes in our documentation:
What's New In Qubole Release 52: http://docs.qubole.com/en/latest/release-notes/releasenotesR52/index.html