Presto for Data Analysts

Presto for Data Analysts

This course is designed to help you understand how Presto integrates with Qubole to provide your team with fast, inexpensive and scalable data processing.

ABOUT THIS COURSE

LEARNING FORMAT:

Self-paced

DESCRIPTION:

Presto is an open source distributed SQL query engine designed for running interactive analytics against data of all sizes stored in cloud repositories. In this course you'll learn about Presto Commands and Tuning, including syntax best practices as well as the causes of job lag and job failure and options to help prevent these scenarios.

Estimated time to complete this course: 30 mins.

Recommended Prerequisites:

Presto Commands

In this section, you’ll learn about Presto’s integration with Qubole to provide efficient Ad-Hoc querying of your data. Topics include:

  • Presto Overview
  • Ad Hoc Analysis
  • RDMS & Hive Comparisons
  • Hive Metadata
  • Qubole Partioning
  • The Presto Execution Model
  • Presto Notebooks

Presto Tuning

In this lesson you’ll learn about a few tricks available for optimizing your Presto query performance. Topics include:

  • Approximate Agg
  • Limiting ORDER BY
  • GROUP BY
  • Join Behavior
  • Columnar Formats & Partitions
  • Job Lag, Failure and Tuning

Recommended Follow Up:

Course Version & Product Release

This course is based on Release 50 - to see the latest updates to Qubole please refer to the release notes in our documentation:

What's New In Qubole Release 52: http://docs.qubole.com/en/latest/release-notes/releasenotesR52/index.html

 

Curriculum

  • Course Introduction
  • Course Terminology
  • Presto Introduction
  • Presto Commands
  • Presto Notebooks
  • Presto Commands Best Practices
  • Basic Presto Tuning
  • Course Conclusion

ABOUT THIS COURSE

LEARNING FORMAT:

Self-paced

DESCRIPTION:

Presto is an open source distributed SQL query engine designed for running interactive analytics against data of all sizes stored in cloud repositories. In this course you'll learn about Presto Commands and Tuning, including syntax best practices as well as the causes of job lag and job failure and options to help prevent these scenarios.

Estimated time to complete this course: 30 mins.

Recommended Prerequisites:

Presto Commands

In this section, you’ll learn about Presto’s integration with Qubole to provide efficient Ad-Hoc querying of your data. Topics include:

  • Presto Overview
  • Ad Hoc Analysis
  • RDMS & Hive Comparisons
  • Hive Metadata
  • Qubole Partioning
  • The Presto Execution Model
  • Presto Notebooks

Presto Tuning

In this lesson you’ll learn about a few tricks available for optimizing your Presto query performance. Topics include:

  • Approximate Agg
  • Limiting ORDER BY
  • GROUP BY
  • Join Behavior
  • Columnar Formats & Partitions
  • Job Lag, Failure and Tuning

Recommended Follow Up:

Course Version & Product Release

This course is based on Release 50 - to see the latest updates to Qubole please refer to the release notes in our documentation:

What's New In Qubole Release 52: http://docs.qubole.com/en/latest/release-notes/releasenotesR52/index.html

 

Curriculum

  • Course Introduction
  • Course Terminology
  • Presto Introduction
  • Presto Commands
  • Presto Notebooks
  • Presto Commands Best Practices
  • Basic Presto Tuning
  • Course Conclusion