Spark for Data Analysts
This course familiarizes you with Spark Commands and Tuning in Qubole and introduces the key best practices for both.
Estimated time to complete this course: 30 mins.
Spark is a distributed processing solution whose toolkit of programming languages and supplemental core components makes it appealing to enterprises that need to do more than run simple SQL queries. In this lesson you'll learn about:
- The Spark Toolkit
- Datasets & DataFrames
- Spark Notebooks
Tuning Spark in Qubole helps optimize your queries. In this lesson you'll learn how to tune:
- Spark Executors
- Notebook Interpreters
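To give a feel for what tuning Spark Executors involves, here is a minimal sketch of one common sizing heuristic. It is not Qubole's method or its defaults: the function `plan_executors`, the class `ExecutorPlan`, the reserved core and memory figures, the 10% overhead allowance, and the example node size are all assumptions chosen for illustration. Only the three property names in the output (`spark.executor.cores`, `spark.executor.memory`, `spark.executor.instances`) are standard Spark settings.

```python
from dataclasses import dataclass


@dataclass
class ExecutorPlan:
    """Hypothetical container for a per-node executor layout."""
    executors_per_node: int
    executor_cores: int
    executor_memory_gb: int

    def as_spark_conf(self, num_nodes: int) -> dict:
        """Render the plan as standard Spark configuration properties."""
        return {
            "spark.executor.cores": str(self.executor_cores),
            "spark.executor.memory": f"{self.executor_memory_gb}g",
            "spark.executor.instances": str(self.executors_per_node * num_nodes),
        }


def plan_executors(node_cores, node_memory_gb, cores_per_executor=4,
                   reserved_cores=1, reserved_memory_gb=1,
                   overhead_fraction=0.10):
    """Size executors for one worker node (illustrative heuristic only).

    Assumptions: one core and 1 GB per node are left for the OS and
    cluster daemons, and each executor needs extra off-heap memory
    equal to `overhead_fraction` of its heap.
    """
    usable_cores = node_cores - reserved_cores
    executors_per_node = usable_cores // cores_per_executor
    if executors_per_node < 1:
        raise ValueError("node has too few cores for one executor")

    usable_memory_gb = node_memory_gb - reserved_memory_gb
    total_per_executor_gb = usable_memory_gb / executors_per_node
    # Heap is what remains after setting aside the overhead allowance.
    heap_gb = int(total_per_executor_gb / (1 + overhead_fraction))
    return ExecutorPlan(executors_per_node, cores_per_executor, heap_gb)


if __name__ == "__main__":
    # Hypothetical cluster: 4 worker nodes, each with 16 cores and 64 GB.
    plan = plan_executors(node_cores=16, node_memory_gb=64)
    for key, value in plan.as_spark_conf(num_nodes=4).items():
        print(f"{key}={value}")
```

Treat the resulting values as a starting point only; the right settings depend on the workload and on the cluster configuration covered in the lesson.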
Recommended Follow Up:
Course Version & Product Release
This course is based on Release 50. To see the latest updates to Qubole, refer to the release notes in our documentation:
What's New In Qubole Release 52: http://docs.qubole.com/en/latest/release-notes/releasenotesR52/index.html