Spark Notebooks Overview
Spark Notebook Features
Spark Notebook Dashboards
Basic Notebook Tuning
Spark for Data Scientists
This course is designed to familiarize you with Spark functionality in Qubole.
This course introduces you to key best practices related to Spark Notebooks & Tuning in Qubole. By leveraging the features provided by Spark, you’ll help your enterprise lower costs and increase the productivity of your data teams.
Estimated time to complete this course: 30 mins.
Notebooks are often used by Data Scientists because they are convenient for quick exploration tasks. Once set up, a Notebook provides a convenient way to save, share and re-run a set of queries on a data source. In this lesson you'll learn about:
- The Spark Toolkit
- Notebook Features & Permissions
- Using Packages
- Deep Learning in Qubole
- Integrating Jars
- Notebook API
In this section you’ll learn the following key concepts for tuning Notebooks in Qubole.
- Notebook Interpreters
- Cache Management
- Data Format
- Garbage Collection
- Resource Manager & Notebook Troubleshooting
- Notebook Logs
Recommended Follow Up:
Course Version & Product Release
This course is based on Release 54 - to see the latest updates to Qubole please refer to the release notes in our documentation.