Training
Databricks
Optimizing Apache Spark™ on Databricks

7839 Reviews star_rate star_rate star_rate star_rate star_half

Optimizing Apache Spark™ on Databricks

In this course, you will explore the five key problems that represent the vast majority of performance issues in an Apache Spark application: skew, spill, shuffle, storage, and serialization. With...

View Full Schedule

$1,500 USD GSA $1,360.20

Course Code OPTSPARK

Duration 2 days

Available Formats Classroom, Virtual

Enter your Email to Download Full Course Details

Apr 22, 2024 - Apr 25, 2024 (4 days)

Language	Time
Virtual
English	2:00 PM – 6:00 PM EDT
Select delivery method/location (1 options)
Virtual \| 2:00 PM – 6:00 PM EDT Virtual \| 2:00 PM – 6:00 PM EDT

Enroll: Enroll

May 20, 2024 - May 23, 2024 (4 days)

Language	Time
Virtual
English	2:00 PM – 6:00 PM EDT
Select delivery method/location (1 options)
Virtual \| 2:00 PM – 6:00 PM EDT Virtual \| 2:00 PM – 6:00 PM EDT

Enroll: Enroll

In this course, you will explore the five key problems that represent the vast majority of performance issues in an Apache Spark application: skew, spill, shuffle, storage, and serialization. With examples based on 100 GB to 1+ TB datasets, you will investigate and diagnose sources of bottlenecks with the Spark UI and learn effective mitigation strategies. You will also discover new features introduced in Spark 3 that can automatically address common performance problems. Lastly, you learn how to design and configure clusters for optimal performance based on specific team needs and concerns.

Skills Gained

Articulate how the five most common performance problems in a Spark application can be mitigated to achieve better application performance
Summarize the most common performance problems associated with data ingestion and how to mitigate them
Articulate how new features in Spark 3.x can be employed to mitigate performance problems in your Spark applications
Configure a Spark cluster for maximum performance given specific job requirements

Prerequisites

Hands-on experience developing Apache Spark applications (6+ months). We recommend the Apache Spark Programming course to get started working with Spark.
Intermediate experience in Python or Scala

Course Details

Course Outline

Day 1

Review of Spark architecture and Spark UI
Skew
Spill
Shuffle
Storage
Serialization

Day 2

Ingestion basics
Predicate push downs
Disk partitioning
Z-ordering
Bucketing
Optimization with Adaptive Query Execution (AQE)
Designing and configuring clusters for high performance

Read Less

View Full Schedule

2 options available

Apr 22, 2024 - Apr 25, 2024 (4 days)

Language	Time
Virtual
English	2:00 PM – 6:00 PM EDT
Select delivery method/location (1 options)
Virtual \| 2:00 PM – 6:00 PM EDT Virtual \| 2:00 PM – 6:00 PM EDT

Enroll: Enroll

May 20, 2024 - May 23, 2024 (4 days)

Language	Time
Virtual
English	2:00 PM – 6:00 PM EDT
Select delivery method/location (1 options)
Virtual \| 2:00 PM – 6:00 PM EDT Virtual \| 2:00 PM – 6:00 PM EDT

Enroll: Enroll

When does class start/end?

Classes begin promptly at 9:00 am, and typically end at 5:00 pm.

Does the course schedule include a Lunchbreak?

Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate and juice are available all day in the kitchen. Fruit, muffins and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbooks - these can be picked up in the lobby or requested from one of our ExitCertified staff.

How can someone reach me during class?

If someone should need to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

How do I find an ExitCertified training location?

We have training locations across the United States and Canada. View a full list of classroom training locations.

Which delivery formats are available?

At ExitCertified we offer training that is Instructor-Led, Online, Virtual and Self-Paced.

Does ExitCertified deliver group training?

Yes, we provide training for groups, individuals and private on sites. View our group training page for more information.

What does vendor-authorized training mean?

As a vendor-authorized training partner, we offer a curriculum that our partners have vetted. We use the same course materials and facilitate the same labs as our vendor-delivered training. These courses are considered the gold standard and, as such, are priced accordingly.

Is the training too basic, or will you go deep into technology?

It depends on your requirements, your role in your company, and your depth of knowledge. The good news about many of our learning paths, you can start from the fundamentals to highly specialized training.

How up-to-date are your courses and support materials?

We continuously work with our vendors to evaluate and refresh course material to reflect the latest training courses and best practices.

Are your instructors seasoned trainers who have deep knowledge of the training topic?

ExitCertified instructors have an average of 27 years of practical IT experience. They have also served as consultants for an average of 15 years. To stay up to date, instructors will at least spend 25 percent of their time learning new emerging technologies and courses.

Do you provide hands-on training and exercises in an actual lab environment?

Lab access is dependent on the vendor and the type of training you sign up for. However, many of our top vendors will provide lab access to students to test and practice. The course description will specify lab access.

Will you customize the training for our company’s specific needs and goals?

We will work with you to identify training needs and areas of growth. We offer a variety of training methods, such as private group training, on-site of your choice, and virtually. We provide courses and certifications that are aligned with your business goals.

How do I get started with certification?

Getting started on a certification pathway depends on your goals and the vendor you choose to get certified in. Many vendors offer entry-level IT certification to advanced IT certification that can boost your career. To get access to certification vouchers and discounts, please contact customerexp@exitcertified.com.

Will I get access to content after I complete a course?

You will get access to the PDF of course books and guides, but access to the recording and slides will depend on the vendor and type of training you receive.

How to request a W9 for ExitCertified LLC?

View our filing status and how to request a W9.

my experince was great from the day i regetered to the actuall day of the class.

ExitCertified Student

ExitCertified

Good training. A lot to take in for the short amount of time we have though

ExitCertified Student

ExitCertified

The tool provided to practice the course teachings is very functional and easy to use.

ExitCertified Student

ExitCertified

Some Labs are very good but some steps it ask to update but its already updated, but overall its very good training.

ExitCertified Student

ExitCertified

I registered a day before class and am happy that I received all the materials and links in time for the class. Thanks.

ExitCertified Student

ExitCertified

Optimizing Apache Spark™ on Databricks

Overview

Schedule

FAQ

Reviews

Skills Gained

Prerequisites

Course Details

Course Outline

When does class start/end?

Does the course schedule include a Lunchbreak?

How can someone reach me during class?

What languages are used to deliver training?

What does GTR stand for?

How do I find an ExitCertified training location?

Which delivery formats are available?

Does ExitCertified deliver group training?

What does vendor-authorized training mean?

Is the training too basic, or will you go deep into technology?

How up-to-date are your courses and support materials?

Are your instructors seasoned trainers who have deep knowledge of the training topic?

Do you provide hands-on training and exercises in an actual lab environment?

Will you customize the training for our company’s specific needs and goals?

How do I get started with certification?

Will I get access to content after I complete a course?

How to request a W9 for ExitCertified LLC?

Drag & Drop a File Here

Alert!

Modal Title

Error!

Default Title

Prompt

Confirm

Login

Optimizing Apache Spark™ on Databricks

Upcoming Course Dates

Overview

Schedule

FAQ

Reviews

Skills Gained

Prerequisites

Course Details

Course Outline

When does class start/end?

Does the course schedule include a Lunchbreak?

How can someone reach me during class?

What languages are used to deliver training?

What does GTR stand for?

How do I find an ExitCertified training location?

Which delivery formats are available?

Does ExitCertified deliver group training?

What does vendor-authorized training mean?

Is the training too basic, or will you go deep into technology?

How up-to-date are your courses and support materials?

Are your instructors seasoned trainers who have deep knowledge of the training topic?

Do you provide hands-on training and exercises in an actual lab environment?

Will you customize the training for our company’s specific needs and goals?

How do I get started with certification?

Will I get access to content after I complete a course?

How to request a W9 for ExitCertified LLC?

Drag & Drop a File Here

Alert!

Modal Title

Error!

Default Title

Prompt

Confirm

Login