Databricks Delta

Course Details
Code: DB-DELTA-SELF
Tuition (USD): $75.00 • Self Paced

This hands-on self-paced training course targets Data Engineers, Data Scientists and Data Analysts who want to use Databricks Delta for ETL processing on Data Lakes. The course ends with a capstone project building a complete data pipeline using Databricks Delta.

Skills Gained

  • Use the interactive Databricks notebook environment.
  • Use Databricks Delta to create, append and upsert data into a Data Lake.
  • Use Databricks Delta to manage and extract actionable insights out of a Data Lake.
  • Use Databricks Delta's advanced optimization features to speed up queries.
  • Use Databricks Delta to seamlessly ingest streaming and historical data.
  • Implement a Databricks Delta data pipeline architecture.

Prerequisites

  • Completion of the Getting Started with Apache Spark™ SQL, Getting Started with Apache Spark™ DataFrames, or ETL Part 1 course, or equivalent knowledge.

Course Outline

  • Introducing Delta
  • Create
  • Append
  • Upsert
  • Streaming
  • Architecture
  • Capstone Project

Platforms

Supported platforms include Azure Databricks, Databricks Community Edition, and non-Azure Databricks.

  • If you're planning to use the course on Azure Databricks, select the "Azure Databricks" Platform option.
  • If you're planning to use the course on Databricks Community Edition or on a non-Azure version of Databricks, select the "Other Databricks" Platform option.

Format

The course is a series of seven self-paced lessons plus a final capstone project in which you build a complete data pipeline using Databricks Delta. Each lesson includes hands-on exercises.