This hands-on self-paced training course targets Data Engineers who want to process big data using Apache Spark™ Structured Streaming. The course ends with a capstone project building a complete data streaming pipeline using structured streaming.
Use the interactive Databricks notebook environment
Ingest streaming log file data
Aggregate small batches of data with time windows
Stream data from a Kafka connection
Use Structured Streaming in conjunction with Databricks Delta
Visualize streaming live data
Use Structured Streaming to analyze streaming Twitter data
Getting Started with Apache Spark™ DataFrames self-paced course (optional, but strongly encouraged)
Structured Streaming Concepts
Supported platforms include Azure Databricks, Databricks Community Edition, and non-Azure Databricks.
If you're planning to use the course on Azure Databricks, select the "Azure Databricks" Platform option.
If you're planning to use the course on Databricks Community Edition or on a non-Azure version of Databricks, select the "Other Databricks" Platform option.
The course is a series of five self-paced lessons plus a final capstone project building a complete data pipeline using Structured Streaming. Each lesson includes hands-on exercises.
https://www.exitcertified.com/training/databricks/structured-streaming-56306-detail.htmlSTRUC-STREAM-SELFStructured Streaminghttps://assets.exitcertified.com/assets/CourseImages/c7140964c1/AdobeStock_136168319__FitMaxWzEwMDAsMTAwMF0.jpg75.00USDInStock/Training/DatabricksThis hands-on self-paced training course targets Data Engineers who want to process big data using Apache Spark™ Structured...75.00DatabricksSelf Paced2019-03-21T10:12:12+00:00USD