Apache Spark Programming

  • Tuition USD $2,000
  • Reviews 4.5 out of 5 stars (508 Ratings)
  • Course Code DB105
  • Duration 3 days
  • Available Formats Classroom, Virtual

This 3-day course provides a thorough review of the Apache Spark framework, including the Spark fundamentals. Through lecture and hands-on labs, it places specific emphasis on skills development and the unique needs of a Data Engineering team.

This course is combined with DB 100 - Apache Spark Overview to provide a comprehensive overview of the Apache Spark framework for Data Engineers.

After working through the Apache Spark fundamentals on the first day, the following days continue with more advanced APIs and techniques, such as a review of specific Readers & Writers, broadcast table joins, additional SQL functions, and more hands-on labs. Additionally, the Structured Streaming demos from day 1 are replaced with broader, streaming-specific lectures and labs.
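
The broadcast table join mentioned above can be illustrated without a cluster. The following is a minimal plain-Python sketch of the idea, not Spark's API: it assumes a small lookup table that fits in memory and a large table already split into partitions, and the names `broadcast_hash_join`, `orders_partitions`, and `countries` are hypothetical. In PySpark itself, the corresponding hint is `pyspark.sql.functions.broadcast` applied to the smaller DataFrame in a join.

```python
from typing import Dict, Iterable, List, Tuple

Row = Tuple[str, int]  # (country_code, amount): a hypothetical row of the large table


def broadcast_hash_join(
    partitions: Iterable[List[Row]],
    small_table: Dict[str, str],
) -> List[List[Tuple[str, int, str]]]:
    """Inner-join every partition of a large table against a small lookup table.

    The small table is handed whole to each partition (the "broadcast"), so each
    partition joins locally and no rows of the large table move between partitions.
    """
    joined = []
    for part in partitions:
        # Each partition probes its own copy of the lookup table by key.
        local = [
            (code, amount, small_table[code])
            for code, amount in part
            if code in small_table  # inner join: drop rows with no match
        ]
        joined.append(local)
    return joined


# Hypothetical data: two partitions of a large "orders" table and a small lookup.
orders_partitions = [
    [("US", 10), ("CA", 5)],
    [("FR", 7), ("XX", 1)],
]
countries = {"US": "United States", "CA": "Canada", "FR": "France"}

result = broadcast_hash_join(orders_partitions, countries)
```

The design point is that only the small table is copied; the large table's rows stay in the partition where they already live, which is why this kind of join avoids a shuffle when one side is small.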

Throughout the three-day course, participants are also introduced to more of the Apache Spark architecture. Topics include, but are not limited to, the DAG execution model, an introduction to the Catalyst Optimizer, and Spark partitioning.

Who Can Benefit

  • This course is ideal for Data Engineers who are new to Apache Spark or who have been using Apache Spark for less than one year
  • This course is suitable for SQL Analysts seeking to grow beyond simple SQL queries and into the use of the DataFrame APIs
  • This course is suitable for Data Analysts, Data Scientists, and ML Practitioners who have a stronger engineering background and would like to benefit from a deeper understanding of the architecture and APIs

Prerequisites

  • Knowledge of SQL is helpful
  • Experience with either Python or Scala is required
  • Some familiarity with Apache Spark or other big-data processing frameworks is helpful but not required

Course Details

Software & Hardware Requirements

  • Web Browser: Chrome
  • An Internet Connection
  • GoToTraining (for remote classes only)
  • A computer, laptop, or tablet with a keyboard

Course Outline

  • About Databricks, Spark
  • A high-level overview of the Spark Architecture
  • Spark Entry Points, Simple Data Ingestion & Overview of API Docs
  • Hands-on practice with different data ingestion options
  • Hands-on practice with the DataFrames APIs
  • Introduction to Spark's execution model
  • Hands-on practice with performance optimization
  • Introduction to Structured Streaming
  • Introduction to Machine Learning Pipelines

How do I enroll?

A comprehensive listing of ExitCertified courses can be found here. You can register directly for the required course/location when you select "register". If you have any questions or prefer to speak with an ExitCertified education consultant directly, please submit your query here. A representative will contact you shortly.

How do I pay for a class?

You can pay at the time of registration by credit card (Mastercard/Visa/American Express), cheque, or PO.

What if I have training credits?

ExitCertified honors all savings programs from the partners we work with. ExitCertified also offers training credits across multiple partners through our FLEX Account.

When does class start/end?

Classes begin promptly at 9:00 am, and typically end at 5:00 pm.

Lunchtime?

Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate, and juice are available all day in the kitchen. Fruit, muffins, and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbooks, which can be picked up in the lobby or requested from one of our ExitCertified staff.

How can someone reach me during class?

If someone should need to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

Well organized course, supporting materials, and instructor. Good intro to AWS for various levels of participants - managers to developers.

Course content and labs were helpful. The instructor was knowledgeable and easy to understand.

Joel Rosenblum is an excellent instructor. Would gladly take more courses he taught. Extremely knowledgeable with the content and real world experience with the tech. Gotcha issues provided by him to look for in tech were very useful. Very friendly and available during and after class time.

Tom Peters was very helpful initially and once the voucher was confirmed valid. Tom P required Kyle Banas to approve my voucher, which was earned for selling a certain quantity of M365 licenses. I requested confirmation that the voucher was valid less than 3 months from notification. Tom P CC'd Kyle B on email to confirm. Kyle took several weeks and MANY email requests without ever responding to the request. Tom P eventually approved the voucher.

Content was excellent. Labs and lab environment were excellent.

Bill has been an outstanding instructor. He has the right balance of knowledge and skills to navigate this entire process in a timely manner. Thanks a lot, Bill.

The lab infrastructure is very suitable and works well; it helps you better understand the concepts presented during the training.

4 options available

  • Aug 26, 2020 to Aug 28, 2020 (3 days), GTR
    Location: iMVP
    Language: English
    Time: 9:00AM to 5:00PM EDT
  • Sep 30, 2020 to Oct 2, 2020 (3 days)
    Location: iMVP
    Language: English
    Time: 9:00AM to 5:00PM EDT
  • Nov 4, 2020 to Nov 6, 2020 (3 days)
    Location: iMVP
    Language: English
    Time: 9:00AM to 5:00PM PST
  • Dec 9, 2020 to Dec 11, 2020 (3 days)
    Location: iMVP
    Language: English
    Time: 9:00AM to 5:00PM EST
Contact Us 1-800-803-3948