Summer-Ready Savings: Find the Training Course You Need at a Price You'll Love

closeClose

Cloudera Data Analyst Training

  • Tuition USD $3,195 GSA  $2,736.27
  • Reviews star_rate star_rate star_rate star_rate star_half 329 Ratings
  • Course Code DATA-ANALYST
  • Duration 4 days
  • Available Formats Classroom, Virtual

Apache Hive makes transformation and analysis of complex, multi-structured data scalable in Hadoop. Apache Impala enables real-time interactive analysis of the data stored in Hadoop using a native SQL environment. Together, they make multi-structured data accessible to analysts, database administrators, and others without Java programming expertise.

Prerequisites

This course is designed for data analysts, business intelligence specialists, developers, system architects, and database administrators. Some knowledge of SQL is assumed, as is basic Linux command-line familiarity. Prior knowledge of Apache Hadoop is not required.

Course Details

Introduction

Apache Hadoop Fundamentals

  • The Motivation for Hadoop
  • Hadoop Overview
  • Data Storage: HDFS
  • Distributed Data Processing: YARN, MapReduce, and Spark
  • Data Processing and Analysis: Hive and Impala
  • Database Integration: Sqoop
  • Other Hadoop Data Tools
  • Exercise Scenario Explanation

Introduction to Apache Hive and Impala

  • What Is Hive?
  • What Is Impala?
  • Why Use Hive and Impala?
  • Schema and Data Storage
  • Comparing Hive and Impala to Traditional Databases
  • Use Cases

Querying with Apache Hive and Impala

  • Databases and Tables
  • Basic Hive and Impala Query Language Syntax
  • Data Types
  • Using Hue to Execute Queries
  • Using Beeline (Hive's Shell)
  • Using the Impala Shell

Common Operators and Built-In Functions

  • Operators
  • Scalar Functions
  • Aggregate Functions

Data Management

  • Data Storage
  • Creating Databases and Tables
  • Loading Data
  • Altering Databases and Tables
  • Simplifying Queries with Views
  • Storing Query Results

Data Storage and Performance

  • Partitioning Tables
  • Loading Data into Partitioned Tables
  • When to Use Partitioning
  • Choosing a File Format
  • Using Avro and Parquet File Formats

Working with Multiple Datasets

  • UNION and Joins
  • Handling NULL Values in Joins
  • Advanced Joins

Analytic Functions and Windowing

  • Using Analytic Functions
  • Other Analytic Functions
  • Sliding Windows

Complex Data

  • Complex Data with Hive
  • Complex Data with Impala

Analyzing Text

  • Using Regular Expressions with Hive and Impala
  • Processing Text Data with SerDes in Hive
  • Sentiment Analysis and n-grams in Hive

Apache Hive Optimization

  • Understanding Query Performance
  • Cost-Based Optimization and Statistics
  • Bucketing
  • ORC File Optimizations

Apache Impala Optimization

  • How Impala Executes Queries
  • Improving Impala Performance

Extending Apache Hive and Impala

  • Custom SerDes and File Formats in Hive
  • Data Transformation with Custom Scripts in Hive
  • User-Defined Functions
  • Parameterized Queries

Choosing the Best Tool for the Job

  • Comparing Hive, Impala, and Relational Databases
  • Which to Choose?

Conclusion

Apache Kudu

  • What Is Kudu?
  • Kudu Tables
  • Using Impala with Kudu

How do I enroll?

A comprehensive listing of ExitCertified courses can be found here. You can register directly for the required course/location when you select "register". If you have any questions or prefer to speak with an ExitCertified education consultant directly, please submit your query here. A representative will contact you shortly.

How do I pay for a class?

You can pay at the time of registration using credit card (Mastercard/Visa/American Express) cheque or PO.

What if I have training credits?

ExitCertified honors all savings programs from the partners we work with. ExitCertified also offers training credits across multiple partners through our FLEX Account.

When does class start/end?

Classes begin promptly at 9:00 am, and typically end at 5:00 pm.

Lunchtime?

Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate and juice are available all day in the kitchen. Fruit, muffins and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbooks - these can be picked up in the lobby or requested from one of our ExitCertified staff.

How can someone reach me during class?

If someone should need to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

Warm greeting and excellent help. The eating area is always clean and well stocked

The exit certified aws course provided a good introduction to the tools available on aws.

I just completed a three day course in BigFix usage/administration led by Gary Lehnus. Gary was an excellent instructor. ExitCertified made the online class process painless. I can't speak for their other courses as I have not taken them, but the BigFix course is well worth it.

Tech Data provided me with all the resources required to familiarize myself with the TRIRIGA Lease module. The facilitator was very knowledgeable and provided comprehensive exercises and support to aid in my understanding of the application.

Very clean, great cafeteria and well sorted, very kind staff. The bathrooms have to be expanded as they might get crowded sometimes

7 options found

undo
  • GTR Jul 21, 2020 Jul 24, 2020 (4 days)
    Location
    iMVP
    Language
    English
    Time
    9:00AM 5:00PM EDT
    Enroll
    Enroll
  • Aug 25, 2020 Aug 28, 2020 (4 days)
    Location
    iMVP
    Language
    English
    Time
    9:00AM 5:00PM PDT
    Enroll
    Enroll
  • Sep 8, 2020 Sep 11, 2020 (4 days)
    Location
    Virtual
    Language
    English
    Time
    10:00 am 6:00 pm EDT
    Enroll
    Enroll
  • Sep 29, 2020 Oct 2, 2020 (4 days)
    Location
    iMVP
    Language
    English
    Time
    9:00AM 5:00PM EDT
    Enroll
    Enroll
  • Oct 20, 2020 Oct 23, 2020 (4 days)
    Location
    Virtual
    Language
    English
    Time
    10:00 am 6:00 pm EDT
    Enroll
    Enroll
  • Dec 1, 2020 Dec 4, 2020 (4 days)
    Location
    Virtual
    Language
    English
    Time
    10:00 am 6:00 pm EST
    Enroll
    Enroll
  • Jan 26, 2021 Jan 29, 2021 (4 days)
    Location
    Virtual
    Language
    English
    Time
    10:00 am 6:00 pm EST
    Enroll
    Enroll
Contact Us 1-800-803-3948
Contact Us Live Chat
FAQ Get immediate answers to our most frequently asked qestions. View FAQs arrow_forward