Cloudera Educational Services' four-day Data Analyst Training course will teach you to apply traditional data analytics and business intelligence skills to big data. This course presents the tools data professionals need to access, manipulate, transform, and analyze complex data sets using SQL and familiar scripting languages.
How the open source ecosystem of big data tools addresses challenges not met by traditional RDBMSs
Using Apache Hive and Apache Impala to provide SQL access to data
Hive and Impala syntax and data formats, including functions and subqueries
Create, modify, and delete tables, views, and databases; load data; and store results of queries
Create and use partitions and different file formats
Combining two or more datasets using JOIN or UNION, as appropriate
What analytic and windowing functions are, and how to use them
Store and query complex or nested data structures
Process and analyze semi-structured and unstructured data
Techniques for optimizing Hive and Impala queries
Extending the capabilities of Hive and Impala using parameters, custom file formats and SerDes, and external scripts
How to determine whether Hive, Impala, an RDBMS, or a mix of these is best for a given task
Who Can Benefit
This course is designed for data analysts, business intelligence specialists, developers, system architects, and database administrators. Some knowledge of SQL is assumed, as is basic Linux command-line familiarity. Prior knowledge of Apache Hadoop is not required.
Apache Hadoop Fundamentals
The Motivation for Hadoop
Data Storage: HDFS
Distributed Data Processing: YARN, MapReduce, and Spark
Data Processing and Analysis: Hive and Impala
Database Integration: Sqoop
Other Hadoop Data Tools
Exercise Scenario Explanation
Introduction to Apache Hive and Impala
What Is Hive?
What Is Impala?
Why Use Hive and Impala?
Schema and Data Storage
Comparing Hive and Impala to Traditional Databases
Classes begin promptly at 9:00 am, and typically end at 5:00 pm.
Does the course schedule include a lunch break?
Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate and juice are available all day in the kitchen. Fruit, muffins and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbooks - these can be picked up in the lobby or requested from one of our ExitCertified staff.
How can someone reach me during class?
If someone should need to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.
What languages are used to deliver training?
Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.
What does GTR stand for?
GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.
Does ExitCertified deliver group training?
Yes, we provide training for groups and individuals, as well as private on-site training. View our group training page for more information.
What does vendor-authorized training mean?
As a vendor-authorized training partner, we offer a curriculum that our partners have vetted. We use the same course materials and facilitate the same labs as our vendor-delivered training. These courses are considered the gold standard and, as such, are priced accordingly.
Is the training too basic, or will you go deep into technology?
It depends on your requirements, your role in your company, and your depth of knowledge. The good news is that many of our learning paths let you progress from the fundamentals to highly specialized training.
How up-to-date are your courses and support materials?
We continuously work with our vendors to evaluate and refresh course material to reflect the latest training courses and best practices.
Are your instructors seasoned trainers who have deep knowledge of the training topic?
ExitCertified instructors have an average of 27 years of practical IT experience. They have also served as consultants for an average of 15 years. To stay up to date, instructors spend at least 25 percent of their time learning new and emerging technologies and courses.
Do you provide hands-on training and exercises in an actual lab environment?
Lab access is dependent on the vendor and the type of training you sign up for. However, many of our top vendors will provide lab access to students to test and practice. The course description will specify lab access.
Will you customize the training for our company’s specific needs and goals?
We will work with you to identify training needs and areas of growth. We offer a variety of training methods, such as private group training, on-site training at a location of your choice, and virtual delivery. We provide courses and certifications that are aligned with your business goals.
How do I get started with certification?
Getting started on a certification pathway depends on your goals and the vendor you choose to get certified with. Many vendors offer certifications ranging from entry-level to advanced that can boost your career. To get access to certification vouchers and discounts, please contact firstname.lastname@example.org.
Will I get access to content after I complete a course?
You will get access to the PDF of course books and guides, but access to the recording and slides will depend on the vendor and type of training you receive.
Joel was great and handled our questions with great knowledge and professionalism.
Have been working in Cloudera for several months now, but haven't completely understood the animals and when to use as well as how things are working in the background. This class answered those questions.
Eric has been great at explaining and guiding the course. He is very knowledgeable and able to help tie the topics and concepts shared in the class back to our real life use cases.
Instructor was well-prepared and explained topics in much detail. Was also very responsive to questions asked by the audience.
Not every topic applied to my job but many topics applied. Joel spent a lot of time explaining how processing worked with the different tools which was also valuable!