Winter Savings - Save on IT Training Using Promo Code FROSTBYTE


Hadoop Data Management with Hive, Pig, and SAS(R)

  • Tuition USD $2,175 GSA  $1,768.26
  • Reviews star_rate star_rate star_rate star_rate star_half 2019 Ratings
  • Course Code DIHPS
  • Available Formats Classroom
In this course, you use processing methods to prepare structured and unstructured big data for analysis. You learn to organize this data into structured tabular form using Apache Hive and Apache Pig. You also learn SAS software technology and techniques that integrate with Hive and Pig and how to leverage these open source capabilities by programming with Base SAS and SAS/ACCESS Interface to Hadoop, and with SAS Data Integration Studio.

The Extended Learning page for this course includes the option to purchase Virtual Lab time to practice.

The e-learning format of this course also includes the option to purchase Virtual Lab time to practice.

Skills Gained

  • Move data into the Hadoop ecosystem.
  • Use Hive to design a data warehouse in Hadoop.
  • Perform data analysis using Hive Query Language.
  • Join data sources.
  • Perform extract, load, and transformation.
  • Organize data in Hadoop by usage.
  • Perform analysis on unstructured data using Apache Pig.
  • Join massive data sets using Pig.
  • Use user-defined functions (UDFs).
  • Analyze big data in Hadoop using Hive and Pig.
  • Use SAS programming to submit Hive and Pig programs that execute in Hadoop and store results in Hadoop or return results to SAS.
  • Use SAS programming to move data between the SAS server and the Hadoop Distributed File System (HDFS).
  • Construct SAS Data Integration Studio jobs that integrate with Hive and Pig processes and the HDFS.

Who Can Benefit

  • Data scientists and programmers, database administrators, applications developers, and ETL developers who are looking for an in-depth technical overview of data management and extraction for big data and the Hadoop ecosystem


  • A basic understanding of and experience with UNIX and SQL is preferred. For advanced topics such as user-defined functions, prior programming experience is necessary.

Course Details

The Apache Hadoop Project

  • Overview of the big data ecosystem.
  • Hadoop essentials.

Hive and HiveQL

  • Apache Hive overview.
  • Data definition language.
  • Data manipulation language.

Pig and Pig Latin

  • Apache Pig overview.
  • Apache Pig programming.
  • Advanced Apache Pig programming.
  • Pig programming recommendations.

SAS and Hadoop

  • SAS technology for Hadoop overview.
  • Programming with Base SAS and SAS/ACCESS.
  • SAS Data Integration Studio.
  • DS2 and the code accelerator for Hadoop.
  • SAS In-Memory Analytics.

When does class start/end?

Classes begin promptly at 9:00 am, and typically end at 5:00 pm.

Does the course schedule include a Lunchbreak?

Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate and juice are available all day in the kitchen. Fruit, muffins and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbooks - these can be picked up in the lobby or requested from one of our ExitCertified staff.

How can someone reach me during class?

If someone should need to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

Does ExitCertified deliver group training?

Yes, we provide training for groups, individuals and private on sites. View our group training page for more information.

Does ExitCertified deliver group training?

Yes, we provide training for groups, individuals, and private on sites. View our group training page for more information.

ExitCertified is a great way to gain hands-on experience through their virtual learning environment.

Course was good over all. Got a good idea about hadoop and how to think about available aws resources connecting to that.

Facilities were very clean, coffee was nice to have, snack selection was refreshing

Your team appears to be VERY experienced and can present and discuss ALL topics. I am very pleased with my learning experience with ExitCertified. Thank you!

I would highly recommend this instructor. But would appreciate the Actual Slide material to refresh. Also exam prep doc would be an advantage

0 options available

There are currently no scheduled dates for this course. If you are interested in this course, request a course date with the links above. We can also contact you when the course is scheduled in your area.

Contact Us 1-800-803-3948
Contact Us Live Chat
FAQ Get immediate answers to our most frequently asked qestions. View FAQs arrow_forward