The future of IBM Maximo: Work Centers and Inspections Can Transform Your Business


Hadoop Data Management with Hive, Pig, and SAS(R)

  • Tuition USD $1,950 GSA  $1,768.26
  • Reviews star_rate star_rate star_rate star_rate star_half 486 Ratings
  • Course Code DIHPS
  • Duration 0 hours
  • Available Formats Classroom, Virtual
In this course, you use processing methods to prepare structured and unstructured big data for analysis. You learn to organize this data into structured tabular form using Apache Hive and Apache Pig. You also learn SAS software technology and techniques that integrate with Hive and Pig and how to leverage these open source capabilities by programming with Base SAS and SAS/ACCESS Interface to Hadoop, and with SAS Data Integration Studio.

The Extended Learning page for this course includes the option to purchase Virtual Lab time to practice.

The e-learning format of this course also includes the option to purchase Virtual Lab time to practice.

Skills Gained

  • Move data into the Hadoop ecosystem.
  • Use Hive to design a data warehouse in Hadoop.
  • Perform data analysis using Hive Query Language.
  • Join data sources.
  • Perform extract, load, and transformation.
  • Organize data in Hadoop by usage.
  • Perform analysis on unstructured data using Apache Pig.
  • Join massive data sets using Pig.
  • Use user-defined functions (UDFs).
  • Analyze big data in Hadoop using Hive and Pig.
  • Use SAS programming to submit Hive and Pig programs that execute in Hadoop and store results in Hadoop or return results to SAS.
  • Use SAS programming to move data between the SAS server and the Hadoop Distributed File System (HDFS).
  • Construct SAS Data Integration Studio jobs that integrate with Hive and Pig processes and the HDFS.

Who Can Benefit

  • Data scientists and programmers, database administrators, applications developers, and ETL developers who are looking for an in-depth technical overview of data management and extraction for big data and the Hadoop ecosystem


  • A basic understanding of and experience with UNIX and SQL is preferred. For advanced topics such as user-defined functions, prior programming experience is necessary.

Course Details

The Apache Hadoop Project

  • Overview of the big data ecosystem.
  • Hadoop essentials.

Hive and HiveQL

  • Apache Hive overview.
  • Data definition language.
  • Data manipulation language.

Pig and Pig Latin

  • Apache Pig overview.
  • Apache Pig programming.
  • Advanced Apache Pig programming.
  • Pig programming recommendations.

SAS and Hadoop

  • SAS technology for Hadoop overview.
  • Programming with Base SAS and SAS/ACCESS.
  • SAS Data Integration Studio.
  • DS2 and the code accelerator for Hadoop.
  • SAS In-Memory Analytics.

How do I enroll?

A comprehensive listing of ExitCertified courses can be found here. You can register directly for the required course/location when you select "register". If you have any questions or prefer to speak with an ExitCertified education consultant directly, please submit your query here. A representative will contact you shortly.

How do I pay for a class?

You can pay at the time of registration using credit card (Mastercard/Visa/American Express) cheque or PO.

What if I have training credits?

ExitCertified honors all savings programs from the partners we work with. ExitCertified also offers training credits across multiple partners through our FLEX Account.

When does class start/end?

Classes begin promptly at 9:00 am, and typically end at 5:00 pm.


Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate and juice are available all day in the kitchen. Fruit, muffins and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbooks - these can be picked up in the lobby or requested from one of our ExitCertified staff.

How can someone reach me during class?

If someone should need to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

Excellent class overall! The Instructor and the course material were the best so far, and I have taken a few AWS classes. I highly recommend it- Architecting on AWS.

Great personnel and facility. I just should have been told i was the only person physically in thr class, and that there was an option to attend the class remotely.

I just completed a three day course in BigFix usage/administration led by Gary Lehnus. Gary was an excellent instructor. ExitCertified made the online class process painless. I can't speak for their other courses as I have not taken them, but the BigFix course is well worth it.

Very good overview and Intro to AWS. Just the right amount of information.

I really enjoyed the course. It was well designed given that it was online. I would recommend that the higher level courses be taken on-site, but this course was very thorough for those looking to get started with an AWS Certification.

2 options available

  • Oct 6, 2020 Oct 8, 2020 (3 days)
    Cary, NC
    9:00 AM 5:00 PM EST
  • Dec 14, 2020 Dec 18, 2020 (5 days)
    1:00 PM 4:30 PM EST
Contact Us 1-800-803-3948
Contact Us Live Chat
FAQ Get immediate answers to our most frequently asked qestions. View FAQs arrow_forward