Advanced Methods in Data Science and Big Data Analytics

  • Tuition USD $5,000
  • Reviews 4.5 out of 5 stars (2915 Ratings)
  • Course Code 4192
  • Duration 5 days
  • Available Formats Classroom

This course builds on skills developed in the Data Science and Big Data Analytics course. The main focus areas are Hadoop (including Pig, Hive, and HBase), natural language processing, social network analysis, simulation, random forests, multinomial logistic regression, and data visualization. Taking a technology-neutral approach, the course uses several open-source tools to address big data challenges.

Skills Gained

  • MapReduce functionality
  • NoSQL databases and Hadoop Ecosystem tools for analyzing large-scale, unstructured data sets
  • Natural language processing, social network analysis, and data visualization concepts
  • Use advanced quantitative methods and apply one of them in a Hadoop environment
  • Apply advanced techniques to real-world datasets in a final lab

Who Can Benefit

  • Aspiring data scientists
  • Data analysts who have completed the associate-level Data Science and Big Data Analytics course
  • Computer scientists who want to learn MapReduce and methods for analyzing unstructured data such as text

Course Details

1. MapReduce and Hadoop

  • The MapReduce Framework
  • Apache Hadoop
  • Hadoop Distributed File System
  • YARN

2. Hadoop Ecosystem and NoSQL

  • Hadoop Ecosystem
  • Pig
  • Hive
  • NoSQL (Not only SQL)
  • HBase
  • Spark

3. Natural Language Processing

  • Introduction to NLP
  • Text Preprocessing
  • TF-IDF
  • Beyond Bag of Words
  • Language Modeling
  • POS Tagging and HMM
  • Sentiment Analysis and Topic Modeling

4. Social Network Analysis

  • Introduction to SNA and Graph Theory
  • Most Important Nodes
  • Communities and Small World
  • Network Problems and SNA Tools

5. Data Science Theory and Methods

  • Simulation
  • Random Forests
  • Multinomial Logistic Regression

6. Data Visualization

In addition to lectures and demonstrations, this course includes labs designed to give you practical experience.

When does class start/end?

Classes begin promptly at 9:00 am and typically end at 5:00 pm.

Does the course schedule include a lunch break?

Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate, and juice are available all day in the kitchen. Fruit, muffins, and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbook, which can be picked up in the lobby or requested from a member of our ExitCertified staff.

How can someone reach me during class?

If someone needs to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.

What languages are used to deliver training?

Most courses are conducted in English unless otherwise specified. Some courses have the word "FRENCH" marked in red beside the scheduled date(s), indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run. A course with this status is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

Does ExitCertified deliver group training?

Yes, we provide training for groups and individuals, as well as private on-site training. View our group training page for more information.

This was an excellent class to get me up to speed quickly on AWS Solutions Architect concepts.

The training was conducted smoothly and with the required hands-on exercises.

The course was a good refresher and also updated me on some newer AWS products. I would have preferred that we were supplied with training materials that we could keep to review while studying for the exam.

It was well organized and planned. That gave me a memorable learning experience.

Had a good time taking the course with ExitCertified. The Data Science course with Python was really well laid out, and it was very easy to understand and navigate. The setup using Anaconda was really good, and it was easy to follow along and do the practice exercises.

There are currently no scheduled dates for this course. If you are interested in this course, request a course date with the links above. We can also contact you when the course is scheduled in your area.

Contact Us 1-800-803-3948