The future of IBM Maximo: Work Centers and Inspections Can Transform Your Business

closeClose

Cloudera Data Scientist Training

  • Tuition USD $3,195
  • Reviews star_rate star_rate star_rate star_rate star_half 482 Ratings
  • Course Code DATA-SCI-TRAIN
  • Duration 4 days
  • Available Formats Classroom, Virtual

This four-day workshop covers data science and machine learning workflows at scale using Apache Spark 2 and other key components of the Hadoop ecosystem. The workshop emphasizes the use of data science and machine learning methods to address real-world business challenges. Using scenarios and datasets from a fictional technology company, students discover insights to support critical business decisions and develop data products to transform the business. The material is presented through a sequence of brief lectures, interactive demonstrations, extensive hands-on exercises, and discussions. The Apache Spark demonstrations and exercises are conducted in Python (with PySpark) and R (with sparklyr) using the Cloudera Data Science Workbench (CDSW) environment. The workshop is designed for data scientists who currently use Python or R to work with smaller datasets on a single machine and who need to scale up their analyses and machine learning models to large datasets on distributed clusters. Data engineers and developers with some knowledge of data science and machine learning may also find this workshop useful.

Skills Gained

  • Overview of data science and machine learning at scale
  • Overview of the Hadoop ecosystem
  • Working with HDFS data and Hive tables using Hue
  • Introduction to Cloudera Data Science Workbench
  • Overview of Apache Spark 2
  • Reading and writing data
  • Inspecting data quality
  • Cleansing and transforming data
  • Summarizing and grouping data
  • Combining, splitting, and reshaping data
  • Exploring data
  • Configuring, monitoring, and troubleshooting Spark applications
  • Overview of machine learning in Spark MLlib
  • Extracting, transforming, and selecting features
  • Building and evaluating regression models
  • Building and evaluating classification models
  • Building and evaluating clustering models
  • Cross-validating models and tuning hyperparameters
  • Building machine learning pipelines
  • Deploying machine learning models
  • Spark, Spark SQL, and Spark MLlib
  • PySpark and sparklyr
  • Cloudera Data Science Workbench (CDSW)
  • Hue

Prerequisites

Workshop participants should have a basic understanding of Python or R and some experience exploring and analyzing data and developing statistical or machine learning models. Knowledge of Hadoop or Spark is not required.

How do I enroll?

A comprehensive listing of ExitCertified courses can be found here. You can register directly for the required course/location when you select "register". If you have any questions or prefer to speak with an ExitCertified education consultant directly, please submit your query here. A representative will contact you shortly.

How do I pay for a class?

You can pay at the time of registration using credit card (Mastercard/Visa/American Express) cheque or PO.

What if I have training credits?

ExitCertified honors all savings programs from the partners we work with. ExitCertified also offers training credits across multiple partners through our FLEX Account.

When does class start/end?

Classes begin promptly at 9:00 am, and typically end at 5:00 pm.

Lunchtime?

Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate and juice are available all day in the kitchen. Fruit, muffins and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbooks - these can be picked up in the lobby or requested from one of our ExitCertified staff.

How can someone reach me during class?

If someone should need to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

The lady at the front really nice. Everthing was always stocked up: Helpful

Great course led by a knowledgeable instructor. Would definitely recommend this course to any one I know that is looking to get an AWS Cert. The 4 day course was a great pace. So glad we had that extra day compared to the 3 day course. It really helped a lot and was really worth it. Thank you!

The course material and instructor were very good. easy to follow, lab was setup nicely and was able to complete most of the lab material.
The Koretex App is absolute garbage and very cumbersome to use on a tablet/ipad. A PDF file would be 100 times better than that atrocious app.
As a recommendation this class should be 5 days instead of 4 as some chapters had to be rushed.

Company offers excellent training course options, that helps with your career advancement.

Everything went well. Enrollment process was easy and I always felt welcomed throughout the whole experience.

3 options available

undo
  • Sep 15, 2020 Sep 18, 2020 (4 days)
    Location
    Virtual
    Language
    English
    Time
    10:00 am 6:00 pm EDT
    Enroll
    Enroll
  • Oct 27, 2020 Oct 30, 2020 (4 days)
    Location
    Virtual
    Language
    English
    Time
    10:00 am 6:00 pm EDT
    Enroll
    Enroll
  • Dec 8, 2020 Dec 11, 2020 (4 days)
    Location
    Virtual
    Language
    English
    Time
    10:00 am 6:00 pm EST
    Enroll
    Enroll
Contact Us 1-800-803-3948
Contact Us Live Chat
FAQ Get immediate answers to our most frequently asked qestions. View FAQs arrow_forward