Cloudera Search Training

  • Tuition: USD $1,815 (GSA: $1,554.41)
  • Reviews: 4.5 out of 5 stars (1451 Ratings)
  • Course Code: CLOUD-SEARCH
  • Available Formats: Self Paced

This OnDemand offering provides you with a 180-day subscription that begins on the date of purchase.

Prerequisites

This course is intended for developers and data engineers with at least basic familiarity with Hadoop and experience programming in a general-purpose language such as Java, C, C++, Perl, or Python. Participants should be comfortable with the Linux command line and should be able to perform basic tasks such as creating and removing directories, viewing and changing file permissions, executing scripts, and examining file output. No prior experience with Apache Solr or Cloudera Search is required, nor is any experience with HBase or SQL.

Course Details

This course includes video lectures, assessments, and hands-on exercise access. Participants will navigate the Hadoop ecosystem, learning topics such as:

  • Performing batch indexing of data stored in HDFS and HBase
  • Indexing streaming data in near-real-time with Flume
  • Indexing content in multiple languages and file formats
  • Processing and transforming incoming data with Morphlines
  • Creating a user interface for an index using Hue
  • Integrating Cloudera Search with external applications
  • Improving the search experience using faceting, highlighting, and spelling correction

Subscription Details

This OnDemand offering provides you with a 180-day subscription that begins on the date of purchase. While the subscription is active, you will have unlimited access to the course training materials, which include recorded course lectures and demonstrations, assessment components, and hands-on exercise instructions. You will also receive 15 runtime hours of access to the online hands-on exercise environment, accessible through a web browser. You can start the exercise environment when you are ready to use it. You can stop or pause it when you are done for the time being, then return anytime to continue where you left off. The exercise environment remains accessible until you have used the runtime hours or the subscription period ends, whichever occurs first.

Overview of Cloudera Search

  • What is Cloudera Search?
  • Helpful Features
  • Use Cases
  • Basic Architecture

Performing Basic Queries

  • Executing a Query in the Admin UI
  • Basic Syntax
  • Techniques for Approximate Matching
  • Controlling Output

Writing More Powerful Queries

  • Relevancy and Filters
  • Query Parsers
  • Functions
  • Geospatial Search
  • Faceting

Preparing to Index Documents

  • Overview of the Indexing Process
  • Understanding Morphlines
  • Generating Configuration Files
  • Schema Design
  • Collection Management

Batch Indexing HDFS Data with MapReduce

  • Overview of the HDFS Batch Indexing Process
  • Using the MapReduce Indexing Tool
  • Testing and Troubleshooting

Near-Real-Time Indexing with Flume

  • Overview of the Near-Real-Time Indexing Process
  • Introduction to Apache Flume
  • How to Perform Near-Real-Time Indexing with Flume
  • Testing and Troubleshooting

Indexing HBase Data with Lily

  • What is Apache HBase?
  • Batch Indexing for HBase
  • Indexing HBase Tables in Near-Real-Time

Indexing Data in Other Languages and Formats

  • Field Types and Analyzer Chains
  • Word Stemming, Character Mapping, and Language Support
  • Schema and Analysis Support in the Admin UI
  • Metadata and Content Extraction with Apache Tika
  • Indexing Binary File Types with SolrCell

Improving Search Quality and Performance

  • Delivering Relevant Results
  • Helping Users Find Information
  • Query Performance and Troubleshooting

Building User Interfaces for Search

  • Search UI Overview
  • Building a User Interface with Hue
  • Integrating Search into Custom Applications

Considerations for Deployment

  • Planning for Deployment
  • Determining Hardware Needs
  • Security Overview
  • Collection Aliasing

When does class start/end?

Classes begin promptly at 9:00 am, and typically end at 5:00 pm.

Does the course schedule include a lunch break?

Lunch is normally an hour long and begins at noon. Coffee, tea, hot chocolate and juice are available all day in the kitchen. Fruit, muffins and bagels are served each morning. There are numerous restaurants near each of our centers, and some popular ones are indicated on the Area Map in the Student Welcome Handbooks, which can be picked up in the lobby or requested from one of our ExitCertified staff.

How can someone reach me during class?

If someone should need to contact you while you are in class, please have them call the center telephone number and leave a message with the receptionist.

What languages are used to deliver training?

Most courses are conducted in English, unless otherwise specified. Some courses will have the word "FRENCH" marked in red beside the scheduled date(s) indicating the language of instruction.

What does GTR stand for?

GTR stands for Guaranteed to Run; if you see a course with this status, it means this event is confirmed to run. View our GTR page to see our full list of Guaranteed to Run courses.

Does ExitCertified deliver group training?

Yes, we provide training for groups and individuals, as well as private on-site sessions. View our group training page for more information.

The course provided the essentials in preparing for AWS certification. Material was well organized and utilized during topic discussions.

This was an excellent and very informative class. I will recommend it to others. Thank you.

The instructor really took his time and made sure I was able to understand the concepts.

Team is very good to organize things, reply immediately to coordinate and advise.

Better arrangement was made to make sure we receive necessary tools to attend the course days before the date, this saves time to join.

Contact Us 1-800-803-3948