Core Solr, September 25-26, 2017

Days: September 25-26, 2017

Time: 9:00 AM to 1:00 PM EDT each day

Cost: $720 (early bird until July 25) $800 after / participant

Overview

Comprehensive 2-day sessions (two 4-hour sessions), this Solr online class is taught by  Rafal Kuć a seasoned Solr instructor and consultant from Sematext, the author of several Solr and Elasticsearch books and frequent conference speaker. The training is held online from 9:00 am – 1:00 pm (ET). After taking this course you will be able to:
  • configure and deploy Solr
  • run a wide range of queries including queries with facets and aggregations
  • index documents with Solr.
You will learn about inverted index, about Solr schema, analysis, tokens, token filters, highlighting, query parsing, and so on – see course outline below for more. Each section is followed by a lab with multiple hands-on exercises.

Who Should Attend

The course is designed for technical attendees of any knowledge level and is aimed at those who need to configure, tune and manage Solr and have only basic Solr knowledge. No prior Solr experience is required. Experience with Linux systems is not a must, but basic familiarity with running shell commands (e.g., using curl command) will make the course more enjoyable.

Prerequisites

None, just desire to learn!

Why Attend

The virtual Solr training gives you and your team the skills needed to successfully use Solr capabilities by improving your workflow and increasing efficiency.  Further benefits:
  • a customized learning experience
  • same high-quality instruction as our public or private Elasticsearch classes
  • more affordable than public training
  • more flexible – no need to travel

Things to Remember

For the online training all participants must use their own computer with OSX, Linux, or Windows, with the latest version of Java installed.  Participants should be comfortable using a terminal / command line. Sematext provides:
  • a digital copy of the training material
  • a VM with all configs, scripts, exercises, etc.

Course Outline

Modules

  1. Getting Started with Solr
    • What is Apache Solr
    • General principles
    • Architecture types
  2. Introduction to Solr
    • Starting Solr with schema-less configuration
    • Inverted index
    • Relevancy basics
    • Indexing documents
    • Retrieving documents by identifier
    • Searching for documents
    • Deleting documents
    • Lab
      • Using start scripts
      • Working with configuration
      • CRUD operations
  3. Indexing Data
    • Data structure
    • Index structure configuration
    • Defining custom field types
    • String vs Text based types
    • Basic field usage examples
    • Tokenizers
    • Char filters
    • Filters
    • Language oriented data
    • Dynamic fields
    • Copy fields
    • Running Solr with our own configuration
    • XML data format explained
    • JSON data format explained
    • CSV data format explained
    • Batch indexing
    • Doc values
    • Additional field properties
    • Nested documents support
    • Lab
      • Creating fields and types structure
      • Using copy fields
      • Using Solr language analysis capabilities
      • Indexing data in various format
  4. Searching
    • Simple URI search
    • Paging
    • Sorting
    • Filters
    • Choosing display fields
    • Pseudo fields
    • Debug query
    • Lucene query language
    • Standard query parser
    • Dismax query parser
    • Extended dismax query parser
    • XML query parser
    • Examples of other parsers
    • Timing out searches
    • Using cursor for deep paging
    • Nested documents support
    • Dealing with relevancy
    • Lab
      • Paging
      • Sorting
      • Term searching
      • Using various query parsers
      • Using cursor
  5. Data Analysis
    • Introduction to faceting
    • Basic use cases
    • Field faceting
    • Field prefix faceting
    • Sorting faceting results
    • Limiting faceting
    • Faceting execution control
    • Range faceting
    • Query faceting
    • Hierarchical faceting
    • Interval faceting
    • Lab
      • Building tag cloud using field faceting
      • Using prefixes to build simple autocomplete feature
      • Sorting faceting results
      • Working with numerical data and faceting
      • Using hierarchical faceting to get more insight into the data
      • Interval faceting
  6. JSON Facets
    • Introduction to JSON request API
    • Facet functions
    • Nested JSON facets
    • Execution type
    • Lab
      • Searching using JSON request API
      • Finding top tags
      • Retrieving statistics using range faceting
      • Using terms JSON facets to retrieve term counts
      • Using functions with JSON facets
      • Nesting JSON facets
  7. Highlighting and More Like This
    • Introduction to highlighting
    • Highlighting query hits
    • Specifying fields to highlight
    • Choosing highlighting tags
    • Using FastVectorHighlighter
    • Using PostingsHighlighter
    • Finding similar documents
    • Prerequisites for More Like This functionality
    • Configuring More Like This functionality
    • Lab
      1. Highlighting field matches
      2. Using own tags for matching highlighted fragments
      3. Using various parsers with highlighting
      4. Using different query for highlighting and matching
      5. Finding documents similar to a given one
      6. Using term frequency and length to find similar documents