Intermediate Solr, December 6-7, 2017

Days: December 6-7, 2017

Time: 9:00 AM to 1:00 PM EDT each day

Cost: $800 per participant

Overview

Comprehensive 2-day sessions (two 4-hour sessions), this Solr online class is taught by Rafal Kuć a seasoned Solr instructor and consultant from Sematext, the author of several Solr and Elasticsearch books and frequent conference speaker. The training is held online from 9:00 am – 1:00 pm (ET). After taking this course you will:

  • understand the differences and use-cases for Solr and SolrCloud
  • create Spatial Search and Function Queries
  • perform Document Grouping
  • configure and tune Query Spellchecking and Suggesters.
During the second half of the source you will learn how to tune and scale Solr and SolrCloud, as well as various operational elements, like monitoring, backups, etc. Each section is followed by a lab with multiple hands-on exercises. See course outline below for more.

Who Should Attend

The course is designed for technical attendees with basic Solr experience. A person should be able to index data to Solr, run queries, work with Solr analysis and use faceting. Experience with Linux systems is not a must, but a basic familiarity with running shell commands (e.g., using curl command) will make the course more enjoyable. If you do not have prior Solr experience or have just started working with Solr please consider attending Core Solr class.

Prerequisites

Sematext’s Core Solr or pre-existing knowledge of Solr concepts covered in Core Solr

Why Attend

The virtual Solr training gives you and your team the skills needed to successfully use Solr capabilities by improving your workflow and increasing efficiency.  Further benefits:
  • a customized learning experience
  • same high-quality instruction as our public or private Elasticsearch classes
  • more affordable than public training
  • more flexible – no need to travel

Things to Remember

For the online training all participants must use their own computer with OSX, Linux, or Windows, with the latest version of Java installed.  Participants should be comfortable using a terminal / command line. Sematext provides:
  • a digital copy of the training material
  • a VM with all configs, scripts, exercises, etc.

Course Outline

Modules

  1. Solr Architecture
    • Solr master – slave architecture
    • SolrCloud architecture
    • Solr master – slave vs SolrCloud
  2. Spatial Search
    • Indexing spatial data
    • Spatial filters
    • Distance function queries
    • Bounding box field
    • Heatmap faceting
    • Lab
      • Configuring spatial field types
      • Indexing spatial data
      • Searching for documents within distance from a point
      • Sorting documents on the basis of a distance
      • Boosting documents on the basis of distance
  3. Documents Grouping
    • Grouping documents by field value
    • Grouping documents by function value
    • Grouping documents by query
    • Paging in grouped results
    • Controlling number of groups and documents count
    • Sorting inside groups
    • Documents grouping and faceting
    • Using collapse query parser
    • Using expand component
    • Lab
      • Displaying top matching document per group
      • Sorting grouping results
      • Controlling number of displayed documents and groups
      • Sorting inside groups
      • Using queries for creating document groups
      • Displaying number of calculated groups
      • Using faceting with grouping
      • Using collapse parser to execute efficient grouping
  4. Relations
    • Flat documents approach
    • Nested documents
    • Dynamic query time joins
    • Lab
      • Working with nested documents
      • Working with query time joins
  5. Function Queries
    • Using function queries
    • Math function queries
    • Term function queries
    • Example use cases
    • Boosting by using functions
    • Sorting by function
    • External file field type
    • Using external file field type for boosting
    • Lab
      • Using efficient range filtering
      • Sorting on the basis of function value
      • Including function value in returned documents
      • Boosting using function value
  6. Spellchecking
    • Spellchecker with its own index
    • File based spellchecker
    • Index based spellchecker
    • Building spellchecker
    • Including spell checking results with queries
    • Querying spellchecker independently
    • Maximum number of suggestions
    • Collation
    • Controlling collation
    • Accuracy
    • Extended results
    • Lab
      • Working with Spellchecker configuration
      • Running queries with Spellchecker
      • Using various Spellchecker implementations
  7. Suggesters
    • What are suggesters
    • Suggester types
    • Configuring suggesters
    • Using different dictionary factories
    • Lab
      • Creating suggester configuration for a field
      • Building suggester dictionary automatically
      • Creating separate suggester configuration
      • Using created suggester
  8. API v2
    • What is API v2
    • Nested documents
  9. Configuring Solr Internals
    • General solrconfig.xml section
    • Replication
    • Update request processors
    • Language detection
    • Configuring logging
    • Slow query log
    • Lab
      • Preparing master – slave replication
      • Language detection during document indexing
      • Configuring slow logging
  10. Tuning Solr
    • Indexing threads
    • Indexing buffer size
    • Auto commit tuning
    • Caches
    • Replication throttling
    • Warming up
    • Lab
      • Configuring indexing buffer
      • Configuring auto commits
      • Configuring warming queries
  11. Scaling Solr & SolrCloud
    • Solr master & slaves configuration
    • ZooKeeper role explained
    • Working with ZooKeeper
    • Sharding
    • Using Collections API
    • SolrCloud Replica Types
    • Caches in SolrCloud
    • Aliases
    • Lab
      • Creating collections
      • Creating aliases
      • Setting up caches for rapidly changing data
      • Setting up caches for high querying scenarios
  12. Operations
    • Running Solr as a service on Linux and Windows systems
    • Backing up Solr master ­ slave
    • Backing up SolrCloud
    • Monitoring using JMX
    • Monitoring using SPM
    • Key Metrics to Monitor
    • Lab
      • Install Solr using install scripts
      • Create a backup of master – slave environment