Skip to main content

Advanced Solr Training

Learn Advanced Solr & SolrCloud Tuning and Scaling


Our Solr training classes have a 4.48/5 rating based on 40 reviews

In this Solr course you will learn about query routing, results re-ranking, term vectors, schema API, custom similarity, merge policy, codecs, language identification, data import handler, advanced Solr and SolrCloud tuning and scaling, shard splitting, data migrations, handling a large number of collections, authentication, Solr and HDFS, and so on. This online training covers Solr 6.x, 7.x, and 8.x (8.6 version included). Each section is followed by a lab with multiple hands-on exercises. See the course outline below for more.

Your Apache Solr instructor is an active Solr engineer and consultant with years of experience helping enterprise, medium, and small organizations. Radu has worked with clients from 10+ industries and regularly spoke at the main Solr conference, Activate (previously Lucene/Solr Revolution). Here are some problems Radu Gheorghe solved for Sematext clients recently:

  • Optimized and troubleshot SolrCloud clusters with 50+ nodes, 10s of TB of data, and thousands of queries per second.
  • Improved search relevancy to provide on-point results in various business use cases from e-commerce to document and people search.
  • Improved average search latency by more than 10x.
  • Developed, troubleshot, and maintained multiple Solr plugins, from query parsing to access control.
  • Improved 99th percentile GC pause times by more than 10x.

What’s Included

  • 8 hours online training
  • A digital copy of the training material
  • Docker Compose files, configs, scripts, etc.
  • Certificate of Completion

Next Class: Dec 12-13, 2022 See Upcoming Classes

$800.00 Register Now

On-site training available upon request

Looking for an extended knowledge-based Solr training covering from beginner to an advanced level? You’ve come to the right place.

Request Now

Why attend?

  • Small, interactive, instructor-led classes
  • Lots of hands-on exercises
  • Customized learning experience
  • More flexible – no need to travel
  • Get our Solr certification – Certificate of Completion included

Who should attend?

This Solr course is designed for technical attendees experienced with Solr and looking to extend their Solr knowledge. A person should be able to index data to Solr, run queries, work with Solr analysis, use faceting, grouping, know basic Solr configuration and tuning principles. Experience with Linux systems is not a must, but a basic familiarity with running shell commands (e.g., using curl command) will make the course more enjoyable. If you do not have prior Solr experience and you would like to take advantage of Solr advance training, please consider attending Core Solr and Intermediate Solr classes.

Prerequisites: Sematext’s Intermediate Solr or pre-existing knowledge of Solr concepts covered in Intermediate and Core Solr.

What attendees say

For a non-native English speaker student, this course was a crystal clear explanation of Solr and all the sweetest juice you can extract from your text searches. Even if you think you know enough Solr, I wholeheartedly recommend this course for you.

Nestor Arturo Fernandez Ricaurte Senior Developer – Legis

Thank you for a very informative training, such a wealth of information that I truly enjoyed learning about.

Vickie Jean Charles Sr. System Engineer – Xactly Corporation

If you are serious about getting Solr right, this is the course for you. Simply put, it’s the best Solr instruction for those who want to master this domain and use Solr in the real world

Architect at Large Cloud Company

Upcoming Classes

Pick from the Solr Online Course matching your exact needs. Delivery method: Live Online. Time: 09:00 AM – 01:00 PM ET (2 sessions).

Be the first to hear about upcoming classes by signing up to our mailing list

Dec 12-13, 2022Advanced Solr$800 / personSee Course Outline Register Now

Course Outline

Solr Architecture
  • Solr master-slave architecture
  • SolrCloud architecture
  • Routing
Configuring Solr Internals
  • Lucene directory configuration
  • Schema factory settings
  • Schema API
  • Managed resources
  • Codecs
  • Merge policy
  • Merge scheduler
  • Transaction log configuration
  • Config API
  • Lab
    • Configuring Solr to use managed schema
    • Creating new handler using API
    • Configuring merge policy for faster indexing
    • Configuring merge policy for less segments
    • Using Schema API
Data Import Handler
  • Configuring data import handler
  • Using data import handler
  • Entity processors
  • Transformers
  • Lab
    • Importing data from SQL database
    • Partial data import from SQL database
    • Importing data from XML files using
Streaming Aggregations
  • Streaming expressions basics
  • Stream sources
  • Stream decorators
  • Scheduling streams
  • Streaming statistical language
  • SQL over MapReduce in SolrCloud
  • Export request handler
  • Lab
    • Searching using streaming aggregations
    • Merging two results streams
    • Retrieve unique documents based on a given field
    • Using scheduling streams
Expert Solr Tuning
  • Memory considerations
  • Auto commit tuning
  • Caches
  • Replication throttling
  • Lab
    • Configuring auto commits
    • Throttle replication
Expert SolrCloud
  • Sharding and replication
  • Autoscaling
  • Cluster state explained
  • Caches in SolrCloud
  • Shard splitting
  • Migrating data between collections
  • Working with large number of collections
  • Lab
    • Creating collection matching environment needs
    • Adding and removing replicas
    • Moving shards around the cluster
    • Adding shards to collection
    • Migrating data between collections

Main Topics

  • Solr Master-Slave Architecture
  • Working with Multi-Master Architecture
  • SolrCloud Architecture
  • High Availability, Fault Tolerance, Performance
  • Routing, Local Params & Parameter Dereferencing
  • Tagging, Exclusions and Advanced Faceting Control
  • Working With Managed Schema
  • Configuring Merge Policy
  • Using Data Import Handler
  • Using SolrCloud as Streaming Engine
  • Tuning & Scaling Solr Master – Slave
  • Tuning & Scaling SolrCloud
  • Securing Solr

Elasticsearch Training

Course key takeaways

After taking this course you will:

  • Understand the differences and use-cases for Solr and SolrCloud
  • Create Spatial Search and Function Queries
  • Perform Document Grouping
  • Configure and tune Query Spellchecking and Suggesters

Things to remember

Participants must use their own computer with OSX, Linux, or Windows, with a recent version of Java installed.

Participants should be comfortable using a terminal/command line.Sematext provides:
  • A digital copy of the training material
  • A VM with all configs, scripts, exercises, etc.

Need On-Site or Remote Training

Get in touch with us

Stay up to date

Get tips, how-tos, and news about Elastic / ELK Stack, Observability, Solr, and Sematext Cloud news and updates.

Sematext Newsletter
Securely save credentials in User Journey Scripts Learn more