Registration is open - Live, Instructor-led Online Classes - Elasticsearch in March - Solr in April - OpenSearch in May. See all classes


Solr

Solr Operations Training

Learn to tune and scale Solr & SolrCloud

Rating1-star 1-star 1-star 1-star half-star
Our Solr training classes have a 4.48/5 rating based on 40 reviews

In this hands-on Solr course you will learn how to deploy, manage, tune and scale Solr/SolrCloud setups. From choosing the right hardware profile for your use-case to tuning everything from the Lucene merge policy to Solr's caches and JVM's garbage collection to scaling a SolrCloud cluster to work with many collections and everything in between. This online training covers Solr 7.x, 8.x and 9.x. Each section is followed by a lab with multiple hands-on exercises. See course outline below for more.

See Course Outline

Your Apache Solr instructor is Radu Gheorghe, an active Solr engineer and consultant with 10+ years of experience helping enterprise, medium and small organizations from 15+ different industries. Here are some issues Radu solved for Sematext clients recently:

  • Designed and deployed leader-follower and SolrCloud based architectures from small businesses to large scale enterprises use-cases
  • Improved search relevancy to provide on-point results in various business use cases from e-commerce to health industries
  • Optimized clusters handling thousands of queries per second
  • Helped clients reduce operational costs by optimizing the amount of hardware needed as a result of SolrCloud tuning
  • Diagnosed and suggested solutions for various JVM related issues - from garbage collector problems to heap usage reduction

What's Included

  • 8 hours online training
  • A digital copy of the training material
  • Docker Compose files, configs, scripts, etc.
  • Certificate of Completion

Next Class April 22-23, 2024

Upcoming Classes

Price $800

Register Now

On-site training available upon request

Looking for an extended knowledge-based Solr training covering form beginner to an advanced level? You’ve come to the right place.

Request Now

Get Certified Upon Course Completion

Enroll in our course and take the next step in your professional journey.
Complete the course and receive a certificate that showcases your newly acquired skills.

Learn More

Who should attend?

This Solr online course is designed for technical attendees with basic Solr experience: A person should be able to index data to Solr, run queries, work with Solr analysis and use faceting. Experience with Linux systems is not a must, but a basic familiarity with running shell commands (e.g., using curl command) will make the course more enjoyable.

Prerequisites

Sematext's Core Solr or pre-existing knowledge of Solr concepts covered in Core Solr.

Why attend?

  • Small, interactive, instructor-led classes
  • Lots of hands-on exercises
  • Customized learning experience
  • More flexible - no need to travel
  • Get our Solr certification - Certificate of Completion included

What attendees say

For a non-native English speaker student, this course was a crystal clear explanation of Solr and all the sweetest juice you can extract from your text searches. Even if you think you know enough Solr, I wholeheartedly recommend this course for you.

Nestor Arturo Fernandez Ricaurte
Senior Developer - Legis

If you are serious about getting Solr right, this is the course for you. Simply put, it's the best Solr instruction for those who want to master this domain and use Solr in the real world

Architect at Large Cloud Company

Thank you for a very informative training, such a wealth of information that I truly enjoyed learning about.

Vickie Jean Charles
Sr. System Engineer - Xactly Corporation

The Intermediate and Advanced Solr Training went very well. A few of the things I liked:

— The intermediate and advanced command files included in the training material were very helpful. I’m still referring to them after taking the class as I look more in-depth into the various features.
— The provided example data and scripts to load the data into Solr are another big time saver. Also, this gives me something to refer to as I review the training notes.
— The hands-on exercise/labs

The training covered several Solr features e.g., streaming aggregation, JSON API, etc. that we haven’t really used yet, but may be very useful to us in the future.

David Powell
Software Engineer - Large-Scale NGO in the Educational Sector

The CORE training made the complex concepts around solr easy to understand. Although it was a group setting, I still feel like I received individual attention. All of my questions were answered well, and at the close of each unit, I felt confident in my understanding.

Sarah Smulligan
Software Developer

Thanks Radu for guiding us through this Apache Solr training from Core to the Advance training program. Your experience, explanations and labs helped me to get the knowledge to improve the way we use Apache Solr in my company.

Emmanuel Altamirano
Senior Java Developer at TransUnion

Upcoming Classes

Pick from the Solr Online Course matching your exact needs. Delivery method: Live Online. Time: 09:00 AM - 01:00 PM ET (2 sessions).

solr Solr Operations
April 22-23, 2024
$800 / personSee Course OutlineRegister Now!

Course Outline

Solr Architecture
  • Solr leader-follower architecture
  • SolrCloud architecture
  • Routing
  • Lab
    • Configuring Solr to use routing
    • Search across collections
Configuring Solr Internals
  • Getting used to solrconfig.xml
  • Lucene directory configuration
  • Schema factories
  • Schema version
  • Codecs & compression
  • Merge policy
  • Merge scheduler & warmer
  • Replication
  • Transaction log
  • Distributed inverse document frequency calculation
  • Update request processors
  • Lab
    • Configure Lucene directory
    • Change schema type and compression
    • Configuring merge policy for faster indexing
Tuning
  • Hardware & Memory considerations
  • RAM Buffer size
  • Auto commit tuning
  • Caches
  • Warming up
  • Replication throttling
  • Memory Related Performance Considerations
  • General Performance Considerations
  • Lab
    • Pick the right hardware for a use-case
    • Configure the right heap size for a use-case
    • Tweak GC settings for large heaps
    • Limit replication throughput
    • Tweak commits and caches for specific use-cases
Monitoring
  • Slow query log
  • Metrics API
  • Monitoring using JMX
  • Monitoring using Sematext Cloud
  • Key Metrics to Monitor
  • Lab
    • Locate the main log and the GC log
    • Configure slowlogs
Scaling
  • Solr leader & follower configuration
  • Scaling a Leader-Follower Setup
  • Proper Solr leader configuration
  • Proper Solr followers configuration
  • Sharding in SolrCloud
  • Using ZooKeeper
  • Collection state
  • Handling replicas
  • SolrCloud Replica Types
  • Caches in SolrCloud
  • Replica placement plugins
  • Shard splitting
  • Controlling shard connectivity
  • Working with large number of collections
  • Caches in SolrCloud
  • Controlling distributed search
  • Lab
    • Choose the right number of shards and replicas
    • Create a collection
    • Add and remove replicas
    • Split shards
Operations
  • API v2
  • Aliases
  • Schema API
  • Managed resources
  • Config API
  • Running Solr as a service
  • Backups in Solr and Solr-Cloud
  • Migrating/reindexing data between collections/clusters
  • Lab
    • Create a new field in the schema via the API
    • Migrate documents between collections

Main Topics

  • Solr leader-follower Architecture
  • Working with Multi-Leader Architecture
  • SolrCloud Architecture
  • Working With Managed Schema and Config
  • Tuning commits, merges, caches
  • Choosing the right hardware, heap size, GC
  • Using the right number of shards, replicas
  • Changing shards, replicas
  • Back up, reindex, migrate data

Course key takeaways

After taking this course you will:

  • Understand whether standalone Solr or SolrCloud works best
  • Be able to choose the right hardware, heap size, sharding
  • Understand trade-off for various configuration options
  • Be able to manage and troubleshoot the cluster via logs, backups, etc

Setup requirements

  • Participants must use their own computer with OSX, Linux, or Windows, with a recent version of Java installed.
  • Participants should be comfortable using a terminal/command line.

Need On-Site or Remote Training

Get in touch with us.