Solr Operations Training
Learn to tune and scale Solr & SolrCloud
In this hands-on Solr course you will learn how to deploy, manage, tune and scale Solr/SolrCloud setups. From choosing the right hardware profile for your use-case to tuning everything from the Lucene merge policy to Solr's caches and JVM's garbage collection to scaling a SolrCloud cluster to work with many collections and everything in between. This online training covers Solr 7.x, 8.x and 9.x. Each section is followed by a lab with multiple hands-on exercises. See course outline below for more.
Your Apache Solr instructor is Radu Gheorghe, an active Solr engineer and consultant with 10+ years of experience helping enterprise, medium and small organizations from 15+ different industries. Here are some issues Radu solved for Sematext clients recently:
- Designed and deployed leader-follower and SolrCloud based architectures from small businesses to large scale enterprises use-cases
- Improved search relevancy to provide on-point results in various business use cases from e-commerce to health industries
- Optimized clusters handling thousands of queries per second
- Helped clients reduce operational costs by optimizing the amount of hardware needed as a result of SolrCloud tuning
- Diagnosed and suggested solutions for various JVM related issues - from garbage collector problems to heap usage reduction
What's Included
- 8 hours online training
- A digital copy of the training material
- Docker Compose files, configs, scripts, etc.
- Certificate of Completion
Next Class April 22-23, 2024
Price $800
On-site training available upon request
Looking for an extended knowledge-based Solr training covering form beginner to an advanced level? You’ve come to the right place.
Get Certified Upon Course Completion
Enroll in our course and take the next step in your professional journey.
Complete the course and receive a certificate that showcases your newly acquired skills.
Who should attend?
This Solr online course is designed for technical attendees with basic Solr experience: A person should be able to index data to Solr, run queries, work with Solr analysis and use faceting. Experience with Linux systems is not a must, but a basic familiarity with running shell commands (e.g., using curl command) will make the course more enjoyable.
Prerequisites
Sematext's Core Solr or pre-existing knowledge of Solr concepts covered in Core Solr.
Why attend?
- Small, interactive, instructor-led classes
- Lots of hands-on exercises
- Customized learning experience
- More flexible - no need to travel
- Get our Solr certification - Certificate of Completion included
What attendees say
Upcoming Classes
Pick from the Solr Online Course matching your exact needs. Delivery method: Live Online. Time: 09:00 AM - 01:00 PM ET (2 sessions).
Solr Operations
April 22-23, 2024 | $800 / person | See Course Outline | Register Now! |
Course Outline
Solr Architecture
- Solr leader-follower architecture
- SolrCloud architecture
- Routing
-
Lab
- Configuring Solr to use routing
- Search across collections
Configuring Solr Internals
- Getting used to solrconfig.xml
- Lucene directory configuration
- Schema factories
- Schema version
- Codecs & compression
- Merge policy
- Merge scheduler & warmer
- Replication
- Transaction log
- Distributed inverse document frequency calculation
- Update request processors
-
Lab
- Configure Lucene directory
- Change schema type and compression
- Configuring merge policy for faster indexing
Tuning
- Hardware & Memory considerations
- RAM Buffer size
- Auto commit tuning
- Caches
- Warming up
- Replication throttling
- Memory Related Performance Considerations
- General Performance Considerations
-
Lab
- Pick the right hardware for a use-case
- Configure the right heap size for a use-case
- Tweak GC settings for large heaps
- Limit replication throughput
- Tweak commits and caches for specific use-cases
Monitoring
- Slow query log
- Metrics API
- Monitoring using JMX
- Monitoring using Sematext Cloud
- Key Metrics to Monitor
-
Lab
- Locate the main log and the GC log
- Configure slowlogs
Scaling
- Solr leader & follower configuration
- Scaling a Leader-Follower Setup
- Proper Solr leader configuration
- Proper Solr followers configuration
- Sharding in SolrCloud
- Using ZooKeeper
- Collection state
- Handling replicas
- SolrCloud Replica Types
- Caches in SolrCloud
- Replica placement plugins
- Shard splitting
- Controlling shard connectivity
- Working with large number of collections
- Caches in SolrCloud
- Controlling distributed search
-
Lab
- Choose the right number of shards and replicas
- Create a collection
- Add and remove replicas
- Split shards
Operations
- API v2
- Aliases
- Schema API
- Managed resources
- Config API
- Running Solr as a service
- Backups in Solr and Solr-Cloud
- Migrating/reindexing data between collections/clusters
-
Lab
- Create a new field in the schema via the API
- Migrate documents between collections
Main Topics
- Solr leader-follower Architecture
- Working with Multi-Leader Architecture
- SolrCloud Architecture
- Working With Managed Schema and Config
- Tuning commits, merges, caches
- Choosing the right hardware, heap size, GC
- Using the right number of shards, replicas
- Changing shards, replicas
- Back up, reindex, migrate data
Course key takeaways
After taking this course you will:
- Understand whether standalone Solr or SolrCloud works best
- Be able to choose the right hardware, heap size, sharding
- Understand trade-off for various configuration options
- Be able to manage and troubleshoot the cluster via logs, backups, etc
Setup requirements
- Participants must use their own computer with OSX, Linux, or Windows, with a recent version of Java installed.
- Participants should be comfortable using a terminal/command line.