Elasticsearch Operations
Tune and Scale Your Elasticsearch Clusters
If you need to administer, tune, and scale Elasticsearch clusters, this class covers what you need to run them in production: from tuning the OS and JVM for performance to commits, merge policies, caches, query routing, scrolling, and thread pools.
Your trainer is an active Elasticsearch consultant who has worked with clients from 20+ different industries and is the author of Elasticsearch in Action.
Here are some problems Radu Gheorghe, your Elasticsearch trainer, solved for Sematext clients recently:
- Improved search relevancy using Learning to Rank
- Optimized multiple petabyte-scale clusters, some with up to 400 nodes
- Designed Elasticsearch index and cluster architecture for dozens of clients
- Optimized log ingestion pipelines to parse and enrich 100K+ events/second
- Helped clients reduce production Elasticsearch and ingestion pipeline costs by as much as 10x
What's Included
- 8-hour online training
- A digital copy of the training material
- Docker Compose files, configs, scripts, etc.
- Certificate of Completion
Next Class: March 7-8, 2024
Price: $800
Get Certified Upon Course Completion
Enroll in our course and take the next step in your professional journey.
Complete the course and receive a certificate that showcases your newly acquired skills.
Who should attend?
This Elasticsearch course is designed for technical attendees with basic Elasticsearch experience. Attendees should be able to index data to Elasticsearch, run queries and aggregations, and work with mappings and analysis.
Experience with Linux systems is not required, but basic familiarity with running shell commands (e.g., using curl) will make the course more enjoyable. If you do not have prior Elasticsearch experience, we strongly suggest you consider attending our Intro to Elasticsearch class first.
Why attend?
- Small, interactive, instructor-led classes
- Lots of hands-on exercises
- Customized learning experience
- More flexible - no need to travel
- Certificate of Completion included
Upcoming Classes
Choose from our 8-hour online classes, structured around different roles and Elasticsearch knowledge levels, from beginners to experienced developers and ops engineers who want to learn quickly. Delivery method: Live Online. Time: 09:00 AM – 01:00 PM ET (2 sessions).
Elasticsearch Ops
March 7-8, 2024 | $800 / person
Course Outline
Scaling basics
- Bootstrapping and discovery
- Number of shards and replicas
- Node roles
Lab
- Set up a cluster; adjust number of shards and replicas for a new index
- Set up dedicated master, data and client nodes in a cluster
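As a rough illustration of the first lab item, the sketch below builds the JSON body for a create-index request with an explicit number of primary shards and replicas. The helper names (`index_creation_body`, `total_shard_copies`) and the values 3 and 1 are made up for this example and are not taken from the course material; `number_of_shards` and `number_of_replicas` are the standard Elasticsearch index settings.

```python
import json


def index_creation_body(shards: int, replicas: int) -> dict:
    """Build the body for a create-index request (PUT /<index>).

    Hypothetical helper for illustration only.
    """
    if shards < 1:
        raise ValueError("an index needs at least one primary shard")
    if replicas < 0:
        raise ValueError("the replica count cannot be negative")
    return {
        "settings": {
            "index": {
                "number_of_shards": shards,
                "number_of_replicas": replicas,
            }
        }
    }


def total_shard_copies(shards: int, replicas: int) -> int:
    """Primaries plus all replica copies the cluster has to host."""
    return shards * (1 + replicas)


if __name__ == "__main__":
    # Example values only: 3 primaries, 1 replica of each.
    print(json.dumps(index_creation_body(3, 1), indent=2))
    print("shard copies to allocate:", total_shard_copies(3, 1))
```

The number of primary shards is fixed when the index is created, while the number of replicas can be changed later through the index settings API, which is why the lab applies these settings to a new index.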
Designing for scale
- Time-based indices and aliases
- Shard allocation
- Cross-cluster search
Lab
- Roll time-based indices by size
- Create a multi-tier cluster
- Set up a cluster to work across availability zones
Cluster maintenance
- Index and search templates
- Snapshot and restore
- Upgrades and cluster restarts
- Shard allocation troubleshooting
Lab
- Set up index templates
- Adjust initial recovery settings
- Adjust rebalancing settings
- Perform a cluster upgrade
- Snapshot and restore an index
Troubleshooting and profiling
- Health and stats APIs
- Cat APIs
- Monitoring products
- Logging
- Task management
- Hot threads API
- Query profiling
Lab
- Use JSON and Cat APIs to get information about the state of the cluster
- Set up slowlogs
- Profile a query from the slowlog
- Reindex data from one index to another
Performance bottlenecks
- Heap size vs OS caches
- Garbage collection
- Elasticsearch-specific memory usage and caches
- Hardware considerations
Lab
- Pick the right hardware for a use case
- Size the heap correctly
- Tune the garbage collector for more throughput
- Adjust cache sizing for a use case
Tuning indexing and queries
- How often to commit: translog, index buffer and refresh interval
- How data is stored: merge policies
- Routing
- Thread pools
Lab
- Set up routing
- Adjust refresh, flush and merge settings
- Adjust thread pool size and queue sizes
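To give a flavor of the refresh, flush and merge lab item, here is a minimal sketch that builds the body of an update-settings request for an indexing-heavy index. The function name and the default values (`30s`, `1gb`, one merge thread) are illustrative assumptions, not recommendations from the course; the setting names `index.refresh_interval`, `index.translog.flush_threshold_size` and `index.merge.scheduler.max_thread_count` are standard Elasticsearch index settings.

```python
import json


def indexing_tuning_settings(
    refresh_interval: str = "30s",
    flush_threshold_size: str = "1gb",
    merge_threads: int = 1,
) -> dict:
    """Build the body for an update-settings request (PUT /<index>/_settings).

    Hypothetical helper; all default values are example values only.
    """
    if merge_threads < 1:
        raise ValueError("at least one merge thread is required")
    return {
        "index": {
            # How often newly indexed documents become visible to searches.
            "refresh_interval": refresh_interval,
            # Translog size that triggers a flush (a Lucene commit).
            "translog": {"flush_threshold_size": flush_threshold_size},
            # Upper bound on concurrent merge threads for this index.
            "merge": {"scheduler": {"max_thread_count": merge_threads}},
        }
    }


if __name__ == "__main__":
    print(json.dumps(indexing_tuning_settings(), indent=2))
```

Longer refresh intervals and larger flush thresholds generally trade search freshness and recovery time for indexing throughput, so suitable values depend on the use case.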
Main Topics
- Scaling with Different Node Types, Multi-Tier and Multi-Cluster Architectures
- Efficient Index Designs for Product and Time-Series Search
- Tuning Caches, Merge Policies and Other Elasticsearch, JVM and OS Settings
- Monitoring: Relevant Metrics to Watch and How to React to Them
- Maintenance: Aliases, Snapshots, Recovery, Upgrades, etc
Course key takeaways
After taking this course you will know:
- How to handle your Elasticsearch clusters in production: from tuning the OS and JVM for performance through commits, merge policies and caches, query routing, scrolling, and thread pools.
- Tips and tricks for scaling out your cluster, the different types of nodes and deployment topologies, and the best way to handle time-based indices.
- The Elasticsearch APIs and tools that matter for keeping your cluster healthy: backups, hot threads, logging, and monitoring.
Things to remember
- Participants must use their own computer running macOS, Linux, or Windows, with a recent version of Java installed.
- Participants should be comfortable using a terminal/command line.
Sematext provides:
- A digital copy of the training material
- A VM with all configs, scripts, exercises, etc.