Elasticsearch Operations

Tune and Scale Your Elasticsearch Clusters

If you are looking to administer, tune and scale clusters, in this class you will learn everything you need to handle your Elasticsearch clusters in production – from tuning OS and JVM for performance to commits, merge policies and caches, query routing, scrolling, thread pools, and so on.

Your trainer is an active Elasticsearch consultant who worked with clients from 20+ different industries and the author of Elasticsearch in Action.

See Course Outline

Here are some problems Radu Gheorghe, your Elasticsearch trainer, solved for Sematext clients recently:

Improved search relevancy using Learning to Rank
Optimized multiple petabyte-scale clusters. Some up to 400 nodes.
Designed Elasticsearch index and cluster architecture for dozens of clients
Optimized log ingestion pipelines to parse and enrich 100K+ events/second
Helped clients reduce production Elasticsearch and ingestion pipeline costs by as much as 10x

What's Included

8-hour online training
A digital copy of the training material
Docker Compose files, configs, scripts, etc.
Certificate of Completion

Next Class March 7-8, 2024

Upcoming Classes

Price $800

Get Certified Upon Course Completion

Enroll in our course and take the next step in your professional journey.
Complete the course and receive a certificate that showcases your newly acquired skills.

Learn More

Who should attend?

This Elasticsearch course is designed for technical attendees with basic Elasticsearch experience. A person should be able to index data to Elasticsearch, run queries and aggregations, work with mappings and analysis.

Experience with Linux systems is not a must, but a basic familiarity with running shell commands (e.g., using curl command) will make the course more enjoyable. If you do not have prior Elasticsearch experience, we strongly suggest you consider attending our Intro to Elasticsearch class first.

Why attend?

Small, interactive, instructor-led classes
Lots of hands-on exercises
Customized learning experience
More flexible - no need to travel
Certificate of Completion included

What attendees say

**Andrew Montalenti**
CTO/Founder – Parse.ly

**Chris Hirsch**
Staff Engineer - Wayfair

I had to leave before the last class ended unfortunately. But had I been there I would have given you a mini-speech to tell you how awesome you were! ☺️
Thank you so much for being such a fantastic, patient, and effective instructor throughout this 4-day training. Personally, I appreciated the clarity and expertise in the material and the class.
I also thought the class was meticulously structured over the 4 days, each day not feeling too long, and with the right number of breaks - this made me feel like I had the capacity to learn the things planned for the day without feeling too overwhelmed. Having the course material is great for the team to revisit later.
Overall, we really felt like we learned a lot. To put it in perspective, <name redacted for privacy> (who worked with <name redacted> and <name redacted>) already applied some of the lessons from the training in one of our customer throughput issues today. A big progress for the team. And, I am sure we will continue to apply in the future too – I just want you to know that you have clearly influenced us ☺️

Engineering Manager, Search-Infra at Twitter

Upcoming Classes

Pick from our 8h online classes, structured to correspond to different roles and Elasticsearch knowledge levels: for beginners to experienced developers or ops who want to learn quickly. Delivery method: Live Online. Time: 09:00 AM – 01:00 PM ET (2 sessions).

Elasticsearch Ops
March 7-8, 2024

$800 / person

See Course Outline

Course Outline

Scaling basics

Bootstrapping and discovery
Number of shards and replicas
Node roles
Lab
- Set up a cluster; adjust number of shards and replicas for a new index
- Set up dedicated master, data and client nodes in a cluster

Designing for scale

Time-based indices and aliases
Shard allocation
Cross-cluster search
Lab
- Roll time-based indices by size
- Create a multi-tier cluster
- Set up a cluster to work across availability zones

Cluster maintenance

Index and search templates
Snapshot and restore
Upgrades and cluster restarts
Shard allocation troubleshooting
Lab
- Set up index templates
- Adjust initial recovery settings
- Adjust rebalancing settings
- Perform a cluster upgrade
- Snapshot and restore an index

Troubleshooting and profiling

Health and stats APIs
Cat APIs
Monitoring products
Logging
Task management
Hot threads API
Query profiling
Lab
- Use JSON and Cat APIs to get information about the state of the cluster
- Set up slowlogs
- Profile a query from the slowlog
- Reindex data from one index to another

Performance bottlenecks

Heap size vs OS caches
Garbage collection
Elasticsearch-specific memory usage and caches
Hardware considerations
Lab
- Pick up the right hardware for a use-case
- Size the heap correctly
- Tune the garbage collector for more throughput
- Adjust cache sizing for a use-case

Tuning indexing and queries

How often to commit: translog, index buffer and refresh interval
How data is stored: merge policies
Routing
Thread pools
Lab
- Set up routing
- Adjust refresh, flush and merge settings
- Adjust thread pool size and queue sizes

Main Topics

Scaling with Different Node Types, Multi-Tier and Multi-Cluster Architectures
Efficient Index Designs for Product and Time-Series Search
Tuning Caches, Merge Policies and Other Elasticsearch, JVM and OS Settings
Monitoring: Relevant Metrics to Watch and How to React to Them
Maintenance: Aliases, Snapshots, Recovery, Upgrades, etc

Course key takeaways

After taking this course you will learn about:

Everything you need to handle your Elasticsearch clusters in production: from tuning OS and JVM for performance through commits, merge policies and caches, query routing, scrolling, thread pools, and so on.
A number of tips and tricks for scaling out your cluster, different types of nodes and deployment topologies, the best way to handle time-based indices, etc.
Various Elasticsearch APIs important for keeping your cluster healthy, about backups, hot threads, logging, monitoring tools and so on.

Things to remember

Participants must use their own computer with OSX, Linux, or Windows, with a recent version of Java installed.
Participants should be comfortable using a terminal/command line.

Sematext provides:

A digital copy of the training material
A VM with all configs, scripts, exercises, etc.

Sematext

Menu

Featured

Latest

Categories

Elasticsearch Operations

Tune and Scale Your Elasticsearch Clusters

What's Included

Get Certified Upon Course Completion

Who should attend?

Why attend?

What attendees say

Upcoming Classes

Course Outline

Main Topics

Course key takeaways

After taking this course you will learn about:

Things to remember

Sematext provides:

Need On-Site or Remote Training

Get in touch with us