Registration is open - Live, Instructor-led Online Classes - Elasticsearch in March - Solr in April - OpenSearch in May. See all classes


Elasticsearch

Intro to Elasticsearch

Get started with Elasticsearch

If you’re just getting started, after taking this comprehensive 2-day session (two 4-hour sessions), you will understand all core Elasticsearch concepts - data, master and client nodes and their differences, sharding, replication, mapping, search relevance scoring, etc.

Your trainer is an active Elasticsearch consultant who worked with clients from 20+ different industries and the author of Elasticsearch in Action.

See Course Outline

Here are some problems Radu Gheorghe, your Elasticsearch trainer, solved for Sematext clients recently:

  • Improved search relevancy using Learning to Rank
  • Optimized multiple petabyte-scale clusters. Some up to 400 nodes.
  • Designed Elasticsearch index and cluster architecture for dozens of clients
  • Optimized log ingestion pipelines to parse and enrich 100K+ events/second
  • Helped clients reduce production Elasticsearch and ingestion pipeline costs by as much as 10x

Attendees come in highly motivated, making the class feel more “alive” than I expected. They constantly look for takeaways to improve their setup, from tweaking a boost to changing the sharding strategy. Their use-cases are very diverse, too, so we end up covering a lot of material.

Radu Gheorghe Radu Gheorghe
Sematext Elasticsearch Training Instructor

What's Included

  • 8-hour online training
  • A digital copy of the training material
  • Docker Compose files, configs, scripts, etc.
  • Certificate of Completion

Next Class March 4-5, 2024

Upcoming Classes

Price $800

Register Now

Get Certified Upon Course Completion

Enroll in our course and take the next step in your professional journey.
Complete the course and receive a certificate that showcases your newly acquired skills.

Learn More

Who should attend?

  • This Elasticsearch online course is designed for technical attendees with any knowledge level.
  • No prior Elasticsearch experience or knowledge is required. Experience with Linux is not a must, but basic familiarity with running shell commands (e.g., using curl command) will make the course more enjoyable.

Why attend?

  • Small, interactive, instructor-led classes
  • Lots of hands-on exercises
  • Customized learning experience
  • More flexible - no need to travel
  • Certificate of Completion included

What attendees say

Sematext was an ideal training partner for Parse.ly. We had just recently adopted Elasticsearch on a new project, and they gave us two days of solid training that was tailored to our team’s needs. The material was built atop strong foundations and moved quickly into advanced areas around querying, Lucene internals, and cluster performance. It was clear that it was all informed by real-world experience operating these systems at scale.

Andrew Montalenti Andrew Montalenti
CTO/Founder – Parse.ly

Not your typical by-the-book training. Radu was engaging and was a great guide in our journey through Elasticsearch.

Chris Hirsch Chris Hirsch
Staff Engineer - Wayfair

I had to leave before the last class ended unfortunately. But had I been there I would have given you a mini-speech to tell you how awesome you were! ☺️

Thank you so much for being such a fantastic, patient, and effective instructor throughout this 4-day training. Personally, I appreciated the clarity and expertise in the material and the class.
I also thought the class was meticulously structured over the 4 days, each day not feeling too long, and with the right number of breaks - this made me feel like I had the capacity to learn the things planned for the day without feeling too overwhelmed. Having the course material is great for the team to revisit later.

Overall, we really felt like we learned a lot. To put it in perspective, <name redacted for privacy> (who worked with <name redacted> and <name redacted>) already applied some of the lessons from the training in one of our customer throughput issues today. A big progress for the team. And, I am sure we will continue to apply in the future too – I just want you to know that you have clearly influenced us ☺️

Engineering Manager, Search-Infra at Twitter

Upcoming Classes

Pick from our 8h online classes, structured to correspond to different roles and Elasticsearch knowledge levels: for beginners to experienced developers or ops who want to learn quickly. Delivery method: Live Online. Time: 09:00 AM – 01:00 PM ET (2 sessions).

elasticsearch Intro to Elasticsearch
March 4-5, 2024
$800 / personSee Course OutlineRegister Now!

Course Outline

Basic flow of data in Elasticsearch
  • What is Elasticsearch and typical use-cases
  • Indexing; what is an index and an ID
  • Mappings; stored and indexed fields
  • Analysis basics
  • Realtime get
  • Search; how searches are distributed to shards
  • Aggregations and doc values introduction
  • Updates; versioning
  • Deletes; introduction to Lucene segment merges
  • Lab
    • CRUD operations
    • Query and filter
    • Aggregations
Indexing and storing data
  • Bulk API
  • Mappings and mapping types
  • Subfield definitions
  • Default mappings
  • Texts, keywords, integers and other core types
  • Predefined fields; storing fields separately vs _source
  • Lab
    • Using the bulk API
    • Changing mapping
Text analysis
  • Built-in analyzers: standard language analyzers
  • Custom analyzers
  • Char filters
  • Tokenizers
  • Token filters: lowercase, stemming, synonyms
  • Using the Analyze API
  • Lab
    • Add stemming support
    • Add support for non-ASCII characters
Searching through your data
  • Selecting fields.
  • Sorting and pagination
  • Search basics: term, range and bool queries
  • Match query and its main options
  • Query string query
  • Lab
    • Configure sorting, pagination and select the right fields
    • Using a bool query to combine different match, range and term queries
Aggregations
  • Metrics aggregations: stats, cardinality, percentiles
  • Why terms, cardinality and percentiles are approximate
  • Multi-bucket aggregations: terms, ranges and histograms
  • Nesting aggregations; how nesting works
  • Lab
    • Computing the cardinality of a field
    • Sorting buckets by results of sub-aggregations
    • Nest the sum and histogram aggregations
Clustering Essentials
  • Nodes, shards and replicas
  • How replication works
  • How distributed search works
  • RAM and heap size
  • Bootstrapping a cluster
  • Cat APIs
  • Lab
    • Create an index
    • Verify the distribution of shards
    • Add a new node to the cluster

Main Topics

  • Elasticsearch Basic Concepts
  • Elasticsearch Strengths and Weaknesses
  • Elasticsearch CRUD Operations
  • Field types and Predefined Fields
  • Using and Customizing Analyzers
  • Types of Queries and Aggregations
  • Shards, Replicas and Clustering

Course key takeaways

After taking this course you will:

  • Understand all core Elasticsearch concepts – index, document, sharding, replication, mapping, search relevance, etc.
  • Be able to index data into Elasticsearch and retrieve it using search and realtime get APIs
  • Have a solid grasp of the underlying query parsing, analysis, tokenization, and various types of queries
  • Learn about a number of different types of Elasticsearch aggregations

Things to remember

  • Participants must use their own computer with OSX, Linux, or Windows, with a container management tool installed (Docker Desktop, Podman, nerdctl)
  • Participants should be comfortable using a terminal/command line.

Sematext provides:

  • A digital copy of the training material, including slides and many sample requests
  • A docker-compose.yml file for the lab environment

Need On-Site or Remote Training

Get in touch with us