Product Videos

MongoDB Monitoring Support

Play Video
For many of us in the DevOps field, MongoDB is a critical part of our IT stack… Read Blog Post

Log Management with Docker

Play Video
We became Docker Ecosystem Technology Partner (ETP) for logging… Read Blog Post

How to use Grafana with Logsene

Play Video
The Logsene API is compatible with Elasticsearch, which means you can use the Grafana… Read Blog Post

Death to APM and Logging Silos

Play Video
Sematext has combined the power of SPM and Logsene in a single pane of glass…. Read Blog Post

Introducing Logsene Live Tail

Play Video
We’ve been hard at work on our centralized logging SaaS / On-Premises solution… Read Blog Post

Introducing Top Database Operations

Play Video
If you run Elasticsearch, Solr, or any datastore you connect to via JDBC, you’ll like…. Read Blog Post

Network Maps for Infrastructure Discovery

Play Video
We are happy to announce the immediate availability of NetMaps in SPM… Read Blog Post


How to Run Solr on Docker. And Why.

Play Video
Radu Gheorghe & Rafał Kuć at Lucene/Solr Revolution 2016 talking about Solr and Docker…

Tuning Solr and its Pipeline for Logs

Play Video
Radu Gheorghe & Rafał Kuć at Lucene/Solr Revolution 2016 talking about Solr and Logs…

Top Node.js Metrics to Watch

Play Video
Hosted by Wikimedia in Berlin comes the latest presentation from Sematext DevOps Evangelist Stefan Thies… Read Blog Post

Large Scale Log Analytics with Solr

Play Video
In this presentation from Lucene/Solr Revolution 2015, Sematext engineers… Read Blog Post

Part1: Log Analysis with Elasticsearch

Play Video
Velocity NYC conference presentation by Rafal Kuć and Radu Gheorghe…. Read Blog Post

Part2: Log Analysis with Elasticsearch

Play Video
Velocity NYC conference presentation by Rafal Kuć and Radu Gheorghe…. Read Blog Post

Part2: Side by Side with Elasticsearch and Solr: Performance and Scalability

Play Video
Second installment of “Side by Side with Elasticsearch and Solr” talk…. Read Blog Post

Part1: Side by Side with Elasticsearch and Solr: Performance and Scalability

Play Video
First installment of “Side by Side with Elasticsearch and Solr” talk…. Read Blog Post

Tuning Solr for Logs

Play Video
Solr performance tuning, which is always nice…. Read Blog Post

Solr Anti-Patterns

Play Video
Common mistakes and roads that should be avoided at all costs…. Read Blog Post

Solr for Analytics: Metrics Aggregations

Play Video
Solr for Analytics: Metrics Aggregations at Sematext Metrics Aggregations

Indexing and Searching Logs with Elasticsearch or Solr

Play Video
How Logstash, Apache Flume or rsyslog can help you parse… … Logstash

Open Source Search Evolution

Play Video
Bazaarvoice IO 2014 Technical Conference Bazaarvoices IO

Introduction to ElasticSearch

Play Video
What Elasticsearch is and how it can act as your NoSQL… Read Blog Post

Reference Architecture: Monitoring and Logging for Docker Enterprise Edition

Docker Enterprise Edition (EE) simplifies container orchestration and increases the flexibility and scalability of application deployments. However, the high level of automation create new challenges for monitoring and log management. Why? Because each container typically runs a single process, has its own environment, utilizes virtual networks, or has various methods of managing storage.

Performance Monitoring Essentials – Elasticsearch Edition

Elasticsearch is booming. Together with Logstash, a tool for collecting and processing logs, and Kibana, a tool for searching and visualizing data in Elasticsearch (collectively they comprise the “ELK stack”), adoption of Elasticsearch continues to grow by leaps and bounds. In this detailed booklet Sematext’s DevOps Evangelist, Stefan Thies, walks readers through Elasticsearch and ELK stack basics and supplies numerous graphs, diagrams and infographics to clearly explain the essential elements. There is also a “Top 10 Elasticsearch Metrics” list with corresponding explanations and screenshots. The booklet will be especially helpful to those readers new to Elasticsearch and ELK stack, and also to experienced users who want a quick start into performance monitoring.

Log Management & Analytics – A Quick Guide to Logging Basics

This all-things-Logging booklet will especially appeal to readers who are looking to replace Splunk or a similar commercial application with Elasticsearch, Logstash, and Kibana (aka, “ELK stack”) or an alternative logging stack. Topics addressed by our logging experts with how-to instructions, screenshots, code, and more include: 5-Minute Logstash: Parsing and Sending a Log File, Encrypting Logs on Their Way to Elasticsearch, Recipe: rsyslog + Elasticsearch + Kibana, and Structured Logging with rsyslog and Elasticsearch. For more information about logging, see logging posts on Sematext Blog.

Lucene in Action – Second Edition

When Lucene first appeared, this superfast search engine was nothing short of amazing. Today, Lucene still delivers. Its high-performance, easy-to-use API, features like numeric fields, payloads, near-real-time search, and huge increases in indexing and searching speed make it the leading search tool. And with clear writing, reusable examples, and unmatched advice, Lucene in Action, Second Edition is still the definitive guide to effectively integrating search into your applications. It introduces you to searching, sorting, and filtering, and covers the numerous improvements to Lucene since the first edition. Source code is for Lucene 3.0.1.

Mastering Elasticsearch – Second Edition

“Mastering Elasticsearch – Second Edition” covers intermediate and advanced functionalities of Elasticsearch and walks you through its internals including caches, the Apache Lucene library, and its monitoring capabilities. You’ll learn about practical usage of Elasticsearch configuration parameters and how to use the monitoring API. With this book, you’ll delve into Elasticsearch’s query rewrite, query template, bulk operation, document grouping, and function score queries. You will also learn how to improve user search experience, index distribution, segment statistics, and merging. By the end of the book, you will be able to enhance Elasticsearch’s performance and create your own Elasticsearch plugins.

Apache Solr 4 Cookbook

Apache Solr is a blazing fast, scalable, open source Enterprise search server built upon Apache Lucene. Solr is wildly popular because it supports complex search criteria, faceting, result highlighting, query-completion, query spell-checking, and relevancy tuning, amongst other numerous features. “Apache Solr 4 Cookbook” will show you how to get the most out of your search engine. Full of practical recipes and examples, this book will show you how to set up Apache Solr, tune and benchmark performance as well as index and analyze your data to provide better, more precise, and useful search data.

Elasticsearch Server – Second Edition

This book begins by introducing the most commonly used Elasticsearch server functionalities, from creating your own index structure, through querying, faceting, and aggregations, and ends with cluster monitoring and problem diagnosis. As you progress through the book, you will cover topics such as starting Elasticsearch, creating a new index, and designing its proper structure. After that, you’ll read about the query API that Elasticsearch exposes, as well as about filtering capabilities, aggregations, and faceting. Last but not least, you will get to know how to find similar documents by using similar functionalities and how to implement application alerts by using the prospective search functionality called percolator. Some advanced topics such as shard allocation control, gateway configuration, and how to use the discovery module will also be discussed. This book will also show you the possibilities of cluster state and health monitoring as well as how to use third-party tools.

Elasticsearch in Action

Elasticsearch makes it easy to add efficient and scalable searches to your enterprise applications. Busy administrators and developers love this open source real-time search and analytics engine because they can simply install it, make a few tweaks, and go on with their work. And once Elasticsearch is up and running, you’ll discover that it’s miles deep, so you can build nearly any custom search solution you can imagine. The book focuses on Elasticsearch’s REST API via HTTP. Code snippets are written mostly in bash using curl, which makes them easily translatable to other languages.

Spark in Action

Spark in Action teaches you to use Spark for stream and batch data processing. It starts with an introduction to the Spark architecture and ecosystem followed by a taste of Spark’s command line interface. You then discover the most fundamental concepts and abstractions of Spark, particularly Resilient Distributed Datasets (RDDs) and the basic data transformations that RDDs provide. The first part of the book also introduces you to writing Spark applications using the the core APIs. Next, you learn about different Spark components: how to work with structured data using Spark SQL, how to process near-real time data with Spark Streaming, how to apply machine learning algorithms with Spark MLlib, how to apply graph algorithms on graph-shaped data using Spark GraphX, and a clear introduction to Spark clustering.

Apache Solr 3.1 Cookbook

This cookbook will show you how to get the most out of your search engine. Each chapter covers a different aspect of working with Solr from analyzing your text data through querying, performance improvement, and developing your own modules. The practical recipes will help you to quickly solve common problems with data analysis, show you how to use faceting to collect data and to speed up the performance of Solr. This practical guide shows you how to get the most out of Apache Solr 3.1 with recipes that show you how to improve your search engine’s performance, analyze data quickly and efficiently, and customize the search server with your own modules.

Elasticsearch DevOps Cheat Sheet

Elasticsearch is a search engine based on Lucene. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. Check out useful Elasticsearch DevOps snippets on Allocation, Caches, Merges, Troubleshooting and more…

Solr Metrics API Cheat Sheet

Solr is highly reliable, scalable and fault tolerant, providing distributed indexing, replication and load-balanced querying, automated failover and recovery, centralized configuration and more. Solr powers the search and navigation features of many of the world’s largest internet sites. Check out useful Solr snippets such as Jetty Metrics, JVM Metrics, Solr Node Metrics, Troubleshooting, Solr API and more…

Elasticsearch Developer Cheat Sheet

Elasticsearch is a search engine based on Lucene. It provides a distributed, multitenant-capable full-text search engine with an HTTP web interface and schema-free JSON documents. Check out useful Elasticsearch developer snippets on Data Manipulation, Mapping Parameters, Queries, Aggregations, Document Relationships and more…

Solr Cheat Sheet

This Solr / SolrCloud Cheat Sheet shows you how to access all the new Solr features – Running​ ​ Solr, Data​ ​ Manipulation, Searching, Faceting, Streaming​ ​ Aggregations, etc. Print it. Copy-paste from it. Use it when troubleshooting Solr performance issues. Enjoy and share!

Kubernetes Cheat Sheet

Kubernetes is an open source system for automating deployment, scaling and management of containerized applications that was originally designed by Google and donated to the Cloud Native Computing Foundation. It aims to provide a “platform for automating deployment, scaling, and operations of application containers across clusters of hosts”. It usually works with the Docker container tool and coordinates between a wide cluster of hosts running Docker.

The Essential Apache HBase Cheat Sheet

HBase is the Hadoop database: a distributed, scalable Big Data store that lets you host very large tables — billions of rows multiplied by millions of columns — on clusters built with commodity hardware Just as Google Bigtable leverages the distributed data storage provided by the Google File System, HBase provides Bigtable-like capabilities on top of Hadoop and HDFS.

Docker Monitoring Webinar

Play Video
The recent Docker Monitoring webinar is ready for consuming! Read Blog Post

Docker Logging Webinar

Play Video
Docker Logging has been a very popular topic of late in our…. Read Blog Post

External Open Source Projects

Some of the other projects we’ve contributed to over the years in one form of another:

Sematext Open Source Projects

Sematext loves open source software (OSS). We help our customers with a number of OSS search, data processing, and analytics technologies and we use OSS to build our products. We also give back in form of contribution to existing OSS projects, participation in the community, or by open-sourcing new software we’ve built in-house. We are always looking for bright people. If you want to work on stuff like this check out jobs at Sematext.