Let’s Encrypt has announced that it will no longer send certificate expiration notification emails. What now?

Results for "solr"

Solr Monitoring Tools

Solr is widely adopted by startups and enterprises alike. It's powerful and open-source, so it's very appealing to just about everyone looking for a search platform to build off of.

Using Solr Operator to Autoscale Solr on Kubernetes

In this tutorial, you'll see how to deploy Solr on Kubernetes. You'll also see how to use the Solr Operator to autoscale a SolrCloud cluster based on CPU with the

11 Alternatives to Elasticsearch, OpenSearch, and Solr

Introduction In the ever-evolving world of search engines, Elasticsearch, OpenSearch, and Solr have long held the spotlight. However, there are several smaller search platforms that pack a punch and offer

OpenSearch vs Solr: Which One Is Better to Use?

If you’re looking for a short answer on OpenSearch vs Solr, here’s a flow chart: We normally recommend the one you (or your team) already know or the prefer because,

All About Solr Replica Placement Plugins

With Solr 9 the Autoscaling Framework was removed - for being too complex and not terribly reliable - and instead we have Replica Placement Plugins. Unlike Autoscaling, replica placement only

Writing a Custom Sort Plugin for Solr

OK, so you want to sort your documents by something that can’t be implemented with Solr’s built-in functions. This calls for a custom function, which you can implement through your

solr-reindexer: Quick Way to Reindex to a New Collection

If you’re using Solr, for sure there are times when you change the schema and need to reindex. Quite often the source of truth is a database, so you can

Solr vs Elasticsearch: Performance Differences & More. How to Decide Which One Is Best for You

“Solr or Elasticsearch?”…well, at least that is the common question we hear from Sematext’s consulting services clients and prospects. Which one is better, Solr or Elasticsearch? Which one is faster?

Solr Performance: Troubleshooting Solr Slow Queries Using Logs and Metrics

Let's say you get an alert that one or more queries is slow. Or that your users complain, whichever comes first 🙂  We've all been there… How do you find

Working with Solr Plugins System

Apache Solr was always ready to be extended. What was only needed is a binary with the code and the modification of the Solr configuration file, the solrconfig.xml and we

Solr-diagnostics: How to use it and what it collects

If you’re running Solr and have to troubleshoot it (or maybe you just want a good overview!), then you’d probably want to collect logs, configs, maybe a snapshot of metrics

Complete Apache Solr Guide

The ability to search is a key feature of most modern applications. While encompassing huge amounts of data, they need to allow the end-user to find what they're searching for

Solr Monitoring Made Easy with Sematext

As shown in Part 1, key Solr metrics to monitor, the setup, tuning, and operations of Solr require deep insights into the performance metrics such as request rate and latency,

Solr Key Metrics to Monitor

As the first part of the three-part series on monitoring Apache Solr, this article explores which Solr metrics are important to monitor and why. The second part of the series

Using Solr to Tag Text

Over the years, natural language processing, in the world of search, went from interesting detail to a must have, especially in areas such as e-commerce. Engineers started incorporating classification, synonym

Search Relevance – Solr & Elasticsearch Similarities

What is Search Relevance Similarity Lucene has a lot of options for configuring similarity. By extension, Solr and Elasticsearch have the same options. Similarity makes the base of your relevancy

Solr Learning To Rank and Streaming Expressions

During the Entity Extraction For Product Searches talk that Radu Gheorghe and I gave at Activate conference in Montreal last year, we talked about various natural language processing and machine learning algorithms. We

Solr online training classes in 2018

Since we’ve been working with Solr on a daily basis for over a decade now and know so much about it, we like helping others improve their knowledge of these

Solr Streaming Expressions for Collection auto-updating

One of the things that were extensively changed in Solr 6.0 is the Streaming Expressions and what we can do with them (hint: amazing stuff!). We already described Solr SQL

Solr 6, SolrCloud and SQL Queries

With the recent release of Apache Lucene and Solr 6, we should familiarize ourselves with the juicy features that come with them. We have the new default Similarity implementation -

Solr 7 – New Replica Types

With the release of Solr 7 the community around it produced yet another great version of this search engine. As usual, there is an extensive list of changes, bug fixes

Solr: Optimize Is (Not) Bad for You – Video & Slides

Another Lucene/Solr Revolution happened on September 12-15, 2017 in Las Vegas. Sematext was there, exhibiting AND giving two talks! Thanks to everyone who stopped by our booth and attended our two talks: Optimize Is (Not) Bad

Solr on Docker – the Good, the Bad and the Ugly – Video & Slides

Another Lucene/Solr Revolution happened on September 12-15, 2017 in Las Vegas. Sematext was there, exhibiting AND giving two talks! Thanks to everyone who stopped by our booth and attended our two talks: Optimize Is (Not) Bad

Solr V2 API – Quick Look

Last updated on Jan 11, 2018 We are all used to the Solr API that has been present in Solr from its beginnings. We send the data using HTTP protocol,

Sematext Solr AutoComplete: Introduction and Howto

Sematext Solr AutoComplete is an open-source Solr add-on that provides suggest-as-you-type functionality. In this post we'll explain how you can install it, load the autocomplete collection/core with suggestions and how

Solr New Metrics API: Quick look at Solr 6.4

As you know, in Sematext we looooove logs and metrics and we enjoy playing with them on a daily basis. We have our Logsene, which is all about logs and

Running Solr in Docker: How & Why

Docker is all the rage these days, but one doesn't hear about running Solr on Docker very much. Last month, we gave a talk on the topic of running containerized

Tuning Solr & Pipeline for Logs – Video & Slides

Not everyone uses Splunk or ELK stack for logs. A few weeks ago, at the Lucene/Solr Revolution conference in Boston, we gave a talk about using Solr for logging, along

Handling Shards in SolrCloud

Last updated on Jan 10, 2018 One of the things you learn when attending Sematext Solr training is how to scale Solr. We discuss various topics regarding leader shards and

DocValues Reindexing with Solr Streaming Expressions

Last updated on Jan 8, 2018 Last time, when talking about Solr 6 we learned how to use streaming expressions to automatically update data in a collection. You can imagine

Presentation: Large Scale Log Analytics with Solr

In this presentation from Lucene/Solr Revolution 2015, Sematext engineers -- and Solr and centralized logging experts -- Radu Gheorghe and Rafal Kuć talk about searching and analyzing time-based data at

SolrCloud: Dealing with Large Tenants and Routing

Last updated on Jan 10, 2018 Many Solr users need to handle multi-tenant data. There are different techniques that deal with this situation: some good, some not-so-good. Using routing to handle such

Scaling Elasticsearch by Cleaning the Cluster State

We often get questions like: How much data can I put in an Elasticsearch cluster? How many nodes can an Elasticsearch cluster have? What's the biggest cluster that you've seen?

Running OpenSearch on Kubernetes With Its Operator

If you’re thinking of running OpenSearch on Kubernetes, you have to check out the OpenSearch Kubernetes Operator. It’s by far the easiest way to get going, you can configure pretty

OpenSearch vs Elasticsearch: Which One Is Better to Use?

Whenever we start a search consulting project from scratch, the obvious question is: which search engine to use? We’ve talked about Elasticsearch vs Solr before, but here we’ll compare Elasticsearch

5 Logstash Alternatives [2025 Review]

When it comes to centralizing logs to Elasticsearch, the first log shipper that comes to mind is Logstash. People hear about it even if it’s not clear what it does:

20+ Best Log Management Tools for Monitoring, Analytics & More: Pros & Cons Comparison [2023]

Whether you capture them for application security and compliance, production monitoring, performance monitoring, or troubleshooting, logs contain valuable information about the health of your apps. But it all comes down

13 Best System Monitoring Tools in 2025

A system is “a set of things working together as parts of a mechanism or an interconnecting network; a complex whole”. So when we say System Monitoring Tools we mean

How to Monitor ZooKeeper: Key Metrics & Best Tools [2023 Comparison]

Apache Zookeeper is a great tool used by many popular tools. Your Kafka uses Zookeeper, your HDFS uses it, your SolrCloud uses it, and your ClickHouse may also be using

Buffer vs. Cache

What Is Buffer? A buffer is a temporary area where data is stored in the main memory (RAM) or disk while moving from an input system to an output system.

Autoscaling Elasticsearch Clusters for Logs: Using a Kubernetes Operator to Scale Up or Down

When we say “logs” we really mean any kind of time-series data: events, social media, you name it. See Jordan Sissel’s definition of time + data. And when we talk

DevSecOps

Definition: What Is DevSecOps and How Does It Work? DevSecOps stands for development, security, and operations. It's a software development practice that involves integrating security throughout the entire software development

Black Friday log management (with the Elastic Stack) checklist

For this Black Friday, Sematext wishes you: more products sold more traffic and exposure more logs :) Now seriously, applications tend to generate a lot more logs on Black Friday,

What Is Log Aggregation: A Complete Guide to Best Practices & Tools You Need to Know

Every developer's worst nightmare is digging through a huge log file, trying to pinpoint problems. The troubleshooting most likely won't stop there. They'll have to follow the trail to multiple

Understanding Apache Logging: How to View, Locate and Analyze Access & Error Logs

Apache - the technology that powers the web. I'm not sure if that is correct, but I think that we wouldn't see the world wide web in its current shape

Exception Handling in Java: How-to Tutorial with Examples & Best Practices

As developers, we would like our users to interact with applications that run smoothly and without issues. We want the libraries that we create to be widely adopted and successful.

Key JVM Metrics to Monitor for Peak Java Application Performance

Monitoring is crucial if you want to see what happens in your system and JVM-based applications are not different. Well, some metrics, like memory and garbage collection, require special attention

Understanding Java Garbage Collection Logging: What Are GC Logs and How to Analyze Them

When working with Java or any other JVM-based programming language we get certain functionalities for free. One of those functionalities is clearing the memory. If you’ve ever used languages like

Node.js Logging Tutorial: How to Check, Write & Debug [Best Practices]

Building SaaS products is hard. Making customers happy is even harder. I should know, I've built a start-up that failed! But, not everything is that bad. I learned a lot.

Top 10 Elasticsearch Metrics to Monitor

This article is part 2 of a four-part series of articles about Elasticsearch performance monitoring. Part 1 explains what Elasticsearch is and how it works, while in this part, we're

Introduction to Metrics, Monitoring and Alerting

Monitoring your system and infrastructure is critical to ensure the performance of your services. In fact, as software development moves faster and faster, alerting and monitoring becomes an indispensable practice

Service Autodiscovery & Automatic Monitoring with Sematext

If you are anything like us here at Sematext, you are likely always trying to automate any tedious, repetitive tasks. Repetitio est mater… boringdorum. Setting up monitoring falls in that

Tutorial: Logging with journald

I'm sure you bumped into journald: it's what most distros use by default for system logging in Linux. Most applications running as a service will also log to the journal.

Docker Container Performance Metrics to Monitor

In Part 1 we’ve described what container monitoring is and why you need it. Because each container typically runs a single process, has its own environment, utilizes virtual networks, or has

A Step-by-Step Guide to Java Garbage Collection Tuning

Working with Java applications has a lot of benefits. Especially when compared to languages like C/C++. In the majority of cases, you get interoperability between operating systems and various environments.

Complete Elasticsearch Guide for Beginners

Search and Analytics are key features of modern software applications. Scalability and the capability to handle large volumes of data in near real-time is a must for many applications such

Complete Log Management Guide for DevOps

Definition: What Is Log Management Log management is the process of handling log events generated by all software applications and infrastructure on which they run. It involves log collection, aggregation,

What Is Log Management in DevOps

Definition: What Is Log Management Log management is the process of handling log events generated by all software applications and infrastructure on which they run. It involves log collection, aggregation,

Entity Extraction for Product Searches

What is Entity Extraction? Entity extraction is, in the context of search, the process of figuring out which fields a query should target, as opposed to always hitting all fields.

Entity Extraction with spaCy

What is Entity Extraction? Entity extraction is, in the context of search, the process of figuring out which fields a query should target, as opposed to always hitting all fields.

4 Best Practices for Choosing your DevOps Tools

By Chris Tozzi If you google “DevOps tools,” you’ll see a dizzying litany of software applications, all promising to simplify your life as a DevOps engineer. This can be an

Entity Extraction with Scikit-learn Classifiers

What is entity extraction? Entity extraction is the process of figuring out which fields a query should target, as opposed to always hitting all fields. For example: how to tell,

Generating Word Embeddings with Gensim’s word2vec

During our Activate presentation, we talked about how to do query expansion by dynamically generating synonyms. Instead of statically defining synonyms lists, we showed a demo of how you could

The New Version of Logagent Enriches Container Logs with Metadata and GeoIP

Logagent is a modern, open-source, light-weight data shipper with out of the box and extensible log parsing, on-disk buffering, secure transport and bulk indexing to Elasticsearch and Sematext Cloud. Its

Named Entity Extraction with OpenNLP

We recently had a presentation at Activate 2018 about entity extraction in the context of a product search. For example: how to tell, when the user typed in Activate 2018,

Now Open Source: Sematext Monitoring Agent

You can now create monitoring integrations for your technology stack easier than ever before. We’ve recently opened-sourced Sematext’s Monitoring Agent, reworked it, and made it fully pluggable, making it possible for

Garbage Collection Settings for Elasticsearch Master Nodes

Elasticsearch comes with good out-of-the-box Garbage Collection settings. So good in fact that the Definitive Guide recommends not changing them. While we agree that most use-cases wouldn’t benefit from GC

Java 9 Elasticsearch Benchmark

TL;DR: The main question here is: How Does Java 9 Work with Elasticsearch 6? It works well, but don't expect miracles. Unless you're using G1, then there are some miracles. With

Metrics & Logs – Together or Separate?

Every modern organization depends on software, and where there is software there are metrics and logs and, one would hope, monitoring of these two critically important types of data. Operations

[VIDEO] Elasticsearch for logs and metrics

Since we work with Elasticsearch and Solr on a daily basis and know so much about them we like helping others improve their knowledge of these technologies, not only through

Elasticsearch for logs and metrics: A deep dive

We are known worldwide for our Elasticsearch, ELK stack and Solr consulting services, and we are always happy to help others improve their skills in these technologies, not only through

Reindexing Data with Elasticsearch

Last updated on Jan 8, 2018 SIDE NOTE: We run Elasticsearch and ELK trainings, which may be of interest to you and your teammates. Sooner or later, you'll run into

Recipe: Apache Logs + rsyslog (parsing) + Elasticsearch

More than two years ago we posted a recipe on how to centralize syslog in Elasticsearch in order to search and analyze them with Kibana, all by using only rsyslog.