First, to discuss the tradeoffs for running Solr on Docker. For example, you get dynamic allocation of operating system caches, but you also get some CPU overhead. We’ll keep in mind that Solr nodes tend to be different than your average container: Solr is usually long running, takes quite some RSS and a lot of virtual memory. This will imply, for example, that it makes more sense to use Docker on big physical boxes than on configurable-size VMs (like Amazon EC2).
Second, to discuss issues with deploying Solr on Docker and how to work around them. For example, many older (and some of the newer) combinations of Docker, Linux Kernel and JVM have memory leaks. The below presentation goes over Docker operations best practices, such as using container limits to cap memory usage and prevent the host OOM killer from terminating a memory-consuming process – usually a Solr node. Or running Docker in Swarm mode over multiple smaller boxes to limit the spread of a single issue.

Interested in listening to the 40-minute talk? Check it below.

Don’t have time to watch the video? You can check Solr on Docker – the Good, the Bad and the Ugly slides instead.

[slideshare id=79941531&doc=sematext-lucenerevolution2017-solrondocker-thegoodthebadandtheugly-170919151833]

What’s Next

You may download the Cheat Sheet you need. Here they are:

Running Solr, Data Manipulation, Searching, Faceting, Streaming Aggregations and more…
Solr Cheat Sheet

Execute commands in containers, Docker networks, Data cleanup and more…

Docker Commands Cheat Sheet

Want to learn more about Solr? Subscribe to our blog or follow @sematext. If you need any help with Solr / SolrCloud – don’t forget that we provide Solr Consulting, Solr Production Support, and offer Solr Training.

Need a Solr monitoring solution? Try Sematext Cloud! Monitor all key Solr metrics from Request Rate & Latency to Warmup time, and more.

Solr on Docker – the Good, the Bad and the Ugly – Video & Slides