As with the other recipes, I’ll show you how to install and configure the needed components. The end result would be that local syslog (and tailed files, if you want to tail them) will end up in Elasticsearch, or a logging SaaS like Logsene (which exposes the Elasticsearch API for both indexing and searching). Of course, you can choose to change your rsyslog configuration to parse logs as well (as we’ve shown before), and change Logstash to do other things (like adding GeoIP info).

Getting the ingredients for the logstash + kafka + rsyslog integration

rsyslog Kafka Output

First of all, you’ll probably need to update rsyslog. Most distros come with ancient versions and don’t have the plugins you need.

From the official packages you can install:

rsyslog. This will update the base package, including the file-tailing module
rsyslog-kafka. This will get you the Kafka output module

Setting up Kafka

If you don’t have Kafka already, you can set it up by downloading the binary tar. And then you can follow the quickstart guide. Basically you’ll have to start Zookeeper first (assuming you don’t have one already that you’d want to re-use):

bin/zookeeper-server-start.sh config/zookeeper.properties

And then start Kafka itself and create a simple 1-partition topic that we’ll use for pushing logs from rsyslog to Logstash. Let’s call it rsyslog_logstash:

bin/kafka-server-start.sh config/server.properties
bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic rsyslog_logstash

Finally, you’ll have Logstash.

After downloading and unpacking, you can start it via:

bin/logstash -f logstash.conf

Though you also have packages, in which case you’d put the configuration file in /etc/logstash/conf.d/ and start it with the init script.

rsyslog to Logstash via Kafka

rsyslog inputs, templates and queues

With rsyslog, you’d need to load the needed modules first:

module(load="imuxsock")  # will listen to your local syslog
module(load="imfile")    # if you want to tail files
module(load="omkafka")   # lets you send to Kafka

If you want to tail files, you’d have to add definitions for each group of files like this:

input(type="imfile"
  File="/opt/logs/example*.log"
  Tag="examplelogs"
)

Then you’d need a template that will build JSON documents out of your logs. You would publish these JSON’s to Kafka and consume them with Logstash. Here’s one that works well for plain syslog and tailed files that aren’t parsed via mmnormalize:

template(name="json_lines" type="list" option.json="on") {
  constant(value="{")
  constant(value=""timestamp":"")
  property(name="timereported" dateFormat="rfc3339")
  constant(value="","message":"")
  property(name="msg")
  constant(value="","host":"")
  property(name="hostname")
  constant(value="","severity":"")
  property(name="syslogseverity-text")
  constant(value="","facility":"")
  property(name="syslogfacility-text")
  constant(value="","syslog-tag":"")
  property(name="syslogtag")
  constant(value=""}")
}

By default, rsyslog has a memory queue of 10K messages and has a single thread that works with batches of up to 16 messages (you can find all queue parameters here).

You may want to change:

the batch size, which also controls the maximum number of messages to be sent to Kafka at once
the number of threads, which would parallelize sending to Kafka as well
the size of the queue and its nature: in-memory(default), disk or disk-assisted

In a rsyslog->Kafka->Logstash setup I assume you want to keep rsyslog light, so these numbers would be small, like:

main_queue(
  queue.workerthreads="1"      # threads to work on the queue
  queue.dequeueBatchSize="100" # max number of messages to process at once
  queue.size="10000"           # max queue size
)

rsyslog Kafka Output

Finally, to publish to Kafka you’d mainly specify the brokers to connect to (in this example we have one listening to localhost:9092) and the name of the topic we just created:

action(
  broker=["localhost:9092"]
  type="omkafka"
  topic="rsyslog_logstash"
  template="json"
)

Assuming Kafka is started, rsyslog will keep pushing to it.

Logstash Kafka Input

This is the part where we pick the JSON logs (as defined in the earlier template) and forward them to the preferred destinations.

First, we have the input, which will use the Kafka topic we created. To connect, we’ll point Logstash to at least one Kafka broker, and it will fetch info about other Kafka brokers from there:

input {
  kafka {
    bootstrap_servers => ["localhost:9092"]
    topics => ["rsyslog_logstash"]
  }
}

If you need Logstash to listen to multiple topics, you can add all of them in the topics array. A regular expression (topics_pattern) is also possible, if topics are dynamic and tend to follow a pattern.

Logstash Elasticsearch Output

At this point, you may want to use various filters to change your logs before pushing to Logsene or Elasticsearch. For this last step, you’d use the Elasticsearch output:

output {
  elasticsearch {
    hosts => "logsene-receiver.sematext.com:443" # it used to be "host" and "port" pre-2.0
    ssl => "true"
    index => "your Logsene app token goes here"
    manage_template => false
    #protocol => "http" # removed in 2.0
    #port => "443" # removed in 2.0
  }
}

And that’s it! Now you can use Kibana (or, in the case of Logsene, either Kibana or Logsene’s own UI) to search your logs!

Log in

Search site

Recipe: How to integrate rsyslog with Kafka and Logstash

Table of contents

Getting the ingredients for the logstash + kafka + rsyslog integration

rsyslog Kafka Output

Setting up Kafka

rsyslog to Logstash via Kafka

rsyslog inputs, templates and queues

rsyslog Kafka Output

Logstash Kafka Input

Logstash Elasticsearch Output

Best Logging Practices: 14 Do’s and Don’ts for Better Logging

Log Formatting: 8 Best Practices for Better Readability

How to Create Log-Based Metrics to Improve Application Observability

Best Logging Practices: 14 Do’s and Don’ts for Better Logging

Log Formatting: 8 Best Practices for Better Readability

How to Create Log-Based Metrics to Improve Application Observability

Search site

Recipe: How to integrate rsyslog with Kafka and Logstash

Table of contents

Getting the ingredients for the logstash + kafka + rsyslog integration

rsyslog Kafka Output

Setting up Kafka

rsyslog to Logstash via Kafka

rsyslog inputs, templates and queues

rsyslog Kafka Output

Logstash Kafka Input

Logstash Elasticsearch Output

Best Logging Practices: 14 Do’s and Don’ts for Better Logging

Log Formatting: 8 Best Practices for Better Readability

How to Create Log-Based Metrics to Improve Application Observability

Related posts:

Related posts:

Best Logging Practices: 14 Do’s and Don’ts for Better Logging

Log Formatting: 8 Best Practices for Better Readability

How to Create Log-Based Metrics to Improve Application Observability