Common Schema

Tags are sent by the Sematext Agent as part of every data point or log event and they are shown in UI as filters.

The following tags should not be used as custom tags or in in App agent YAMLs.

Common Tags¶

The tags below are applicable to all metrics/logs types:

Tag Name	Description	Synonymous Tags
os.host	Hostname of the host where the agent is running	host, hostname, host.id, host.name, server.name
token	Sematext App Token
measurement	Reserved as per Influx Line Protocol
tag.alias.type	Denotes the Tag Alias type

Logs Tags¶

Refer to this list of default fields for Logs Apps. The tags below are applicable to all logs types:

Tag Name	Description	Synonymous Tags
os.host	A single-valued field and should contain the ID, typically a hostname, of the device or server sending logs.
source	A single-valued field and should contain the ID or descriptor of where the data is coming from. For example, this could be a file name or even a full path to a filename, or the name of the application or process
message	A string field that can contain any sort of text (usually the original log line or some other free text).
@timestamp	A date field, on which log retention is based. If it's not present, it will be added automatically when the log event is received by Sematext. See Supported Date Formats.
facility	A single-valued field used by syslog to indicate the facility level. Sematext stores the keyword values of these levels (such as user or auth).
severity	A single-valued field and should contain the log level, such as error or info.
syslog-tag	A single-valued field used by syslog to indicate the name and the PID of the application generating the event (for example, httpd[215]:).
tags	A multi-valued array field that can contain zero or more tags (for example, `"tags": ["solr", "search"]`). When searching for logs with these tags you can filter by using `tags:solr AND tags:search`. Tags can also contain multiple tokens separated by space, but they are still treated as a single value (for example. `"tags": ["solr search"]`).
geo.ip	IP address defining the location associated with the log line
geo.location	A latitude and longitude defining the location associated with the log line
geo.city_name	The city associated with the log line
geo.region	The region associated with the log line
geo.region_iso_code	The region ISO code assiciated with the region in the log line
geo.country_name	The country associated with the log line
geo.country_iso_code	The ISO code associated with the country in the log line
geo.continent_name	The continent associated with the log line
request.id	Synthetics request ID unique to each monitor run. Learn more
error.id	A reserved field for errors
error.message	A reserved field for errors
error.type	A reserved field for errors
span.id	Building block of a trace in distributed tracing
trace.id	Building block of a trace in distributed tracing

All of these fields are optional, but their use is strongly encouraged. If found in logs with low-enough cardinality, all distinct values of these fields will be loaded and shown in the UI as filters and thus allowing one to very quickly narrow down the search.

Operating System Tags¶

Below are the OS related tags sent as part of OS metrics/logs. They are collected from the host the Sematext Agent is running on. They're mapped to the os.host tag.

Tag Name	Description
os.disk	Human readable name of the block device
os.disk.mountpoint	Mount point for the disk in the file system
os.disk.fs.type	Type of the file system associated with the device
os.network	Name of the network interface
os.uuid	Unique ID based on SMBIOS specification
os.distro.name	Distribution name of the OS. e.g. `ubuntu`
os.distro.version	Version of the OS. e.g. `16.04`
os.kernel	Version of the Kernel. e.g. `4.4.0-130-generic`
os.name	Name of the OS. e.g. `linux`
os.arch	Name of the architecture e.g. `amd64`
jvm.version	Version of JVM, if available in `PATH`
virtualization	Virtualization Type. Possible values are `BareMetal`, `VM`, `Container`
-	All user-defined container labels. These tag are mapped to `os.host` and `container.id`

Cloud Tags¶

The cloud metadata from AWS, Azure and GCE instances are collected as tags. They're mapped to the os.host tag.

Tag Name	Description	Supported Cloud Providers
cloud.type	Provider Type	AWS, GCE, Azure
cloud.instance.id	Instance Identifier	AWS, GCE, Azure
cloud.instance.name	Instance Name	Azure, GCE
cloud.instance.type	Instance Type	AWS, GCE, Azure
cloud.region	Region	AWS, Azure
cloud.zone	Availability Zone	AWS, GCE
cloud.project	Project Identifier	GCE
-	User-defined tags	AWS, GCE, Azure

To collect user-defined cloud tags from AWS, Azure or GCE environment you need to define the IAM roles listed below:

AWS - EC2 Instances should be created with AWS IAM Role that has policy AmazonEC2ReadOnlyAccess. See AWS/EC2 User Guide for more info.
Azure - To fetch resource tags for Virtual Machines, you need to grant Reader role to its Resource Group in Azure Resource Manager. See Access Azure Resource Manager API for more info.
GCE - In GCE user-defined tags are called labels. To read labels, the instance needs roles/compute.viewer IAM role. See Granting Roles to Service Accounts for more info.

Cloud tags collection is enabled by default. To disable Cloud tags collection set cloud.metadata-enabled to false in /opt/spm/properties/st-agent.yml and restart Sematext Agent using sudo service sematext-agent restart.

Process Tags¶

Below are Process related tags sent as part of process info:

Tag Name	Description	Synonymous Tags
process.name	Process/program name
process.pid	Process identifier
process.ppid	Parent process identifier
process.type	Process type. An example of a process type can be `master` or `child` for Node.js processes.

Network Traffic Stats Tags¶

Below are the tags sent as part of Network Traffic Status info:

Tag Name	Description	Synonymous Tags
network.address	Local address of the network connection
network.destination.address	Remote host address of the network connection
network.destination.port	Remote port
network.protocol	Protocol name (TCP or UDP)
network.outgoing	Determines whether the connection is incoming (local process serves the connection) or outgoing (local process connects to remote server)

Container Tags¶

Below are container related tags sent as part of metrics/logs in the container environment:

Tag Name	Description	Synonymous Tags
container.type	Type of container engine (e.g. docker, rkt, crio)
container.name	Container name
container.id	Container identifier
container.image.name	Container image name
container.image.tag	Container image tag
container.hostname	Hostname of the container being monitored
container.host.hostname	Hostname of the host where the container is running

Kubernetes Tags¶

Below are Kubernetes related tags sent as part of metrics/logs in the Kubernetes environment.

Tag Name	Description	Synonymous Tags
kubernetes.pod.name	Name of the kubernetes pod	pod
kubernetes.pod.ip	IP of the kubernetes pod
kubernetes.pod.uid	Unique identifier of the kubernetes pod
kubernetes.node.name	Node name where the pod is running
kubernetes.cluster.name	Kubernetes cluster name
kubernetes.deployment.name	Kubernetes deployment name
kubernetes.namespace	Kubernetes namespace
kubernetes.pvc.name	Kubernetes Persistent Volume Claim name
kubernetes.pod.controlledby	Controller of the pod (deployment or DaemonSet or StatefulSet)

Serverless Tags¶

Below are Serverless related tags sent as part of metrics/logs in the Serverless environment:

Tag Name	Description	Synonymous Tags
function.name	Name of the Lambda function
function.version	Version of the Lambda function
function.request.id	Identifier of the invocation request

Synthetic Monitoring Tags¶

Below are tags sent as part of metrics in Sematext Synthetic Monitoring:

Tag Name	Description	Synonymous Tags
synthetics.monitor.id	Identifier of the Monitor
synthetics.run.id	Identifier of the Monitor run
synthetics.location.id	Identifier of the Monitor location
synthetics.content.type	Type of the resource content
synthetics.domain	The resource URL of the domain
synthetics.response.code	The resource HTTP response status code

Other Special Tags¶

Below are tags that are reserved for future use:

Tag Name	Description	Synonymous Tags
os.host.ip	IP Address of the host/server	server.ip, server.address, host.ip, host.address, source.ip, source.address
service.name	Name of the service where the data is collected from
service.id	Unique service identifier
service.type	Service type e.g. `hadoop`
span.id	Building block of a trace in distributed tracing
trace.id	Building block of a trace in distributed tracing

Events Tags¶

When you create an event you can send a JSON document which consists of multiple fields. Each field can contain event main information or metadata. Even though there is no strict format of such a JSON document we recommend to some of fields. An event can contain the following set of fields, most of which are optional:

Field Name	Field Type	Required	Notes
`timestamp`	date	no	Time when event happened (if not specified,current time will be assumed). The format is dateOptionalTime e.g.: `2014-02-17T21:37:04+0100` or`2014-02-17T14:15:01.534471+02:00`.
`os.host`	string	no	Name of the host where the event has occurred.
`type`	string	yes	Event type which could be e.g. `alert`, `deployment`, etc. Events are later grouped in timeline based on event type which significantly improves visibility.
`message`	string	yes	Short description of event, e.g. `Elasticsearch node03 on host somehost06 restarted`. This is a default search field in Sematext UI, so it is good to keep it concise, but search-friendly. Data in this field can be stored in Markdown format to make your messages more pretty and easier to read. For more details see.
`title`	string	no	Event title, can be used as a short label for event, e.g. `Elasticsearch restart`.
`tags`	string array	no	Multivalued field. Each tag should be specified as aseparate array element e.g., `"tags":[ "elasticsearch", "restart", "emergency fix"]`
`severity`	string	no	A single-valued field which says what kind of an event it is. It should have such values as `error`, `info` or `warning` and lets you easily navigate through important and less important events.
`creator`	string	no	Person, application, or component that created an event. E.g. `John Smith`, `Elasticsearch`, `Some Batch Job`
`data`	string	no	Additional event data. It can be anything you may find useful to have along inside of event object. E.g., it could be stacktrace in case of `app_error` event, base64 encoded content of file, etc.

Container Event Tags¶

Container events track the activities and changes happening within a Container environment.

Field Name	Field Type	Required	Notes
`title`	string	no	Short title for this event
`message`	string	no	Human-readable description of the event
`container.name`	string	no	Name of the container associated with the event
`container.id`	string	no	ID of the container associated with the event
`container.image`	string	no	The image used to create the container
`container.object`	string	no	The object type associated with the event
`container.from`	string	no	The parent image from which the container was created
`container.status`	string	no	The status of the container associated with the event

Kubernetes Event Tags¶

Kubernetes events show what's happening inside a cluster, node, pod, or container.

Field Name	Field Type	Required	Notes
`title`	string	no	Short title for this event
`message`	string	no	Human-readable description of the status of this operation
`kubernetes.namespace`	string	no	Namespace of the resource that originated the event
`kubernetes.name`	string	no	Name of the resource associated with this event
`kubernetes.cluster.name`	string	no	The cluster name of the resource that originated the event
`kubernetes.reason`	string	no	Reason for the transition into the object's current status
`kubernetes.kind`	string	no	Identifier of Kubernetes resource
`kubernetes.node`	string	no	Name of Kubelet node
`tags`	[]string	no	List of custom tags for Kubernetes event

When you ship events to Sematext Cloud, we recommend using the common fields listed in this page. This way you can easily correlate between your events, metrics and logs. For example, when you see CPU usage spikes or start seeing more error logs and you want to investigate it further. You can use Split Screen and load the Events in the right half of the screen and see if there was any deployment type of event that might be the cause of that spike. Or search for events that are shipped from a specific host that started using more resources than expected. This way you can pinpoint the source of the problems easily.

Types¶

Sematext Cloud helps you manage your events better and distinguish between different kinds of events when you provide an event type. There are no limitations to the number of possible values of type field. To get the most value out of typed events we strongly suggest using a smaller number of distinct event types (1-10) to keep things manageable. Keeping the list of unique types concise help with faster navigation. Examples of recommended types: alert, server-info, deployment, infra, outage.

Note: when using curl to call the Events API, you may experience "SSL certificate problem" errors. The reason is that curl doesn't bundle any CA certs any more. For more info see this. Regardless of curl errors, HTTPS communication should be successful.

Example of well defined event¶

{
  "timestamp": "2024-05-30T09:58:43.455Z",
  "creator": "Jenkins",
  "os.host": "jenkins-host",
  "title": "Starting deployment",
  "message": "Started deployment of Test v1.23 to production",
  "severity": "info",
  "type": "deployment",
  "tags": ["version:1.23", "env:prod"],
}

Markdown in events¶

Event message supports markdown.

An example below. Notice message field which contains a text formatted with Markdown.

{
  "title": "Hello Sematext",
  "message": "### Hello Sematext\nClick [link](https://sematext.com/ \"Sematext\") \n",
  "tags": ["msg", "env:dev"],
  "severity": "info"
}

Markdown