Skip to content
share library_books

Monitoring HBase

Integration

Metrics

Metric Name Key Agg Type Description
total max jvm.memory.size.max Max Double Hadoop:service=HBase,name=JvmMetrics#MemMaxM
info jvm.log.info Sum Long Hadoop:service=HBase,name=JvmMetrics#LogInfo
non heap used jvm.nonheap.used Avg Double Hadoop:service=HBase,name=JvmMetrics#MemNonHeapUsedM
error jvm.log.error Sum Long Hadoop:service=HBase,name=JvmMetrics#LogError
heap committed jvm.heap.committed Avg Double Hadoop:service=HBase,name=JvmMetrics#MemHeapCommittedM
heap max jvm.heap.size.max Max Double Hadoop:service=HBase,name=JvmMetrics#MemHeapMaxM
blocked jvm.threads.blocked Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsBlocked
terminated jvm.threads.terminated Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsTerminated
warn jvm.log.warn Sum Long Hadoop:service=HBase,name=JvmMetrics#LogWarn
timed waiting jvm.threads.waiting.timed Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsTimedWaiting
heap used jvm.heap.used Avg Double Hadoop:service=HBase,name=JvmMetrics#MemHeapUsedM
fatal jvm.log.fatal Sum Long Hadoop:service=HBase,name=JvmMetrics#LogFatal
non heap max jvm.nonheap.size.max Avg Double Hadoop:service=HBase,name=JvmMetrics#MemNonHeapMaxM
waiting jvm.threads.waiting Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsWaiting
new jvm.threads.new Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsNew
non heap committed jvm.nonheap.committed Avg Double Hadoop:service=HBase,name=JvmMetrics#MemNonHeapCommittedM
runnable jvm.threads.runnable Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsRunnable
ops hbase.ugi.groups.gets Sum Long Hadoop:service=HBase,name=UgiMetrics#GetGroupsNumOps
failure time hbase.ugi.login.failure.time Sum Double Hadoop:service=HBase,name=UgiMetrics#LoginFailureAvgTime * LoginFailureNumOps
success time hbase.ugi.login.success.time Sum Double Hadoop:service=HBase,name=UgiMetrics#LoginSuccessAvgTime * LoginSuccessNumOps
success ops hbase.ugi.login.success Sum Long Hadoop:service=HBase,name=UgiMetrics#LoginSuccessNumOps
failure ops hbase.ugi.login.failure Sum Long Hadoop:service=HBase,name=UgiMetrics#LoginFailureNumOps
time hbase.ugi.groups.gets.time Sum Double Hadoop:service=HBase,name=UgiMetrics#v * GetGroupsNumOps
syncs hbase.rs.wal.syncs Sum Long Count of syncs the HLog to HDFS.
Hadoop:service=HBase,name=RegionServer,sub=WAL#SyncTime_num_ops
append time hbase.rs.wal.appends.time Sum Double Time an append to the log took.
Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendTime_num_ops * AppendTime_mean
append max time hbase.rs.wal.appends.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendTime_max
sync time hbase.rs.wal.syncs.time Sum Double The time it took to sync the HLog to HDFS.
Hadoop:service=HBase,name=RegionServer,sub=WAL#SyncTime_num_ops * SyncTime_mean
appends hbase.rs.wal.appends Sum Long Count of appends to the log.
Hadoop:service=HBase,name=RegionServer,sub=WAL#appendCount
append size hbase.rs.wal.appends.size Sum Double Size (in bytes) of the data appended to the HLog.
Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_num_ops * AppendSize_mean
append max size hbase.rs.wal.appends.size.max Max Long Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_max
append min time hbase.rs.wal.appends.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendTime_min
appends hbase.rs.wal.appends Sum Long Count of appends to the log.
Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_num_ops
sync max time hbase.rs.wal.syncs.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=WAL#SyncTime_max
appends hbase.rs.wal.appends Sum Long Count of appends to the log.
Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_num_ops
append min size hbase.rs.wal.appends.size.min Min Long Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_min
sync min time hbase.rs.wal.syncs.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=WAL#SyncTime_min
slow appends hbase.rs.wal.appends.slow Sum Long Number of appends that were slow.
Hadoop:service=HBase,name=RegionServer,sub=WAL#slowAppendCount
compaction queue hbase.rs.compaction.queue Avg Long Current depth of the compaction request queue. If increasing, we are falling behind with storefile compaction.
Hadoop:service=HBase,name=RegionServer,sub=Server#compactionQueueLength
compacted cells size hbase.rs.compaction.cells.size Sum Long The total amount of data processed during minor compactions, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#compactedCellsSize
updates blocked time hbase.rs.updates.blocked.time Sum Long Number of MS updates have been blocked so that the memstore can be flushed.
Hadoop:service=HBase,name=RegionServer,sub=Server#updatesBlockedTime
major compacted cells hbase.rs.compaction.major.cells Sum Long The number of cells processed during major compactions.
Hadoop:service=HBase,name=RegionServer,sub=Server#majorCompactedCellsCount
flush queue hbase.rs.compaction.queue.size Avg Long Length of the queue for region flushes.If increasing, we are falling behind with clearing memstores out to HDFS.
Hadoop:service=HBase,name=RegionServer,sub=Server#flushQueueLength
compacted cells hbase.rs.compaction.cells Sum Long The number of cells processed during minor compactions.
Hadoop:service=HBase,name=RegionServer,sub=Server#compactedCellsCount
flushed cells hbase.rs.flushes.cells Sum Long The number of cells flushed to disk.
Hadoop:service=HBase,name=RegionServer,sub=Server#flushedCellsCount
flushed cells size hbase.rs.flushes.cells.size Sum Long The total amount of data flushed to disk, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#flushedCellsSize
major compacted cells size hbase.rs.compaction.major.cells.size Sum Long The total amount of data processed during major compactions, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#majorCompactedCellsSize
assign min time hbase.master.assigns.time.min Min Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#Assign_min
rit count over threshold hbase.master.rit.count.overthreshold Avg Long The number of regions that have been in transition longer than a threshold time (default: 60 seconds).
Hadoop:service=HBase,name=Master,sub=AssignmentManger#ritCountOverThreshold
rit count hbase.master.rit.count Avg Long The number of regions in transition.
Hadoop:service=HBase,name=Master,sub=AssignmentManger#ritCount
assigns hbase.master.assigns Sum Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#Assign_num_ops
assign time hbase.master.assigns.time Sum Double Hadoop:service=HBase,name=Master,sub=AssignmentManger#Assign_mean * Assign_num_ops
bulk assign time hbase.master.assigns.bulk.time Sum Double Hadoop:service=HBase,name=Master,sub=AssignmentManger#BulkAssign_mean * BulkAssign_num_ops
rit oldest age hbase.master.rit.oldest Max Long The age of the longest region in transition.
Hadoop:service=HBase,name=Master,sub=AssignmentManger#ritOldestAge
bulk assign max time hbase.master.assigns.bulk.time.max Max Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#BulkAssign_max
assign max time hbase.master.assigns.time.max Max Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#Assign_max
bulk assign min time hbase.master.assigns.bulk.time.min Min Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#BulkAssign_min
bulk assigns hbase.master.assigns.bulk Sum Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#BulkAssign_num_ops
increment time hbase.rs.ops.increments.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Increment_num_ops * Increment_mean
slow_puts hbase.rs.ops.puts.slow Sum Long The number of Puts that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowPutCount
get max time hbase.rs.ops.gets.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Get_max
append time hbase.rs.ops.appends.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Append_num_ops * Append_mean
deletes hbase.rs.ops.deletes Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Delete_num_ops
replay min time hbase.rs.ops.replays.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Replay_min
mutate min time hbase.rs.ops.mutates.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Mutate_min
get time hbase.rs.ops.gets.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Get_num_ops * Get_mean
increment max time hbase.rs.ops.increments.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Increment_max
slow appends hbase.rs.ops.appends.slow Sum Long The number of Appends that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowAppendCount
mutate max time hbase.rs.ops.mutates.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Mutate_max
get min time hbase.rs.ops.gets.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Get_min
increments hbase.rs.ops.increments Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Increment_num_ops
replay max time hbase.rs.ops.replays.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Replay_max
delete time hbase.rs.ops.deletes.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Delete_num_ops * Delete_mean
slow deletes hbase.rs.ops.deletes.slow Sum Long The number of Deletes that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowDeleteCount
delete max time hbase.rs.ops.deletes.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Delete_max
appends hbase.rs.ops.appends Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Append_num_ops
append max time hbase.rs.ops.append.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Append_max
delete min time hbase.rs.ops.deletes.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Delete_min
mutates hbase.rs.ops.mutates Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Mutate_num_ops
increment min time hbase.rs.ops.increments.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Increment_min
append min time hbase.rs.ops.appends.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Append_min
replay time hbase.rs.ops.replays.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Replay_num_ops * Replay_mean
slow increments hbase.rs.ops.increments.slow Sum Long The number of Increments that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowIncrementCount
replays hbase.rs.ops.replays Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Replay_num_ops
gets hbase.rs.ops.gets Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Get_num_ops
slow gets hbase.rs.ops.gets.slow Sum Long The number of Gets that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowGetCount
mutate time hbase.rs.ops.mutates.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Mutate_num_ops * Mutate_mean
restore max time hbase.snapshots.restore.time.max Max Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotRestoreTime_max
snapshot time hbase.snapshots.time Sum Double Time it takes to finish snapshot().
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotTime_num_ops * SnapshotTime_mean
snapshot max time hbase.snapshots.time.max Max Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotTime_max
clone time hbase.snapshots.clone.time Sum Double Time it takes to finish cloneSnapshot().
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotCloneTime_num_ops * SnapshotCloneTime_mean
snapshots hbase.snapshots Sum Long Count of snapshot() invocations.
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotTime_num_ops
restores hbase.snapshots.restores Sum Long Count of restoreSnapshot() invocations.
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotRestoreTime_num_ops
clone min time hbase.snapshots.clone.time.min Min Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotCloneTime_min
clones hbase.snapshots.clones Sum Long Count of cloneSnapshot() invocations.
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotCloneTime_num_ops
snapshot min time hbase.snapshots.time.min Min Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotTime_min
clone max time hbase.snapshots.clone.time.max Max Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotCloneTime_max
restore time hbase.snapshots.restore.time Sum Double Time it takes to finish restoreSnapshot().
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotRestoreTime_num_ops * SnapshotRestoreTime_mean
restore min time hbase.snapshots.restore.time.min Min Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotRestoreTime_min
meta hlog split max size hbase.master.hlog.meta.splits.size.max Max Long Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitSize_max
hlog split size hbase.master.meta.splits.size Sum Double Size of HLog files being split.
Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitSize_num_ops * HlogSplitSize_mean
hlog split min time hbase.master.hlog.splits.time.min Min Long Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitTime_min
hlog split max size hbase.master.hlog.splits.size.max Max Long Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitSize_max
hlog splits hbase.master.hlog.splits Sum Long Count of HLog.splitLog() invocations.
Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitTime_num_ops
meta hlog split time hbase.master.hlog.meta.splits.time Sum Double Time it takes to finish splitMetaLog().
Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitTime_num_ops * MetaHlogSplitTime_mean
meta hlog split max time hbase.master.hlog.meta.splits.time.max Max Long Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitTime_max
hlog split time hbase.master.hlog.splits.time Sum Double Time it takes to finish HLog.splitLog().
Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitTime_num_ops * HlogSplitTime_mean
hlog split min size hbase.master.hlog.splits.size.min Min Long Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitSize_min
meta hlog splits hbase.master.hlog.meta.splits Sum Long Count of splitMetaLog() invocations.
Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitSize_num_ops
meta hlog split min size hbase.master.hlog.meta.splits.size.min Min Long Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitSize_min
meta hlog split min time hbase.master.hlog.meta.splits.time.min Min Long Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitTime_min
meta hlog splits hbase.master.hlog.meta.splits Sum Long Count of splitMetaLog() invocations.
Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitTime_num_ops
hlog split max time hbase.master.hlog.splits.time.max Max Long Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitTime_max
hlog splits hbase.master.hlog.splits Sum Long Count of HLog.splitLog() invocations.
Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitSize_num_ops
meta hlog split size hbase.master.hlog.meta.splits.size Sum Double Size of hbase:meta HLog files being split.
Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitSize_num_ops * MetaHlogSplitSize_mean
lower limit hbase.rs.memstore.size.limit.lower Avg Long Property 'hbase.regionserver.global.memstore.lowerLimit' value.
static bloom size hbase.rs.static.bloom.size Avg Long Uncompressed size of the static bloom filters.
Hadoop:service=HBase,name=RegionServer,sub=Server#staticBloomSize
store files hbase.rs.stores.files Avg Long Number of Store Files.
Hadoop:service=HBase,name=RegionServer,sub=Server#storeFileCount
store file index size hbase.rs.stores.index.size Avg Long Size of indexes in storefiles on disk.
Hadoop:service=HBase,name=RegionServer,sub=Server#storeFileIndexSize
store file size hbase.rs.stores.files.size Avg Long Size of storefiles being served.
Hadoop:service=HBase,name=RegionServer,sub=Server#storeFileSize
mem store size hbase.rs.memstore.size Avg Long Size of the memstore.
Hadoop:service=HBase,name=RegionServer,sub=Server#memStoreSize
static index size hbase.rs.static.index.size Avg Long Uncompressed size of the static indexes.
Hadoop:service=HBase,name=RegionServer,sub=Server#staticIndexSize
upper limit hbase.rs.memstore.size.limit.upper Avg Long Property 'hbase.regionserver.global.memstore.upperLimit' value.
stores hbase.rs.stores Avg Long Number of Stores.
Hadoop:service=HBase,name=RegionServer,sub=Server#storeCount
regions hbase.rs.regions Avg Long Number of regions.
Hadoop:service=HBase,name=RegionServer,sub=Server#regionCount
active handlers hbase.ipc.handlers.active Avg Long Number of active rpc handlers.
Hadoop:service=HBase,name=IPC,sub=IPC#numActiveHandler
process call time hbase.ipc.handlers.process.calls.time Sum Double Hadoop:service=HBase,name=IPC,sub=IPC#ProcessCallTime_mean * ProcessCallTime_num_ops
process call min time hbase.ipc.process.call.time.min Min Long Hadoop:service=HBase,name=IPC,sub=IPC#ProcessCallTime_min
calls in priority queue hbase.ipc.queue.priority.size Avg Long The number of currently enqueued priority (internal housekeeping) requests.
Hadoop:service=HBase,name=IPC,sub=IPC#numCallsInPriorityQueue
authorization successes hbase.ipc.authorization.successes Sum Long Number of authorization successes.
Hadoop:service=HBase,name=IPC,sub=IPC#authorizationSuccesses
received bytes hbase.ipc.bytes.received Sum Long Number of bytes received.
Hadoop:service=HBase,name=IPC,sub=IPC#receivedBytes
sent bytes hbase.ipc.bytes.sent Sum Long Number of bytes sent.
Hadoop:service=HBase,name=IPC,sub=IPC#sentBytes
authorization failures hbase.ipc.authorization.failures Sum Long Number of authorization failures.
Hadoop:service=HBase,name=IPC,sub=IPC#authorizationFailures
process call max time hbase.ipc.process.call.time.max Max Long Hadoop:service=HBase,name=IPC,sub=IPC#ProcessCallTime_max
calls in replication queue hbase.ipc.queue.replication.size Avg Long Number of calls in the replication call queue.
Hadoop:service=HBase,name=IPC,sub=IPC#numCallsInReplicationQueue
process calls hbase.ipc.handlers.process.calls Sum Long Hadoop:service=HBase,name=IPC,sub=IPC#ProcessCallTime_num_ops
calls in general queue hbase.ipc.queue.size Avg Long The number of currently enqueued user requests.
Hadoop:service=HBase,name=IPC,sub=IPC#numCallsInGeneralQueue
queue call time hbase.ipc.handlers.queue.calls.time Sum Double Hadoop:service=HBase,name=IPC,sub=IPC#QueueCallTime_mean * QueueCallTime_num_ops
queue call min time hbase.ipc.queue.call.time.min Min Long Hadoop:service=HBase,name=IPC,sub=IPC#QueueCallTime_min
queue call max time hbase.ipc.queue.call.time.max Max Long Hadoop:service=HBase,name=IPC,sub=IPC#QueueCallTime_max
queue size hbase.ipc.queue.bytes Avg Long Number of bytes in the call queues.
Hadoop:service=HBase,name=IPC,sub=IPC#queueSize
authentication failures hbase.ipc.authentication.failures Sum Long Number of authentication failures.
Hadoop:service=HBase,name=IPC,sub=IPC#authenticationFailures
queue calls hbase.ipc.handlers.queue.calls Sum Long Hadoop:service=HBase,name=IPC,sub=IPC#QueueCallTime_num_ops
open connections hbase.ipc.connections.open Avg Long The number of open connections at the RPC layer.
Hadoop:service=HBase,name=IPC,sub=IPC#numOpenConnections
authentication successes hbase.ipc.authentication.successes Sum Long Number of authentication successes.
Hadoop:service=HBase,name=IPC,sub=IPC#authenticationSuccesses
total requests hbase.rs.requests Sum Long Total number of requests this RegionServer has answered.
Hadoop:service=HBase,name=RegionServer,sub=Server#totalRequestCount
read requests hbase.rs.requests.read Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#readRequestCount
write requests hbase.rs.requests.write Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#writeRequestCount
applied ops hbase.rs.replication.ops.applied Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#sink.appliedOps
applied batches hbase.rs.replication.batches.applied Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#sink.appliedBatches
balancer min time hbase.master.balancer.time.min Min Long Hadoop:service=HBase,name=Master,sub=Balancer#BalancerCluster_min
ops hbase.master.balancer.ops Avg Long Hadoop:service=HBase,name=Master,sub=Balancer#BalancerCluster_num_ops
time hbase.master.balancer.time Sum Double Hadoop:service=HBase,name=Master,sub=Balancer#BalancerCluster_num_ops * BalancerCluster_mean
misc invocations hbase.master.balancer.invocations Sum Long Hadoop:service=HBase,name=Master,sub=Balancer#miscInvocationCount
balancer max time hbase.master.balancer.time.max Max Long Hadoop:service=HBase,name=Master,sub=Balancer#BalancerCluster_max
flushed cells hbase.rs.mob.flushes.cells Sum Long The number of mob cells flushed to disk.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFlushedCellsCount
scan cells size hbase.rs.mob.scan.cells.size Sum Long The total amount of scanned mob cells, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobScanCellsSize
file cache hits hbase.rs.mob.cache.files.hits Avg Long The hit percent to the mob file cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheHitPercent
file caches hbase.rs.mob.cache.files Avg Long The count of cached mob files.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheCount
compacted into mob cells hbase.rs.mob.compactions.cells.into Sum Long The number of cells moved to mob during compaction.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobCompactedIntoMobCellsCount
file cache accesses hbase.rs.mob.cache.files.accesses Sum Long The count of accesses to the mob file cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheAccessCount
scan cells hbase.rs.mob.scan.cells Sum Long The number of scanned mob cells.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobScanCellsCount
file cache misses hbase.rs.mob.cache.files.misses Sum Long The count of misses to the mob file cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheMissCount
compacted from mob cells hbase.rs.mob.compactions.cells.from Sum Long The number of cells moved from mob during compaction.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobCompactedFromMobCellsCount
compacted into mob cells size hbase.rs.mob.compactions.cells.into.size Sum Long The total amount of cells move to mob during compaction, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobCompactedIntoMobCellsSize
compacted from mob cells size hbase.rs.mob.compactions.cells.from.size Sum Long The total amount of cells move from mob during compaction, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobCompactedFromMobCellsSize
file cache evictions hbase.rs.mob.cache.files.evictions Sum Long The number of items evicted from the mob file cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheEvictedCount
flushes hbase.rs.mob.flushes Sum Long The number of the flushes in mob-enabled stores.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFlushCount
flushed cells size hbase.rs.mob.flushes.cells.size Sum Long The total amount of mob cells flushed to disk, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFlushedCellsSize
local files hbase.rs.files.local Avg Long The percent of HFiles that are stored on the local hdfs data node.
Hadoop:service=HBase,name=RegionServer,sub=Server#percentFilesLocal
hlog files hbase.rs.files.hlog Avg Long The number of write ahead logs not yet archived.
Hadoop:service=HBase,name=RegionServer,sub=Server#hlogFileCount
hlog files size hbase.rs.files.hlog.size Avg Long Size of all HLog Files.
Hadoop:service=HBase,name=RegionServer,sub=Server#hlogFileSize
cluster requests hbase.master.requests Sum Long Hadoop:service=HBase,name=Master,sub=Server#clusterRequests
dead region servers hbase.master.servers.region.dead Avg Double Hadoop:service=HBase,name=Master,sub=Server#numDeadRegionServers
region servers hbase.master.servers.region Avg Double Hadoop:service=HBase,name=Master,sub=Server#numRegionServers
average load hbase.master.load Avg Double Hadoop:service=HBase,name=Master,sub=Server#averageLoad
mutations without wal hbase.ops.mutates.nowal Sum Long Number of mutations that have been sent by clients with the write ahead logging turned off.
Hadoop:service=HBase,name=RegionServer,sub=Server#mutationsWithoutWALCount
mutate failed ops hbase.ops.mutates.failed Sum Long Number of Check and Mutate calls that failed the checks.
Hadoop:service=HBase,name=RegionServer,sub=Server#checkMutateFailedCount
mutate passed ops hbase.ops.mutates.passed Sum Long Number of Check and Mutate calls that passed the checks.
Hadoop:service=HBase,name=RegionServer,sub=Server#checkMutatePassedCount
mutations without wal size hbase.ops.mutates.nowal.size Sum Long Size of data that has been sent by clients with the write ahead logging turned off.
Hadoop:service=HBase,name=RegionServer,sub=Server#mutationsWithoutWALSize
hits hbase.cache.hits Sum Long Count of the hit on the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheHitCount
express hit% hbase.cache.hits.express.rate Avg Long The percent of the time that requests with the cache turned on hit the cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheExpressHitPercent
hit% hbase.cache.hits.rate Avg Long Percent of block cache requests that are hits.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCountHitPercent
cache free size hbase.cache.block.free Avg Long Size of the block cache that is not occupied.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheFreeSize
block cache size hbase.cache.block.size Avg Long Size of the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheSize
miss hbase.cache.misses Sum Long Number of requests for a block that missed the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheMissCount
cache count hbase.cache.block.count Avg Long Number of block in the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheCount
evictions hbase.cache.evictions Sum Long Count of the number of blocks evicted from the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheEvictionCount
shipped batches hbase.rs.replication.batches.shipped Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.shippedBatches
shipped ops hbase.rs.replication.ops.shipped Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.shippedOps
log edits read hbase.rs.replication.log.edits.read Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.logEditsRead
log queue hbase.rs.replication.log.queue Avg Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.sizeOfLogQueue
shipped bytes hbase.rs.replication.batches.shipped.size Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.shippedKBs
log edits filtered hbase.rs.replication.log.edits.filtered Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.logEditsFiltered
log read bytes hbase.rs.replication.log.edits.read.bytes Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.logReadInBytes
hedged read wins hbase.reads.hedged.wins Sum Long The number of times we started a hedged read and a hedged read won.
Hadoop:service=HBase,name=RegionServer,sub=Server#hedgedReadWins
hedged reads hbase.reads.hedged Sum Long The number of times we started a hedged read.
Hadoop:service=HBase,name=RegionServer,sub=Server#hedgedReads
stores hbase.stores Avg Long
flushes hbase.flushes Sum Long
compactions hbase.compactions Sum Long
store files hbase.store.files Avg Long
store file index size hbase.store.index.size Avg Long
sync hbase.fs.sync.latency.max Max Long
read hbase.fs.read.latency.max Max Long
write hbase.fs.write.latency.max Max Long
sync hbase.fs.sync.latency.min Min Long
read hbase.fs.read.latency.min Min Long
write hbase.fs.write.latency.min Min Long
flush queue hbase.flushes.queue.size Avg Double
max hbase.memstore.flushes.time.max Max Long
min hbase.flushes.size.min Min Long
flushes hbase.memstore.flushes Sum Long
max hbase.flushes.size.max Max Long
memstore size hbase.memstore.size Avg Long
min hbase.memstore.flushes.time.min Min Long
min hbase.compactions.time.min Min Long
max hbase.compactions.time.max Max Long
max hbase.compactions.size.max Max Long
compactions hbase.compactions Sum Long
compaction queue hbase.compactions.queue.size Avg Double
min hbase.compactions.size.min Min Long
cache size hbase.cache.block.size Avg Long
miss count hbase.cache.block.misses Sum Long
cache free hbase.cache.block.free Avg Long
hit count hbase.cache.block.hits Sum Long
block cache count hbase.cache.block.count Avg Long
blockCacheHitRatio hbase.cache.block.hits.ratio Avg Long
evicted count hbase.cache.block.evictions Sum Long
blockCacheHitCachingRatio hbase.cache.block.caching.hits.ratio Avg Long
max hbase.splits.time.max Max Long
splits hbase.splits Sum Long
max hbase.splits.size.max Max Long
min hbase.splits.size.min Min Long
splits hbase.splits Sum Long
regions hbase.regions Avg Long

0.98

Metric Name Key Agg Type Description
total max jvm.memory.size.max Max Double Hadoop:service=HBase,name=JvmMetrics#MemMaxM
info jvm.log.info Sum Long Hadoop:service=HBase,name=JvmMetrics#LogInfo
non heap used jvm.nonheap.used Avg Double Hadoop:service=HBase,name=JvmMetrics#MemNonHeapUsedM
error jvm.log.error Sum Long Hadoop:service=HBase,name=JvmMetrics#LogError
heap committed jvm.heap.committed Avg Double Hadoop:service=HBase,name=JvmMetrics#MemHeapCommittedM
heap max jvm.heap.size.max Max Double Hadoop:service=HBase,name=JvmMetrics#MemHeapMaxM
blocked jvm.threads.blocked Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsBlocked
terminated jvm.threads.terminated Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsTerminated
warn jvm.log.warn Sum Long Hadoop:service=HBase,name=JvmMetrics#LogWarn
timed waiting jvm.threads.waiting.timed Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsTimedWaiting
heap used jvm.heap.used Avg Double Hadoop:service=HBase,name=JvmMetrics#MemHeapUsedM
fatal jvm.log.fatal Sum Long Hadoop:service=HBase,name=JvmMetrics#LogFatal
non heap max jvm.nonheap.size.max Avg Double Hadoop:service=HBase,name=JvmMetrics#MemNonHeapMaxM
waiting jvm.threads.waiting Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsWaiting
new jvm.threads.new Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsNew
non heap committed jvm.nonheap.committed Avg Double Hadoop:service=HBase,name=JvmMetrics#MemNonHeapCommittedM
runnable jvm.threads.runnable Avg Long Hadoop:service=HBase,name=JvmMetrics#ThreadsRunnable
ops hbase.ugi.groups.gets Sum Long Hadoop:service=HBase,name=UgiMetrics#GetGroupsNumOps
failure time hbase.ugi.login.failure.time Sum Double Hadoop:service=HBase,name=UgiMetrics#LoginFailureAvgTime * LoginFailureNumOps
success time hbase.ugi.login.success.time Sum Double Hadoop:service=HBase,name=UgiMetrics#LoginSuccessAvgTime * LoginSuccessNumOps
success ops hbase.ugi.login.success Sum Long Hadoop:service=HBase,name=UgiMetrics#LoginSuccessNumOps
failure ops hbase.ugi.login.failure Sum Long Hadoop:service=HBase,name=UgiMetrics#LoginFailureNumOps
time hbase.ugi.groups.gets.time Sum Double Hadoop:service=HBase,name=UgiMetrics#v * GetGroupsNumOps
syncs hbase.rs.wal.syncs Sum Long Count of syncs the HLog to HDFS.
Hadoop:service=HBase,name=RegionServer,sub=WAL#SyncTime_num_ops
append time hbase.rs.wal.appends.time Sum Double Time an append to the log took.
Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendTime_num_ops * AppendTime_mean
append max time hbase.rs.wal.appends.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendTime_max
sync time hbase.rs.wal.syncs.time Sum Double The time it took to sync the HLog to HDFS.
Hadoop:service=HBase,name=RegionServer,sub=WAL#SyncTime_num_ops * SyncTime_mean
appends hbase.rs.wal.appends Sum Long Count of appends to the log.
Hadoop:service=HBase,name=RegionServer,sub=WAL#appendCount
append size hbase.rs.wal.appends.size Sum Double Size (in bytes) of the data appended to the HLog.
Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_num_ops * AppendSize_mean
append max size hbase.rs.wal.appends.size.max Max Long Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_max
append min time hbase.rs.wal.appends.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendTime_min
appends hbase.rs.wal.appends Sum Long Count of appends to the log.
Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_num_ops
sync max time hbase.rs.wal.syncs.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=WAL#SyncTime_max
appends hbase.rs.wal.appends Sum Long Count of appends to the log.
Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_num_ops
append min size hbase.rs.wal.appends.size.min Min Long Hadoop:service=HBase,name=RegionServer,sub=WAL#AppendSize_min
sync min time hbase.rs.wal.syncs.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=WAL#SyncTime_min
slow appends hbase.rs.wal.appends.slow Sum Long Number of appends that were slow.
Hadoop:service=HBase,name=RegionServer,sub=WAL#slowAppendCount
compaction queue hbase.rs.compaction.queue Avg Long Current depth of the compaction request queue. If increasing, we are falling behind with storefile compaction.
Hadoop:service=HBase,name=RegionServer,sub=Server#compactionQueueLength
compacted cells size hbase.rs.compaction.cells.size Sum Long The total amount of data processed during minor compactions, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#compactedCellsSize
updates blocked time hbase.rs.updates.blocked.time Sum Long Number of MS updates have been blocked so that the memstore can be flushed.
Hadoop:service=HBase,name=RegionServer,sub=Server#updatesBlockedTime
major compacted cells hbase.rs.compaction.major.cells Sum Long The number of cells processed during major compactions.
Hadoop:service=HBase,name=RegionServer,sub=Server#majorCompactedCellsCount
flush queue hbase.rs.compaction.queue.size Avg Long Length of the queue for region flushes.If increasing, we are falling behind with clearing memstores out to HDFS.
Hadoop:service=HBase,name=RegionServer,sub=Server#flushQueueLength
compacted cells hbase.rs.compaction.cells Sum Long The number of cells processed during minor compactions.
Hadoop:service=HBase,name=RegionServer,sub=Server#compactedCellsCount
flushed cells hbase.rs.flushes.cells Sum Long The number of cells flushed to disk.
Hadoop:service=HBase,name=RegionServer,sub=Server#flushedCellsCount
flushed cells size hbase.rs.flushes.cells.size Sum Long The total amount of data flushed to disk, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#flushedCellsSize
major compacted cells size hbase.rs.compaction.major.cells.size Sum Long The total amount of data processed during major compactions, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#majorCompactedCellsSize
assign min time hbase.master.assigns.time.min Min Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#Assign_min
rit count over threshold hbase.master.rit.count.overthreshold Avg Long The number of regions that have been in transition longer than a threshold time (default: 60 seconds).
Hadoop:service=HBase,name=Master,sub=AssignmentManger#ritCountOverThreshold
rit count hbase.master.rit.count Avg Long The number of regions in transition.
Hadoop:service=HBase,name=Master,sub=AssignmentManger#ritCount
assigns hbase.master.assigns Sum Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#Assign_num_ops
assign time hbase.master.assigns.time Sum Double Hadoop:service=HBase,name=Master,sub=AssignmentManger#Assign_mean * Assign_num_ops
bulk assign time hbase.master.assigns.bulk.time Sum Double Hadoop:service=HBase,name=Master,sub=AssignmentManger#BulkAssign_mean * BulkAssign_num_ops
rit oldest age hbase.master.rit.oldest Max Long The age of the longest region in transition.
Hadoop:service=HBase,name=Master,sub=AssignmentManger#ritOldestAge
bulk assign max time hbase.master.assigns.bulk.time.max Max Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#BulkAssign_max
assign max time hbase.master.assigns.time.max Max Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#Assign_max
bulk assign min time hbase.master.assigns.bulk.time.min Min Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#BulkAssign_min
bulk assigns hbase.master.assigns.bulk Sum Long Hadoop:service=HBase,name=Master,sub=AssignmentManger#BulkAssign_num_ops
increment time hbase.rs.ops.increments.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Increment_num_ops * Increment_mean
slow_puts hbase.rs.ops.puts.slow Sum Long The number of Puts that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowPutCount
get max time hbase.rs.ops.gets.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Get_max
append time hbase.rs.ops.appends.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Append_num_ops * Append_mean
deletes hbase.rs.ops.deletes Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Delete_num_ops
replay min time hbase.rs.ops.replays.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Replay_min
mutate min time hbase.rs.ops.mutates.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Mutate_min
get time hbase.rs.ops.gets.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Get_num_ops * Get_mean
increment max time hbase.rs.ops.increments.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Increment_max
slow appends hbase.rs.ops.appends.slow Sum Long The number of Appends that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowAppendCount
mutate max time hbase.rs.ops.mutates.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Mutate_max
get min time hbase.rs.ops.gets.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Get_min
increments hbase.rs.ops.increments Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Increment_num_ops
replay max time hbase.rs.ops.replays.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Replay_max
delete time hbase.rs.ops.deletes.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Delete_num_ops * Delete_mean
slow deletes hbase.rs.ops.deletes.slow Sum Long The number of Deletes that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowDeleteCount
delete max time hbase.rs.ops.deletes.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Delete_max
appends hbase.rs.ops.appends Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Append_num_ops
append max time hbase.rs.ops.append.time.max Max Long Hadoop:service=HBase,name=RegionServer,sub=Server#Append_max
delete min time hbase.rs.ops.deletes.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Delete_min
mutates hbase.rs.ops.mutates Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Mutate_num_ops
increment min time hbase.rs.ops.increments.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Increment_min
append min time hbase.rs.ops.appends.time.min Min Long Hadoop:service=HBase,name=RegionServer,sub=Server#Append_min
replay time hbase.rs.ops.replays.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Replay_num_ops * Replay_mean
slow increments hbase.rs.ops.increments.slow Sum Long The number of Increments that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowIncrementCount
replays hbase.rs.ops.replays Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Replay_num_ops
gets hbase.rs.ops.gets Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#Get_num_ops
slow gets hbase.rs.ops.gets.slow Sum Long The number of Gets that took over 1000ms to complete.
Hadoop:service=HBase,name=RegionServer,sub=Server#slowGetCount
mutate time hbase.rs.ops.mutates.time Sum Double Hadoop:service=HBase,name=RegionServer,sub=Server#Mutate_num_ops * Mutate_mean
restore max time hbase.snapshots.restore.time.max Max Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotRestoreTime_max
snapshot time hbase.snapshots.time Sum Double Time it takes to finish snapshot().
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotTime_num_ops * SnapshotTime_mean
snapshot max time hbase.snapshots.time.max Max Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotTime_max
clone time hbase.snapshots.clone.time Sum Double Time it takes to finish cloneSnapshot().
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotCloneTime_num_ops * SnapshotCloneTime_mean
snapshots hbase.snapshots Sum Long Count of snapshot() invocations.
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotTime_num_ops
restores hbase.snapshots.restores Sum Long Count of restoreSnapshot() invocations.
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotRestoreTime_num_ops
clone min time hbase.snapshots.clone.time.min Min Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotCloneTime_min
clones hbase.snapshots.clones Sum Long Count of cloneSnapshot() invocations.
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotCloneTime_num_ops
snapshot min time hbase.snapshots.time.min Min Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotTime_min
clone max time hbase.snapshots.clone.time.max Max Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotCloneTime_max
restore time hbase.snapshots.restore.time Sum Double Time it takes to finish restoreSnapshot().
Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotRestoreTime_num_ops * SnapshotRestoreTime_mean
restore min time hbase.snapshots.restore.time.min Min Long Hadoop:service=HBase,name=Master,sub=Snapshots#SnapshotRestoreTime_min
meta hlog split max size hbase.master.hlog.meta.splits.size.max Max Long Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitSize_max
hlog split size hbase.master.meta.splits.size Sum Double Size of HLog files being split.
Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitSize_num_ops * HlogSplitSize_mean
hlog split min time hbase.master.hlog.splits.time.min Min Long Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitTime_min
hlog split max size hbase.master.hlog.splits.size.max Max Long Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitSize_max
hlog splits hbase.master.hlog.splits Sum Long Count of HLog.splitLog() invocations.
Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitTime_num_ops
meta hlog split time hbase.master.hlog.meta.splits.time Sum Double Time it takes to finish splitMetaLog().
Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitTime_num_ops * MetaHlogSplitTime_mean
meta hlog split max time hbase.master.hlog.meta.splits.time.max Max Long Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitTime_max
hlog split time hbase.master.hlog.splits.time Sum Double Time it takes to finish HLog.splitLog().
Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitTime_num_ops * HlogSplitTime_mean
hlog split min size hbase.master.hlog.splits.size.min Min Long Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitSize_min
meta hlog splits hbase.master.hlog.meta.splits Sum Long Count of splitMetaLog() invocations.
Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitSize_num_ops
meta hlog split min size hbase.master.hlog.meta.splits.size.min Min Long Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitSize_min
meta hlog split min time hbase.master.hlog.meta.splits.time.min Min Long Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitTime_min
meta hlog splits hbase.master.hlog.meta.splits Sum Long Count of splitMetaLog() invocations.
Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitTime_num_ops
hlog split max time hbase.master.hlog.splits.time.max Max Long Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitTime_max
hlog splits hbase.master.hlog.splits Sum Long Count of HLog.splitLog() invocations.
Hadoop:service=HBase,name=Master,sub=FileSystem#HlogSplitSize_num_ops
meta hlog split size hbase.master.hlog.meta.splits.size Sum Double Size of hbase:meta HLog files being split.
Hadoop:service=HBase,name=Master,sub=FileSystem#MetaHlogSplitSize_num_ops * MetaHlogSplitSize_mean
lower limit hbase.rs.memstore.size.limit.lower Avg Long Property 'hbase.regionserver.global.memstore.lowerLimit' value.
static bloom size hbase.rs.static.bloom.size Avg Long Uncompressed size of the static bloom filters.
Hadoop:service=HBase,name=RegionServer,sub=Server#staticBloomSize
store files hbase.rs.stores.files Avg Long Number of Store Files.
Hadoop:service=HBase,name=RegionServer,sub=Server#storeFileCount
store file index size hbase.rs.stores.index.size Avg Long Size of indexes in storefiles on disk.
Hadoop:service=HBase,name=RegionServer,sub=Server#storeFileIndexSize
store file size hbase.rs.stores.files.size Avg Long Size of storefiles being served.
Hadoop:service=HBase,name=RegionServer,sub=Server#storeFileSize
mem store size hbase.rs.memstore.size Avg Long Size of the memstore.
Hadoop:service=HBase,name=RegionServer,sub=Server#memStoreSize
static index size hbase.rs.static.index.size Avg Long Uncompressed size of the static indexes.
Hadoop:service=HBase,name=RegionServer,sub=Server#staticIndexSize
upper limit hbase.rs.memstore.size.limit.upper Avg Long Property 'hbase.regionserver.global.memstore.upperLimit' value.
stores hbase.rs.stores Avg Long Number of Stores.
Hadoop:service=HBase,name=RegionServer,sub=Server#storeCount
regions hbase.rs.regions Avg Long Number of regions.
Hadoop:service=HBase,name=RegionServer,sub=Server#regionCount
active handlers hbase.ipc.handlers.active Avg Long Number of active rpc handlers.
Hadoop:service=HBase,name=IPC,sub=IPC#numActiveHandler
process call time hbase.ipc.handlers.process.calls.time Sum Double Hadoop:service=HBase,name=IPC,sub=IPC#ProcessCallTime_mean * ProcessCallTime_num_ops
process call min time hbase.ipc.process.call.time.min Min Long Hadoop:service=HBase,name=IPC,sub=IPC#ProcessCallTime_min
calls in priority queue hbase.ipc.queue.priority.size Avg Long The number of currently enqueued priority (internal housekeeping) requests.
Hadoop:service=HBase,name=IPC,sub=IPC#numCallsInPriorityQueue
authorization successes hbase.ipc.authorization.successes Sum Long Number of authorization successes.
Hadoop:service=HBase,name=IPC,sub=IPC#authorizationSuccesses
received bytes hbase.ipc.bytes.received Sum Long Number of bytes received.
Hadoop:service=HBase,name=IPC,sub=IPC#receivedBytes
sent bytes hbase.ipc.bytes.sent Sum Long Number of bytes sent.
Hadoop:service=HBase,name=IPC,sub=IPC#sentBytes
authorization failures hbase.ipc.authorization.failures Sum Long Number of authorization failures.
Hadoop:service=HBase,name=IPC,sub=IPC#authorizationFailures
process call max time hbase.ipc.process.call.time.max Max Long Hadoop:service=HBase,name=IPC,sub=IPC#ProcessCallTime_max
calls in replication queue hbase.ipc.queue.replication.size Avg Long Number of calls in the replication call queue.
Hadoop:service=HBase,name=IPC,sub=IPC#numCallsInReplicationQueue
process calls hbase.ipc.handlers.process.calls Sum Long Hadoop:service=HBase,name=IPC,sub=IPC#ProcessCallTime_num_ops
calls in general queue hbase.ipc.queue.size Avg Long The number of currently enqueued user requests.
Hadoop:service=HBase,name=IPC,sub=IPC#numCallsInGeneralQueue
queue call time hbase.ipc.handlers.queue.calls.time Sum Double Hadoop:service=HBase,name=IPC,sub=IPC#QueueCallTime_mean * QueueCallTime_num_ops
queue call min time hbase.ipc.queue.call.time.min Min Long Hadoop:service=HBase,name=IPC,sub=IPC#QueueCallTime_min
queue call max time hbase.ipc.queue.call.time.max Max Long Hadoop:service=HBase,name=IPC,sub=IPC#QueueCallTime_max
queue size hbase.ipc.queue.bytes Avg Long Number of bytes in the call queues.
Hadoop:service=HBase,name=IPC,sub=IPC#queueSize
authentication failures hbase.ipc.authentication.failures Sum Long Number of authentication failures.
Hadoop:service=HBase,name=IPC,sub=IPC#authenticationFailures
queue calls hbase.ipc.handlers.queue.calls Sum Long Hadoop:service=HBase,name=IPC,sub=IPC#QueueCallTime_num_ops
open connections hbase.ipc.connections.open Avg Long The number of open connections at the RPC layer.
Hadoop:service=HBase,name=IPC,sub=IPC#numOpenConnections
authentication successes hbase.ipc.authentication.successes Sum Long Number of authentication successes.
Hadoop:service=HBase,name=IPC,sub=IPC#authenticationSuccesses
total requests hbase.rs.requests Sum Long Total number of requests this RegionServer has answered.
Hadoop:service=HBase,name=RegionServer,sub=Server#totalRequestCount
read requests hbase.rs.requests.read Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#readRequestCount
write requests hbase.rs.requests.write Sum Long Hadoop:service=HBase,name=RegionServer,sub=Server#writeRequestCount
applied ops hbase.rs.replication.ops.applied Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#sink.appliedOps
applied batches hbase.rs.replication.batches.applied Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#sink.appliedBatches
balancer min time hbase.master.balancer.time.min Min Long Hadoop:service=HBase,name=Master,sub=Balancer#BalancerCluster_min
ops hbase.master.balancer.ops Avg Long Hadoop:service=HBase,name=Master,sub=Balancer#BalancerCluster_num_ops
time hbase.master.balancer.time Sum Double Hadoop:service=HBase,name=Master,sub=Balancer#BalancerCluster_num_ops * BalancerCluster_mean
misc invocations hbase.master.balancer.invocations Sum Long Hadoop:service=HBase,name=Master,sub=Balancer#miscInvocationCount
balancer max time hbase.master.balancer.time.max Max Long Hadoop:service=HBase,name=Master,sub=Balancer#BalancerCluster_max
flushed cells hbase.rs.mob.flushes.cells Sum Long The number of mob cells flushed to disk.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFlushedCellsCount
scan cells size hbase.rs.mob.scan.cells.size Sum Long The total amount of scanned mob cells, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobScanCellsSize
file cache hits hbase.rs.mob.cache.files.hits Avg Long The hit percent to the mob file cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheHitPercent
file caches hbase.rs.mob.cache.files Avg Long The count of cached mob files.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheCount
compacted into mob cells hbase.rs.mob.compactions.cells.into Sum Long The number of cells moved to mob during compaction.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobCompactedIntoMobCellsCount
file cache accesses hbase.rs.mob.cache.files.accesses Sum Long The count of accesses to the mob file cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheAccessCount
scan cells hbase.rs.mob.scan.cells Sum Long The number of scanned mob cells.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobScanCellsCount
file cache misses hbase.rs.mob.cache.files.misses Sum Long The count of misses to the mob file cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheMissCount
compacted from mob cells hbase.rs.mob.compactions.cells.from Sum Long The number of cells moved from mob during compaction.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobCompactedFromMobCellsCount
compacted into mob cells size hbase.rs.mob.compactions.cells.into.size Sum Long The total amount of cells move to mob during compaction, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobCompactedIntoMobCellsSize
compacted from mob cells size hbase.rs.mob.compactions.cells.from.size Sum Long The total amount of cells move from mob during compaction, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobCompactedFromMobCellsSize
file cache evictions hbase.rs.mob.cache.files.evictions Sum Long The number of items evicted from the mob file cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFileCacheEvictedCount
flushes hbase.rs.mob.flushes Sum Long The number of the flushes in mob-enabled stores.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFlushCount
flushed cells size hbase.rs.mob.flushes.cells.size Sum Long The total amount of mob cells flushed to disk, in bytes.
Hadoop:service=HBase,name=RegionServer,sub=Server#mobFlushedCellsSize
local files hbase.rs.files.local Avg Long The percent of HFiles that are stored on the local hdfs data node.
Hadoop:service=HBase,name=RegionServer,sub=Server#percentFilesLocal
hlog files hbase.rs.files.hlog Avg Long The number of write ahead logs not yet archived.
Hadoop:service=HBase,name=RegionServer,sub=Server#hlogFileCount
hlog files size hbase.rs.files.hlog.size Avg Long Size of all HLog Files.
Hadoop:service=HBase,name=RegionServer,sub=Server#hlogFileSize
cluster requests hbase.master.requests Sum Long Hadoop:service=HBase,name=Master,sub=Server#clusterRequests
dead region servers hbase.master.servers.region.dead Avg Double Hadoop:service=HBase,name=Master,sub=Server#numDeadRegionServers
region servers hbase.master.servers.region Avg Double Hadoop:service=HBase,name=Master,sub=Server#numRegionServers
average load hbase.master.load Avg Double Hadoop:service=HBase,name=Master,sub=Server#averageLoad
mutations without wal hbase.ops.mutates.nowal Sum Long Number of mutations that have been sent by clients with the write ahead logging turned off.
Hadoop:service=HBase,name=RegionServer,sub=Server#mutationsWithoutWALCount
mutate failed ops hbase.ops.mutates.failed Sum Long Number of Check and Mutate calls that failed the checks.
Hadoop:service=HBase,name=RegionServer,sub=Server#checkMutateFailedCount
mutate passed ops hbase.ops.mutates.passed Sum Long Number of Check and Mutate calls that passed the checks.
Hadoop:service=HBase,name=RegionServer,sub=Server#checkMutatePassedCount
mutations without wal size hbase.ops.mutates.nowal.size Sum Long Size of data that has been sent by clients with the write ahead logging turned off.
Hadoop:service=HBase,name=RegionServer,sub=Server#mutationsWithoutWALSize
hits hbase.cache.hits Sum Long Count of the hit on the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheHitCount
express hit% hbase.cache.hits.express.rate Avg Long The percent of the time that requests with the cache turned on hit the cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheExpressHitPercent
hit% hbase.cache.hits.rate Avg Long Percent of block cache requests that are hits.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCountHitPercent
cache free size hbase.cache.block.free Avg Long Size of the block cache that is not occupied.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheFreeSize
block cache size hbase.cache.block.size Avg Long Size of the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheSize
miss hbase.cache.misses Sum Long Number of requests for a block that missed the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheMissCount
cache count hbase.cache.block.count Avg Long Number of block in the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheCount
evictions hbase.cache.evictions Sum Long Count of the number of blocks evicted from the block cache.
Hadoop:service=HBase,name=RegionServer,sub=Server#blockCacheEvictionCount
shipped batches hbase.rs.replication.batches.shipped Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.shippedBatches
shipped ops hbase.rs.replication.ops.shipped Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.shippedOps
log edits read hbase.rs.replication.log.edits.read Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.logEditsRead
log queue hbase.rs.replication.log.queue Avg Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.sizeOfLogQueue
shipped bytes hbase.rs.replication.batches.shipped.size Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.shippedKBs
log edits filtered hbase.rs.replication.log.edits.filtered Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.logEditsFiltered
log read bytes hbase.rs.replication.log.edits.read.bytes Sum Long Hadoop:service=HBase,name=RegionServer,sub=Replication#source.logReadInBytes
hedged read wins hbase.reads.hedged.wins Sum Long The number of times we started a hedged read and a hedged read won.
Hadoop:service=HBase,name=RegionServer,sub=Server#hedgedReadWins
hedged reads hbase.reads.hedged Sum Long The number of times we started a hedged read.
Hadoop:service=HBase,name=RegionServer,sub=Server#hedgedReads

0.94

Metric Name Key Agg Type Description
stores hbase.stores Avg Long
flushes hbase.flushes Sum Long
compactions hbase.compactions Sum Long
store files hbase.store.files Avg Long
store file index size hbase.store.index.size Avg Long
sync hbase.fs.sync.latency.max Max Long
read hbase.fs.read.latency.max Max Long
write hbase.fs.write.latency.max Max Long
sync hbase.fs.sync.latency.min Min Long
read hbase.fs.read.latency.min Min Long
write hbase.fs.write.latency.min Min Long
flush queue hbase.flushes.queue.size Avg Double
max hbase.memstore.flushes.time.max Max Long
min hbase.flushes.size.min Min Long
flushes hbase.memstore.flushes Sum Long
max hbase.flushes.size.max Max Long
memstore size hbase.memstore.size Avg Long
min hbase.memstore.flushes.time.min Min Long
min hbase.compactions.time.min Min Long
max hbase.compactions.time.max Max Long
max hbase.compactions.size.max Max Long
compactions hbase.compactions Sum Long
compaction queue hbase.compactions.queue.size Avg Double
min hbase.compactions.size.min Min Long
cache size hbase.cache.block.size Avg Long
miss count hbase.cache.block.misses Sum Long
cache free hbase.cache.block.free Avg Long
hit count hbase.cache.block.hits Sum Long
block cache count hbase.cache.block.count Avg Long
blockCacheHitRatio hbase.cache.block.hits.ratio Avg Long
evicted count hbase.cache.block.evictions Sum Long
blockCacheHitCachingRatio hbase.cache.block.caching.hits.ratio Avg Long
max hbase.splits.time.max Max Long
splits hbase.splits Sum Long
max hbase.splits.size.max Max Long
min hbase.splits.size.min Min Long
splits hbase.splits Sum Long
regions hbase.regions Avg Long

FAQ

How do I enable JMX in HBase

Please see HBase Metrics page for instructions.

Do I need to add a separate SPM Application for each HBase server/node I want to monitor

No, one Application is enough. Think of an SPM "Application" as a "HBase Cluster". Thus, to monitor N HBase servers that belong to the same cluster you would create just a single SPM Applicationand use its Token in SPM configuration file on all HBase servers that are a part of this cluster.

Why don't some HBase metrics graphs have any data

There could be 2 possible reasons:

  1. Some metrics are for RegionServers (HBase slaves), some for HBase Master. Thus, if you select the Master node in the UI, graphs that contain Slave-specific metrics will be blank and vice-versa.
  2. Different versions of HBase provide different metrics. Thus, if you have an older version of HBase, it may not be providing all metrics that SPM collects and graphs.

Which versions of HBase does SPM support

SPM has been tested with HBase 0.90, 0.92, 0.94, and 0.98, but will work for newer versions, including all CDH versions.