乐趣区

jmxtrans+influxdb+grafana监控zookeeper实战


本文主要研究一下如何使用 jmxtrans+influxdb+granfa 监控 zookeeper
配置 zookeeper jmx
在 conf 目录下新增 zookeeper-env.sh,并使用 chmod + x 赋予执行权限,内容如下
JMXLOCALONLY=false
JMXDISABLE=false
JMXPORT=8999
JMXAUTH=false
JMXSSL=false
指定 JMXPORT 为 8999
搭建 influxdb 及 grafana
这里使用 docker 构建,如下
docker network create monitoring-network
docker run -d -p 8083:8083 -p 8086:8086 \
–net monitoring-network \
–name influxdb \
influxdb:1.5.4
curl -POST http://192.168.99.100:8086/query –data-urlencode “q=CREATE DATABASE zookeeper”
docker run -d -p 3000:3000 –net monitoring-network –name grafana grafana/grafana:5.2.4
curl ‘http://admin:admin@192.168.99.100:3000/api/datasources’ -X POST \
-H ‘Content-Type: application/json;charset=UTF-8’ \
–data-binary ‘{“name”:”influx”,”type”:”influxdb”,”url”:”http://192.168.99.100:8086″,
“access”:”proxy”,”isDefault”:true,”database”:”zookeeper”,”user”:”admin”,”password”:”admin”}’

创建 monitoring-network,让 docker 中的 influxdb 与 grafana 连通,然后使用 rest api 创建 zookeeper 数据库
创建 grafana 并指定 net 为 monitoring-network,然后使用 rest api 配置数据源

jmxtrans 配置
下载 jmxtrans
wget http://central.maven.org/maven2/org/jmxtrans/jmxtrans/270/jmxtrans-270-all.jar
下载执行脚本
wget -q https://raw.githubusercontent.com/jmxtrans/jmxtrans/master/jmxtrans/jmxtrans.sh
chmod +x jmxtrans.sh
配置 zookeeper.json
{
“servers”: [
{
“port”: 8999,
“host”: “localhost”,
“alias”: “zk”,
“queries”: [
{
“outputWriters” : [
{
“@class” : “com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory”,
“url” : “http://192.168.99.100:8086/”,
“database” : “zookeeper”,
“username” : “admin”,
“password” : “admin”
} ],
“obj”: “org.apache.ZooKeeperService:name0=ReplicatedServer_id*”,
“attr”: [
“QuorumSize”
]
},
{
“outputWriters” : [
{
“@class” : “com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory”,
“url” : “http://192.168.99.100:8086/”,
“database” : “zookeeper”,
“username” : “admin”,
“password” : “admin”
} ],
“obj”: “org.apache.ZooKeeperService:name0=ReplicatedServer_id*,name1=replica*,name2=*”,
“attr”: [
“MaxClientCnxnsPerHost”,
“MaxRequestLatency”,
“AvgRequestLatency”,
“MinRequestLatency”,
“MaxSessionTimeout”,
“MinSessionTimeout”,
“OutstandingRequests”,
“PacketsReceived”,
“PacketsSent”,
“PendingRevalidationCount”,
“TickTime”
]
},
{
“outputWriters” : [
{
“@class” : “com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory”,
“url” : “http://192.168.99.100:8086/”,
“database” : “zookeeper”,
“username” : “admin”,
“password” : “admin”
} ],
“obj”: “org.apache.ZooKeeperService:name0=ReplicatedServer_id*,name1=replica*,name2=*,name3=InMemoryDataTree”,
“attr”: [
“NodeCount”,
“WatchCount”
]
},
{
“outputWriters” : [
{
“@class” : “com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory”,
“url” : “http://192.168.99.100:8086/”,
“database” : “zookeeper”,
“username” : “admin”,
“password” : “admin”
} ],
“obj”: “org.apache.ZooKeeperService:name0=ReplicatedServer_id*,name1=replica*,name2=*,name3=Connections,name4=*,name5=*”,
“resultAlias”: “Clients”,
“attr”: [
“AvgLatency”,
“LastLatency”,
“MaxLatency”,
“MinLatency”,
“OutstandingRequests”,
“PacketsReceived”,
“PacketsSent”
]
},
{
“outputWriters” : [
{
“@class” : “com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory”,
“url” : “http://192.168.99.100:8086/”,
“database” : “zookeeper”,
“username” : “admin”,
“password” : “admin”
} ],
“obj”: “org.apache.ZooKeeperService:name0=StandaloneServer_port*”,
“resultAlias”: “Clients”,
“attr”: [
“AvgLatency”,
“LastLatency”,
“MaxLatency”,
“MinLatency”,
“OutstandingRequests”,
“PacketsReceived”,
“PacketsSent”,
“NumAliveConnections”
]
},
{
“outputWriters” : [
{
“@class” : “com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory”,
“url” : “http://192.168.99.100:8086/”,
“database” : “zookeeper”,
“username” : “admin”,
“password” : “admin”
} ],
“obj”: “java.lang:type=Memory”,
“resultAlias”: “Memory”,
“attr”: [
“HeapMemoryUsage”,
“NonHeapMemoryUsage”
]
},
{
“outputWriters” : [
{
“@class” : “com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory”,
“url” : “http://192.168.99.100:8086/”,
“database” : “zookeeper”,
“username” : “admin”,
“password” : “admin”
} ],
“obj”: “java.lang:type=OperatingSystem”,
“resultAlias”: “Process”,
“attr”: [
“OpenFileDescriptorCount”,
“ProcessCpuLoad”
]
},
{
“outputWriters” : [
{
“@class” : “com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory”,
“url” : “http://192.168.99.100:8086/”,
“database” : “zookeeper”,
“username” : “admin”,
“password” : “admin”
} ],
“obj”: “java.lang:type=Threading”,
“resultAlias”: “Threading”,
“attr”: [
“ThreadCount”
]
}
],
“numQueryThreads”: 2
}
]
}
放在跟 jmxtrans.sh 脚本同一个目录下即可自动加载
启动
JAR_FILE=jmxtrans-270-all.jar ./jmxtrans.sh start
日志输出实例
2018-10-04 11:28:30 [main] level org.quartz.impl.StdSchedulerFactory [StdSchedulerFactory.java:1179] – Using default implementation for ThreadExecutor
2018-10-04 11:28:30 [main] level org.quartz.core.SchedulerSignalerImpl [SchedulerSignalerImpl.java:60] – Initialized Scheduler Signaller of type: class org.quartz.core.SchedulerSignalerImpl
2018-10-04 11:28:30 [main] level org.quartz.core.QuartzScheduler [QuartzScheduler.java:229] – Quartz Scheduler v.1.8.6 created.
2018-10-04 11:28:30 [main] level org.quartz.simpl.RAMJobStore [RAMJobStore.java:139] – RAMJobStore initialized.
2018-10-04 11:28:30 [main] level org.quartz.core.QuartzScheduler [QuartzScheduler.java:255] – Scheduler meta-data: Quartz Scheduler (v1.8.6) ‘ServerScheduler’ with instanceId ‘Mars.local1538623710548’
Scheduler class: ‘org.quartz.core.QuartzScheduler’ – running locally.
NOT STARTED.
Currently in standby mode.
Number of jobs executed: 0
Using thread pool ‘org.quartz.simpl.SimpleThreadPool’ – with 10 threads.
Using job-store ‘org.quartz.simpl.RAMJobStore’ – which does not support persistence. and is not clustered.

2018-10-04 11:28:30 [main] level org.quartz.impl.StdSchedulerFactory [StdSchedulerFactory.java:1324] – Quartz scheduler ‘ServerScheduler’ initialized from an externally opened InputStream.
2018-10-04 11:28:30 [main] level org.quartz.impl.StdSchedulerFactory [StdSchedulerFactory.java:1328] – Quartz scheduler version: 1.8.6
2018-10-04 11:28:30 [main] level org.quartz.core.QuartzScheduler [QuartzScheduler.java:2267] – JobFactory set to: com.googlecode.jmxtrans.guice.GuiceJobFactory@263f04ca
2018-10-04 11:28:30 [main] level com.googlecode.jmxtrans.JmxTransformer [JmxTransformer.java:177] – Starting Jmxtrans on : .
2018-10-04 11:28:30 [main] level org.quartz.core.QuartzScheduler [QuartzScheduler.java:519] – Scheduler ServerScheduler_$_Mars.local1538623710548 started.
2018-10-04 11:28:30 [main] level com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory [InfluxDbWriterFactory.java:121] – Result Tags to write set to: [ResultAttribute(name=typeName), ResultAttribute(name=objDomain), ResultAttribute(name=className), ResultAttribute(name=attributeName)]
2018-10-04 11:28:30 [main] level com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory [InfluxDbWriterFactory.java:102] – Connecting to url: http://192.168.99.100:8086/ as: username: admin
2018-10-04 11:28:31 [main] level com.googlecode.jmxtrans.model.output.InfluxDbWriterFactory [InfluxDbWriterFactory.java:121] – Result Tags to write set to: [ResultAttribute(name=typeName), ResultAttribute(name=objDomain), ResultAttribute(name=className), ResultAttribute(name=attributeName)]
查询 influxdb 数据
docker exec -it influxdb influx
Connected to http://localhost:8086 version 1.5.4
InfluxDB shell version: 1.5.4
> use zookeeper
Using database zookeeper
> show MEASUREMENTS
name: measurements
name
—-
Clients
Memory
Process
Threading
> show series from “Memory”
key

Memory,attributeName=HeapMemoryUsage,className=sun.management.MemoryImpl,hostname=zk,objDomain=java.lang,typeName=type\=Memory
Memory,attributeName=NonHeapMemoryUsage,className=sun.management.MemoryImpl,hostname=zk,objDomain=java.lang,typeName=type\=Memory

之后通过 select * from “Memory” 可以查看具体指标数据
查询到有数据之后,就可以在 grafana 上面进行配置可视化,然后进行监控

小结
对于 zookeeper、kafka 之类应用来说,内置了 jmx,因而其监控可以通过 jmxtrans 进行指标收集转换,然后输出到 influxdb 或者 graphite 或者 prometheus 等,最后通过 grafana 进行可视化及监控报警。
doc

jmxtrans
jmxtrans-270-all.jar
How do i enable remote jmx with port in zookeeper zkServer.cmd
Monitoring Apache Kafka with Grafana / InfluxDB via JMX

退出移动版