共计 6880 个字符,预计需要花费 18 分钟才能阅读完成。
打造一套 NGINX 监控
0. 前文
笔者之前无心查看 nginx 的拜访日志 access.log
文件,发现有很多的像 /phpMyadmin/index.php
这种拜访,IP 归属地来自世界各地,应该有人在用肉鸡在始终扫描服务器破绽,就萌发了想打造一套 nginx 监控零碎的想法。我的要求很简略,只有能在网站上实时查看日志就行,不必每次去服务器查看。最开始我想到的是 ELK
这套解决方案,我之前我理解过,然而前面想了一下,ELK
有点太重了,次要是外围的部件 Elasticsearch
是用 Java
写的,比拟占资源,用在我这里有点杀鸡用牛刀的感觉(服务器扛不住)。前面在网上查找的时候,理解到了 loki
+promtail
+grafana
的这套轻量级解决方案,便决定部署来试试看。
以下操作应用的是 Centos 7 进行演示
1. 装置 nginx
编译 nginx,加载 ngx_http_geoip2_module 模块,获取拜访 ip 的 geoip2 区域信息
1.1 下载、编译 nginx
# 装置编译环境
$ yum -y install gcc pcre pcre-devel zlib zlib-devel openssl openssl-devel
# 下载安装 libmaxminddb
$ wget https://github.com/maxmind/libmaxminddb/releases/download/1.3.2/libmaxminddb-1.3.2.tar.gz
$ tar -zxvf libmaxminddb-1.3.2.tar.gz
$ cd libmaxminddb-1.3.2
$ ./configure && make && make install
$ echo /usr/local/lib >> /etc/ld.so.conf.d/local.conf
$ ldconfig
# 下载 ngx_http_geoip2_module 模块
$ yum install -y git
$ git clone https://github.com/leev/ngx_http_geoip2_module
# 下载编译 nginx
$ wget https://nginx.org/download/nginx-1.18.0.tar.gz
$ tar -zxvf nginx-1.18.0.tar.gz
$ cd nginx-1.18.0
# 如果之前曾经装置过 nginx,能够用 /path/to/nginx/sbin/nginx - V 查看 nginx 编译参数
# /root/ngx_http_geoip2_module 换成你本人的门路
$ ./configure --prefix=/usr/local/nginx --with-http_ssl_module --add-module=/root/ngx_http_geoip2_module
$ make && make install
1.2 下载 GeoLite2 数据库
maxmind 官网下载,能下载最新的,但须要注册登录
笔者提供的,不是最新的,但应该也够用
GeoLite2-City_20220705.tar.gz
GeoLite2-Country_20220705.tar.gz
$ tar -zxvf GeoLite2-City_20220705.tar.gz
$ mv ./GeoLite2-City_20220705/GeoLite2-City.mmdb /usr/local/nginx/conf
$ tar -zxvf GeoLite2-Country_20220705.tar.gz
$ mv ./GeoLite2-Country_20220705/GeoLite2-Country.mmdb /usr/local/nginx/conf
1.3 配置 nginx
须要将日志格局配置成 JSON,外面蕴含了 geoip_country_code
http {
include mime.types;
default_type application/octet-stream;
log_format json_analytics '{'
'"msec": "$msec", '# request unixtime in seconds with a milliseconds resolution'"connection":"$connection",' # connection serial number
'"connection_requests": "$connection_requests", '# number of requests made in connection'"pid":"$pid",' # process pid
'"request_id": "$request_id", '# the unique request id'"request_length":"$request_length",' # request length (including headers and body)
'"remote_addr": "$remote_addr", '# client IP'"remote_user":"$remote_user",' # client HTTP username
'"remote_port": "$remote_port", '# client port'"time_local":"$time_local",'
'"time_iso8601": "$time_iso8601", '# local time in the ISO 8601 standard format'"request":"$request",' # full path no arguments if the request
'"request_uri": "$request_uri", '# full path and arguments if the request'"args":"$args",' # args
'"status": "$status", '# response status code'"body_bytes_sent":"$body_bytes_sent",' # the number of body bytes exclude headers sent to a client
'"bytes_sent": "$bytes_sent", '# the number of bytes sent to a client'"http_referer":"$http_referer",' # HTTP referer
'"http_user_agent": "$http_user_agent", '# user agent'"http_x_forwarded_for":"$http_x_forwarded_for",' # http_x_forwarded_for
'"http_host": "$http_host", '# the request Host: header'"server_name":"$server_name",' # the name of the vhost serving the request
'"request_time": "$request_time", '# request processing time in seconds with msec resolution'"upstream":"$upstream_addr",' # upstream backend server for proxied requests
'"upstream_connect_time": "$upstream_connect_time", '# upstream handshake time incl. TLS'"upstream_header_time":"$upstream_header_time",' # time spent receiving upstream headers
'"upstream_response_time": "$upstream_response_time", '# time spend receiving upstream body'"upstream_response_length":"$upstream_response_length",' # upstream response length
'"upstream_cache_status": "$upstream_cache_status", '# cache HIT/MISS where applicable'"ssl_protocol":"$ssl_protocol",' # TLS protocol
'"ssl_cipher": "$ssl_cipher", '# TLS cipher'"scheme":"$scheme",' # http or https
'"request_method": "$request_method", '# request method'"server_protocol":"$server_protocol",' # request protocol, like HTTP/1.1 or HTTP/2.0
'"pipe": "$pipe", '#"p"if request was pipelined,"."otherwise'"gzip_ratio":"$gzip_ratio",'
'"http_cf_ray": "$http_cf_ray",''"geoip_country_code":"$geoip2_data_country_code"''}';
access_log logs/json_access.log json_analytics;
geoip2 /usr/local/nginx/conf/GeoLite2-Country.mmdb {
$geoip2_data_country_code country iso_code;
$geoip2_data_country_name country names en;
}
geoip2 /usr/local/nginx/conf/GeoLite2-City.mmdb {
$geoip2_data_city_name city names en;
$geoip2_data_province_name subdivisions 0 names en;
}
}
1.4 启动 nginx
$ /usr/local/nginx/sbin/nginx
2. 装置 loki、promtail、grafana
应用 docker 部署,须要提前装置好 docker,这里不再赘述,具体参见 docker 官网
docker 官网
2.1 loki 配置文件
auth_enabled: false
server:
http_listen_port: 3100
grpc_listen_port: 9096
common:
path_prefix: /tmp/loki
storage:
filesystem:
chunks_directory: /tmp/loki/chunks
rules_directory: /tmp/loki/rules
replication_factor: 1
ring:
instance_addr: 127.0.0.1
kvstore:
store: inmemory
schema_config:
configs:
- from: 2020-10-24
store: boltdb-shipper
object_store: filesystem
schema: v11
index:
prefix: index_
period: 24h
ruler:
alertmanager_url: http://localhost:9093
frontend:
# Maximum number of outstanding requests per tenant per frontend; requests
# beyond this error with HTTP 429.
# CLI flag: -querier.max-outstanding-requests-per-tenant
max_outstanding_per_tenant: 2048 # default = 100
query_range:
# Perform query parallelisations based on storage sharding configuration and
# query ASTs. This feature is supported only by the chunks storage engine.
# CLI flag: -querier.parallelise-shardable-queries
parallelise_shardable_queries: false # default = true
compactor:
retention_enabled: true
retention_delete_delay: 1m
delete_request_cancel_period: 1m
也可从官网下载模板
$ wget https://raw.githubusercontent.com/grafana/loki/v2.6.1/cmd/loki/loki-local-config.yaml -O loki-config.yaml
2.2 promtail 配置文件
server:
http_listen_port: 9080
grpc_listen_port: 0
positions:
filename: /tmp/positions.yaml
clients:
- url: http://loki:3100/loki/api/v1/push
scrape_configs:
- job_name: system
static_configs:
- targets:
- localhost
labels:
job: nginxlogs
__path__: /var/log/json_access.log
也可从官网下载模板
$ wget https://raw.githubusercontent.com/grafana/loki/v2.6.1/clients/cmd/promtail/promtail-docker-config.yaml -O promtail-config.yaml
2.3 docker-compose.yml 配置文件
version: '3'
networks:
loki:
services:
loki:
image: "grafana/loki:2.5.0"
ports:
- "3100:3100"
volumes:
- /usr/local/Loki/conf:/etc/loki
command: -config.file=/etc/loki/loki-config.yaml
networks:
- loki
restart: always
promtail:
image: "grafana/promtail:2.5.0"
volumes:
- /usr/local/Loki/conf:/etc/promtail
- /usr/local/nginx/logs:/var/log
command: -config.file=/etc/promtail/promtail-config.yaml
networks:
- loki
restart: always
grafana:
image: grafana/grafana:latest
ports:
- "3000:3000"
networks:
- loki
restart: always
也可从官网下载模板
$ wget https://raw.githubusercontent.com/grafana/loki/v2.6.1/production/docker-compose.yaml -O docker-compose.yaml
注意事项:配置文件的地位以本人的为准(批改 volumes),我都是放在 /usr/local/Loki/conf 中
2.4 启动 loki、promtail、grafana
$ docker-compose up -d
3. 配置 Dashboard
都胜利启动后,就应该能够失常拜访了,grafana 的默认端口是 3000,能够依据须要进行映射或者再进行代理,笔者倡议间接带端口拜访,不须要再用 nginx 进行代理,若是须要用 nginx 进行代理,要牢记,不要让拜访 grafana 的日志呈现在 nginx 中,否则你每次拜访 grafana,看到的都是拜访它本人的日志。
nginx 不记录日志配置
access_log off;
3.1 登录 grafana
默认账号密码是 admin/admin,登录后记得批改明码
3.1 增加数据源
点击增加数据源,抉择 Loki
URL 输出 http://loki:3100,点击保留和测试,呈现 ’Data source connected and labels found’ 的提醒就 OK 了
3.1 增加 Dashboard
点击 import,输出 12559,再导入,这是他人制作好的现成的模板,你也能够本人制作属于本人的模板,或者去 grafana 搜寻模板,grafana dashboards
当初功败垂成!
注意事项:配置较差的机器(轻量应用服务器)不要抉择太长的工夫区间进行搜寻,会很慢,甚至报错