作者:
发表于:和作者信息及。
链接:
有一段时间没更新了主要是时间有点紧张。然而最近时不时有互联网的朋友问我关于服务器监控方面的问题?问常用的服务器监控除了用开源软件,比如:,监控外是否可以自己写shell脚本呢?根据自己的需求写出的shell脚本更能满足需求,更能细化主机监控的全面性。
下面是我常用的几个主机监控的脚本,大家可以根据自己的情况在进行修改,希望能给大家一点帮助。
1、查看主机网卡流量
#!/bin/bash
#network
#mike.xu
while : ; do
time='date %m"-"%d" "%k":"%m'
day='date %m"-"%d'
rx_before='ifconfig eth0|sed -n "8"p|awk '{print $2}'|cut -c7-'
tx_before='ifconfig eth0|sed -n "8"p|awk '{print $6}'|cut -c7-'
sleep 2
rx_after='ifconfig eth0|sed -n "8"p|awk '{print $2}'|cut -c7-'
tx_after='ifconfig eth0|sed -n "8"p|awk '{print $6}'|cut -c7-'
rx_result=$[(rx_after-rx_before)/256]
tx_result=$[(tx_after-tx_before)/256]
echo "$time now_in_speed: "$rx_result"kbps now_out_speed: "$tx_result"kbps"
sleep 2
done
2. 新看到的一则shell实现同样的功能
#!/bin/bash
function usage
{
echo "use ./test_net.sh ethx time"
echo "$1 is you network interface "
echo "$2 is the last time!"
echo "for example: ./test_net.sh eth0 2"
exit 100
}
if [ $# -lt 2 -o $# -gt 2 ];then
usage
fi
eth=$1
time=$2
old_inbw=`cat /proc/net/dev | grep $eth | awk -f'[: ] ' '{print $3}'`
old_outbw=`cat /proc/net/dev | grep $eth | awk -f'[: ] ' '{print $11}'`
while true
do
sleep $time
new_inbw=`cat /proc/net/dev | grep $eth | awk -f'[: ] ' '{print $3}'`
new_outbw=`cat /proc/net/dev | grep $eth | awk -f'[: ] ' '{print $11}'`
inbw=`expr $((($new_inbw-$old_inbw)/$time))`
outbw=`expr $((($new_outbw-$old_outbw)/$time))`
echo "$eth: in:$inbw bytes out:$outbw bytes"
old_inbw=${new_inbw}
old_outbw=${new_outbw}
done
exit 0
运行效果:
[root@bogon shell]# ./test_net.sh eth0 2
eth0: in:3097 bytes out:50374 bytes
eth0: in:3158 bytes out:44202 bytes
eth0: in:2587 bytes out:58932 bytes
eth0: in:2104 bytes out:51543 bytes
2、系统状况监控
#!/bin/sh
#systemstat.sh
#mike.xu
ip=192.168.1.227
top -n 2| grep "cpu" >>./temp/cpu.txt
free -m | grep "mem" >> ./temp/mem.txt
df -k | grep "sda1" >> ./temp/drive_sda1.txt
#df -k | grep sda2 >> ./temp/drive_sda2.txt
df -k | grep "/mnt/storage_0" >> ./temp/mnt_storage_0.txt
df -k | grep "/mnt/storage_pic" >> ./temp/mnt_storage_pic.txt
time=`date %m"."%d" "%k":"%m`
connect=`netstat -na | grep "219.238.148.30:80" | wc -l`
echo "$time $connect" >> ./temp/connect_count.txt
3、监控主机的磁盘空间,当使用空间超过90%就通过发mail来发警告
#!/bin/bash
#monitor available disk space
space='df | sed -n '/ \ / $ / p' | gawk '{print $5}' | sed 's/%//'
if [ $space -ge 90 ]
then
fty89@163.com
fi
4、 监控cpu和内存的使用情况
#!/bin/bash
#script to capture system statistics
outfile=/home/xu/capstats.csv
date='date %m/%d/%y'
time='date %k:%m:%s'
timeout='uptime'
vmout='vmstat 1 2'
users='echo $timeout | gawk '{print $4}' '
load='echo $timeout | gawk '{print $9}' | sed "s/,//' '
free='echo $vmout | sed -n '/[0-9]/p' | sed -n '2p' | gawk '{print $4} ' '
idle='echo $vmout | sed -n '/[0-9]/p' | sed -n '2p' |gawk '{print $15}' '
echo "$date,$time,$users,$load,$free,$idle" >> $outfile
5、全方位监控主机
#!/bin/bash
# check_xu.sh
# 0 * * * * /home/check_xu.sh
dat="`date %y%m%d`"
hour="`date %h`"
dir="/home/oslog/host_${dat}/${hour}"
delay=60
count=60
# whether the responsible directory exist
if ! test -d ${dir}
then
/bin/mkdir -p ${dir}
fi
# general check
export term=linux
/usr/bin/top -b -d ${delay} -n ${count} > ${dir}/top_${dat}.log 2>&1 &
# cpu check
/usr/bin/sar -u ${delay} ${count} > ${dir}/cpu_${dat}.log 2>&1 &
#/usr/bin/mpstat -p 0 ${delay} ${count} > ${dir}/cpu_0_${dat}.log 2>&1 &
#/usr/bin/mpstat -p 1 ${delay} ${count} > ${dir}/cpu_1_${dat}.log 2>&1 &
# memory check
/usr/bin/vmstat ${delay} ${count} > ${dir}/vmstat_${dat}.log 2>&1 &
# i/o check
/usr/bin/iostat ${delay} ${count} > ${dir}/iostat_${dat}.log 2>&1 &
# network check
/usr/bin/sar -n dev ${delay} ${count} > ${dir}/net_${dat}.log 2>&1 &
#/usr/bin/sar -n edev ${delay} ${count} > ${dir}/net_edev_${dat}.log 2>&1 &
放在crontab里每小时自动执行:
0 * * * * /home/check_xu.sh
这样会在/home/oslog/host_yyyymmdd/hh目录下生成各小时cpu、内存、网络,io的统计数据。
如果某个时间段产生问题了,就可以去看对应的日志信息,看看当时的主机性能如何。