Xymon configuration Report
Date: Thu Sep 19 19:14:15 2024
7 hosts included: usaea1sapuas550, usaea1sapuas551, usaea1satuas200, usaea1swbuas260, usaea1swpuas131, usaea1swpuas132, xymonserver



Explanation of Terms
Basics
Aliases: Names for this host other than the primary name, e.g. a hostname used by a client installed on the server
Monitoring location: The location of this host on the Xymon webpages
Comment, Description: Explanatory text about the host
Planned downtime: Time of day/week when the host monitoring is disabled
SLA Reporting Period: Time of day/week when the status impacts the SLA availability calculation
Network tests
Service: Corresponds to the column-name on the Xymon webpage
Critical: Whether this test appears on the Critical Systems view
C/Y/R limits: If set, this is the number of failures that must happen before the status changes to Clear/Yellow/Red
Specifics: Details about how this status is monitored
Local tests
Service: Corresponds to the column-name on the Xymon webpage
Critical: Whether this test appears on the Critical Systems view
C/Y/R limits: If set, this is the number of failures that must happen before the status changes to Clear/Yellow/Red
Configuration: Details about how this status is monitored. NOTE: The exact thresholds for each test are configured on the client, and may differ from those listed here.
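
As an illustration of how the C/Y/R limits could be read, the sketch below maps a count of consecutive failures to a status colour. The function name and the comparison rules (a status is reached once the count meets its limit, most severe first) are assumptions made for this example and are not taken from the report; every host listed here shows -/-/-, meaning no limits are set.

```python
def status_for_failures(failures, limits):
    """Map a count of consecutive failures to a status colour.

    limits is a (clear, yellow, red) tuple of failure counts.
    Assumption: a status applies once the count reaches its limit,
    and the most severe matching status wins.
    """
    clear_at, yellow_at, red_at = limits
    if failures >= red_at:
        return "red"
    if failures >= yellow_at:
        return "yellow"
    if failures >= clear_at:
        return "clear"
    return "green"
```

With hypothetical limits of 1/2/4, one failure would give "clear", two or three "yellow", and four or more "red".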

Basics usaea1sapuas550 (10.31.131.112)
Monitoring location: Top Page
Description: resin:ip-10-31-131-112.ec2.internal
SLA Reporting Period: 24x7
Network tests
Service    Critical  C/Y/R limits  Specifics
http       No        -/-/-         http://10.31.131.112:8888/
ssh        No        -/-/-
Local tests
Service    Critical  C/Y/R limits  Configuration (NB: Thresholds on client may differ)
clientlog  No        -/-/-
cpu        No        -/-/-         UNIX - Yellow: Load average > 1.5, Red: Load average > 3.0
                                   Windows - Yellow: CPU utilisation > 80%, Red: CPU utilisation > 95%
disk       No        -/-/-         Default limits: Yellow 90% full, Red 95% full
                                   /root
                                   /var
                                   /home
                                   /tmp
                                   /opt
                                   /data/ebs/logs
files      No        -/-/-
inode      No        -/-/-
memory     No        -/-/-         Yellow: swap/pagefile use > 80%, Red: swap/pagefile use > 90%
msgs       No        -/-/-
ports      No        -/-/-
procs      No        -/-/-         No processes monitored
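
The default limits listed for cpu, disk and memory can be read as a simple two-level comparison. The following is a minimal sketch of that reading, not the Xymon implementation; the names THRESHOLDS and classify are hypothetical, only the UNIX cpu values are used, and treating the disk limits as inclusive is an assumption because the report only says "90% full".

```python
# Default (yellow, red) limits as listed in this report.
THRESHOLDS = {
    "cpu": (1.5, 3.0),       # UNIX load average
    "disk": (90.0, 95.0),    # percent full
    "memory": (80.0, 90.0),  # percent of swap/pagefile in use
}


def classify(test, value):
    """Return "green", "yellow" or "red" for a measured value."""
    yellow, red = THRESHOLDS[test]
    # The report states ">" for cpu and memory. For disk it only says
    # "90% full", so an inclusive comparison is an assumption here.
    inclusive = test == "disk"

    def exceeds(limit):
        return value >= limit if inclusive else value > limit

    if exceeds(red):
        return "red"
    if exceeds(yellow):
        return "yellow"
    return "green"
```

Under these assumptions a load average of exactly 1.5 stays green, while a disk at exactly 90% goes yellow. The thresholds actually in force are configured on the client and may differ.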

Basics usaea1sapuas551 (10.31.131.60)
Monitoring location: Top Page
Description: resin:ip-10-31-131-60.ec2.internal
SLA Reporting Period: 24x7
Network tests
Service    Critical  C/Y/R limits  Specifics
http       No        -/-/-         http://10.31.131.60:8888/
ssh        No        -/-/-
Local tests
Service    Critical  C/Y/R limits  Configuration (NB: Thresholds on client may differ)
clientlog  No        -/-/-
cpu        No        -/-/-         UNIX - Yellow: Load average > 1.5, Red: Load average > 3.0
                                   Windows - Yellow: CPU utilisation > 80%, Red: CPU utilisation > 95%
disk       No        -/-/-         Default limits: Yellow 90% full, Red 95% full
                                   /root
                                   /var
                                   /home
                                   /tmp
                                   /opt
                                   /data/ebs/logs
files      No        -/-/-
inode      No        -/-/-
memory     No        -/-/-         Yellow: swap/pagefile use > 80%, Red: swap/pagefile use > 90%
msgs       No        -/-/-
ports      No        -/-/-
procs      No        -/-/-         No processes monitored

Basics usaea1satuas200 (10.31.131.42)
Monitoring location: Top Page
Description: triad:ip-10-31-131-42.ec2.internal
SLA Reporting Period: 24x7
Network tests
Service    Critical  C/Y/R limits  Specifics
ssh        No        -/-/-
Local tests
Service    Critical  C/Y/R limits  Configuration (NB: Thresholds on client may differ)
clientlog  No        -/-/-
cpu        No        -/-/-         UNIX - Yellow: Load average > 1.5, Red: Load average > 3.0
                                   Windows - Yellow: CPU utilisation > 80%, Red: CPU utilisation > 95%
disk       No        -/-/-         Default limits: Yellow 90% full, Red 95% full
                                   /root
                                   /tmp
                                   /var
                                   /home
                                   /opt
files      No        -/-/-
inode      No        -/-/-
memory     No        -/-/-         Yellow: swap/pagefile use > 80%, Red: swap/pagefile use > 90%
msgs       No        -/-/-
ports      No        -/-/-
procs      No        -/-/-         No processes monitored

Basics usaea1swbuas260 (10.31.131.14)
Monitoring location: Top Page
Description: apache:ip-10-31-131-14.ec2.internal
SLA Reporting Period: 24x7
Network tests
Service    Critical  C/Y/R limits  Specifics
http       No        -/-/-         http://10.31.131.14/healthcheck.txt
ssh        No        -/-/-
Local tests
Service    Critical  C/Y/R limits  Configuration (NB: Thresholds on client may differ)
clientlog  No        -/-/-
cpu        No        -/-/-         UNIX - Yellow: Load average > 1.5, Red: Load average > 3.0
                                   Windows - Yellow: CPU utilisation > 80%, Red: CPU utilisation > 95%
disk       No        -/-/-         Default limits: Yellow 90% full, Red 95% full
                                   /root
                                   /tmp
                                   /home
                                   /var
                                   /opt
                                   /data/ebs/logs
files      No        -/-/-
inode      No        -/-/-
memory     No        -/-/-         Yellow: swap/pagefile use > 80%, Red: swap/pagefile use > 90%
msgs       No        -/-/-
ports      No        -/-/-
procs      No        -/-/-         No processes monitored

Basics usaea1swpuas131 (10.31.131.8)
Monitoring location: Top Page
Description: PrequalServerASG:ip-10-31-131-8.ec2.internal
SLA Reporting Period: 24x7
Network tests
Service    Critical  C/Y/R limits  Specifics
http       No        -/-/-         http://10.31.131.8:80/healthcheck.txt
http2      No        -/-/-         http://10.31.131.8:8090/
ssh        No        -/-/-
Local tests
Service    Critical  C/Y/R limits  Configuration (NB: Thresholds on client may differ)
clientlog  No        -/-/-
cpu        No        -/-/-         UNIX - Yellow: Load average > 1.5, Red: Load average > 3.0
                                   Windows - Yellow: CPU utilisation > 80%, Red: CPU utilisation > 95%
disk       No        -/-/-         Default limits: Yellow 90% full, Red 95% full
                                   /root
                                   /tmp
                                   /var
                                   /home
                                   /opt
                                   /data/ebs/logs
files      No        -/-/-
inode      No        -/-/-
memory     No        -/-/-         Yellow: swap/pagefile use > 80%, Red: swap/pagefile use > 90%
msgs       No        -/-/-
ports      No        -/-/-
procs      No        -/-/-         No processes monitored

Basics usaea1swpuas132 (10.31.131.167)
Monitoring location: Top Page
Description: FraudServerASG:ip-10-31-131-167.ec2.internal
Planned downtime: All days:0000:2359 (status:http,http2) (cause:product is not live)
                  All days:0400:1200 (status:All) (cause:product is not live)
SLA Reporting Period: 24x7
Network tests
Service    Critical  C/Y/R limits  Specifics
http       No        -/-/-         http://10.31.131.167:80/healthcheck.txt
http2      No        -/-/-         http://10.31.131.167:8090/
ssh        No        -/-/-
Local tests
Service    Critical  C/Y/R limits  Configuration (NB: Thresholds on client may differ)
clientlog  No        -/-/-
cpu        No        -/-/-         UNIX - Yellow: Load average > 1.5, Red: Load average > 3.0
                                   Windows - Yellow: CPU utilisation > 80%, Red: CPU utilisation > 95%
disk       No        -/-/-         Default limits: Yellow 90% full, Red 95% full
                                   /root
                                   /var
                                   /home
                                   /tmp
                                   /opt
files      No        -/-/-
inode      No        -/-/-
memory     No        -/-/-         Yellow: swap/pagefile use > 80%, Red: swap/pagefile use > 90%
msgs       No        -/-/-
ports      No        -/-/-
procs      No        -/-/-         No processes monitored
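
The two planned-downtime entries for usaea1swpuas132 follow a days:start:end pattern with a status list and a cause. The sketch below parses an entry in the form printed in this report and checks whether a given status is covered at a given time. The function names are hypothetical, only "All days" is handled because it is the only day value that appears here, and treating the end time as inclusive is an assumption.

```python
import re

# Matches entries as printed in this report, e.g.
# "All days:0000:2359 (status:http,http2) (cause:product is not live)"
ENTRY_RE = re.compile(
    r"^(?P<days>[^:]+):(?P<start>\d{4}):(?P<end>\d{4})"
    r"\s+\(status:(?P<status>[^)]*)\)"
    r"\s+\(cause:(?P<cause>[^)]*)\)$"
)


def parse_downtime(entry):
    """Split one planned-downtime entry into its fields."""
    m = ENTRY_RE.match(entry.strip())
    if m is None:
        raise ValueError(f"unrecognised downtime entry: {entry!r}")
    return {
        "days": m["days"],
        "start": m["start"],
        "end": m["end"],
        "statuses": [s.strip() for s in m["status"].split(",")],
        "cause": m["cause"],
    }


def in_downtime(entry, status, hhmm):
    """Return True if `status` is in planned downtime at time `hhmm`."""
    d = parse_downtime(entry)
    if d["days"] != "All days":
        # Other day specifications are outside the scope of this sketch.
        raise NotImplementedError(d["days"])
    covers_status = d["statuses"] == ["All"] or status in d["statuses"]
    # Zero-padded HHMM strings compare correctly as text.
    # Assumption: the end time is inclusive.
    return covers_status and d["start"] <= hhmm <= d["end"]
```

Read this way, the first entry covers http and http2 for the whole day, while ssh and the local tests are only covered by the second entry between 0400 and 1200.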

Basics xymonserver (127.0.0.1)
Monitoring location: Top Page
SLA Reporting Period: 24x7
Network tests
Service    Critical  C/Y/R limits  Specifics
bbd        No        -/-/-
conn       No        -/-/-
http       No        -/-/-         https://localhost/xymon/www/
Local tests
Service    Critical  C/Y/R limits  Configuration (NB: Thresholds on client may differ)
clientlog  No        -/-/-
cpu        No        -/-/-         UNIX - Yellow: Load average > 1.5, Red: Load average > 3.0
                                   Windows - Yellow: CPU utilisation > 80%, Red: CPU utilisation > 95%
disk       No        -/-/-         Default limits: Yellow 90% full, Red 95% full
                                   /root
                                   /tmp
                                   /var
                                   /home
                                   /opt
                                   /data/ebs/jenkins
files      No        -/-/-
inode      No        -/-/-
memory     No        -/-/-         Yellow: swap/pagefile use > 80%, Red: swap/pagefile use > 90%
msgs       No        -/-/-
ports      No        -/-/-
procs      No        -/-/-         No processes monitored
sslcert    No        -/-/-
xymond     No        -/-/-
xymongen   No        -/-/-
xymonnet   No        -/-/-

Xymon column descriptions
bbd: The bbd column shows the status of the Xymon or Big Brother service on the host. This service is an essential part of the Xymon or Big Brother monitoring system, so a failure typically means that a large part of the monitoring system is no longer operational.
clientlog: The clientlog column shows the raw client message most recently sent by the host.
conn: The conn test performs a "ping" of the host.
cpu: The cpu column shows the status of the system processor (CPU) on the host. It checks whether the system is getting too busy to handle the load.
disk: The disk column shows the status of the system disks and other file storage areas.
files: The files test shows the status of file and directory checks performed on the host. These are typically tests that check the size of files or directories, or check that they exist with the correct owner/group/permissions.
http: The http column shows the status of one or more web requests sent to the server. HTTP is the ubiquitous method for exchanging information across a network; it is the service used when your web browser requests information from the Internet.
inode: The inode column shows how many inodes (roughly, filenames) are used on a filesystem. Running out of inodes has the same effect as filling up your disk: no new files can be created.
memory: The memory column shows how much of the system memory (RAM) and swap space is being used. If memory is running low, performance of the system will begin to degrade.
msgs: The msgs column monitors system log files or the Event log for warnings or critical errors.
ports: The ports column shows the status of selected TCP ports and connections that are expected to exist on the system.
procs: The procs column shows the status of selected processes that are expected to run on the system.
ssh: The ssh column shows the status when trying to communicate with the Secure Shell (ssh) server on the host. ssh is commonly used for encrypted console access to Unix servers, or for copying files between systems.
sslcert: The sslcert column shows the status of one or more SSL certificates on the server. SSL certificates are needed for services that use encryption, e.g. a secure webserver. Certificates are normally issued by trusted organisations such as Verisign or Thawte, and are valid for a limited period of time.
xymond: The xymond column shows the status of the central Xymon daemon.
xymongen: The xymongen column shows the status of the Xymon xymongen task. This task is responsible for updating the webpages you see when looking at the Xymon status view.
xymonnet: The xymonnet column shows the status of the Xymon network-service monitoring task. This task is responsible for testing all of the network services being monitored.
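
To make the inode description above more concrete, the sketch below computes inode usage for a filesystem with Python's os.statvfs (Unix only). It is an illustration of the quantity being reported, not how the Xymon client collects it; the function name is hypothetical.

```python
import os


def inode_usage_percent(path="/"):
    """Return the percentage of inodes in use on the filesystem holding
    `path`, or None if the filesystem does not report an inode total."""
    st = os.statvfs(path)
    if st.f_files == 0:
        # Some filesystems allocate inodes dynamically and report zero.
        return None
    used = st.f_files - st.f_ffree
    return 100.0 * used / st.f_files
```

A filesystem can reach 100% inode usage while still showing free space in the disk column, which is why the two are reported separately.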