Site24x7 Server Monitoring (Agent Based)
Monitor your servers with our light weight, easy-to-install agent. Stay ahead of outages and performance degradation issues with over 60 performance metrics, intuitive dashboards, RCA, IT Automation, and instant alerting mechanisms.
- Agent architecture - U.S. data center
- Agent security
- OS and features
- Plugin integrations
- Reports, dashboards, & IT Automation
- Troubleshooting tips
Agent Architecture - U.S. Data Center
In Site24x7, two data centers -- Primary Data Center (PDC) and Disaster Recovery Data Center (DRDC) are set up at different locations to ensure monitoring services remain uninterrupted in the event of a data center failure.
- Primary Data Center (PDC) - plus.site24x7.com
- Disaster Recovery Data Center (DRDC) - Primary (plus2.site24x7.com) and Secondary (plus3.site24x7.com)
In case of any error in the PDC, status information via heartbeat check is automatically sent to the Primary DRDC, thus avoiding false alerts due to intermittent network failure. If there occurs an error in the Primary DRDC, the Secondary DRDC will still receive the heartbeat check to ensure continued monitoring. Server shutdowns under normal circumstances is clearly communicated by the agent to our data centers. Also, ensure the necessary domains, port, and IP addresses are whitelisted to allow access in your firewall.
Once the agent is successfully installed in the user machine, its communication to the PDC is initiated using a one-way HTTPS connection. Agent communication occurs via three major means:
- The heartbeat check that is performed every minute
- Data collection with a polling frequency of one minute and five minutes. Learn how to change this setting in bulk or for a single monitor
- Real time communication via the Device Messaging System (DMS) that happens every 30 seconds
This is a status information check done every minute by the agent to the PDC. The server's uptime is decided by how long it's been since Site24x7 received a heartbeat check from the agent running on your server. If the agent is unable to communicate with the PDC, it will try contacting the primary DRDC and then the secondary DRDC.
If the heartbeat check has not occurred for a continuous time interval of seven minutes, the Server Availability Watcher will alert the user about possible network communication failure or system crash. You can also customize this time interval in the Admin > Server Monitor > Settings page and set an appropriate value for Set Threshold for Heartbeat Check. Learn more.
Data Collection with a Polling Interval of One Minute:
The server monitoring agent will collect data every minute, consolidate it and communicate it to the Site24x7 data center every five minutes. Only in case of any threshold violation, the agent will immediately communicate to the data center and trigger an alert to the user. In addition, you can also set polling interval from a range of 1 min, 5 min, 10 mins, 15 mins, 30 mins, 1 hour, 3 hours, 6 hours, 12 hours, and one day.
During a network connection failure, the data will be stored in the server as Zip files. Once the connection is restored, the agent will send the backlog data to the Site24x7 data center. Thus, there will be no data loss even during a communication failure.
Device Messaging System (DMS):
This ensures real-time communication to the monitored servers. Communication to the DMS (dms.zoho.com) happens every 30 seconds and the agent performs several actions triggered by the user like starting/stopping a process, discovering a service/process, upgrading the agent, root cause analysis, adding a port, URL, file, directory, and event log/syslog checks.
Server Monitoring Architecture
Network Sanity Checks:
Before reporting an outage, Site24x7 checks the availability status of its own internal network by performing a Network Sanity Check. This is to make sure that it is the resource that's down and not its network. Check out the possible outage scenarios and steps to troubleshoot them.
Agent Security
As the agent has to be downloaded and installed on every server that needs monitoring, it is important to understand about its security, its resource usage and the prerequisites for successful installation. Learn more.
OS and Features
Learn more about the platforms supported, the bulk deployment options available for installing the agent in 100's of servers in one go, and the various performance metrics provided for each of these platforms.
Plugin Integrations
Monitor your databases, applications, web servers, caches, message brokers and keep track of data that matters the most. Take a look at our 50+ out-of-the-box plugins. If you don't find the one you wish to monitor, write your own using Python and Shell script for Linux and VB, Powershell, Batch and DLL for Windows.
Reports, Dashboards, & IT Automation
Receive a detailed root cause analysis report for every server downtime, view your entire server environment in one dashboard, and automate fault resolution with exclusive IT Automation tools.
Troubleshooting Tips
Having a trouble with any of the servers or applications installed? Check our troubleshooting tips.