Monitoring Nodes
Explains how to monitor nodes using either the Control System or the CLI.
About this task
You can check the health of the nodes on the cluster in the Control System, organized by service or by topology, or by using the CLI.
Monitoring Node Health Using the Control System
About this task
Procedure
-
Log in to the Control System and click:
- Overview to view the health of the nodes in the Node Health pane.
- Nodes to view the health of the nodes in the Node Health pane.
-
Select one of the following from the drop-down menu in the Node Health
pane.
- By Service to organize the display of nodes
by services.
This is the default view in the Overview page. This view contains the list of services and the nodes on which the service is running () and is down ().
NOTEThe color of the node (which reflects the status of the service) is even when a service is stopped (not running) on the node. - By Topology to view the display of nodes by
topology.
This is the default view in the Nodes page. This view contains the list of topologies and the health of the nodes (as shown in the following table) in the topology.
Node Color Description Indicates the node is healthy. Indicates the node is degraded and/or may need attention. A node is considered to be in degraded state if: - There is no heartbeat from the HPE Ezmeral Data Fabric filesystem/NFS node for over 60 seconds.
- One or more services are down on the node.
- One or more alarms are raised on the node.
Indicates the node is in maintenance mode. Indicates critical issue(s) on the node. A node is considered to be in critical state if: - There is no heartbeat from the node for more than 5 minutes.
- All HPE Ezmeral Data Fabric files system disks on the node are dead or are offline.
- All containers on the node are being re-replicated because either the node was removed, unregistered, or there was no heartbeat from the node for more than 1 hour.
- File server is dead/inactive because there is no heartbeat for a long time.
- NFS server on node is dead.
- HPE Ezmeral Data Fabric install directory is full.
- Node reported high HPE Ezmeral Data Fabric filesystem memory usage.
- By Service to organize the display of nodes
by services.
Monitoring Node Resource Utilization from the Control System
Procedure
Monitoring Active Node Alarms from the Control System
About this task
See Viewing Active Node Alarms for more information.
Monitoring Node Health Using the CLI or REST API
About this task
You can check general health of the nodes with the following command:
maprcli node heatmap -cluster <cluster>
This command displays a heatmap for the nodes on the specified cluster; a subset
of the output can also be visualized on the Control System. For complete
reference information, see node heatmap
.