- Configure Metric Retention
By default, OpenTSDB stores two weeks of metrics. Based on your requirements, you can change the metric-retention period.
- Configure Queue Filters for mapr.rm.<value> Metrics
The YARN application metrics that are collected by JMX have the metric name syntax mapr.rm.<metric_name>
and the metric values are aggregated among all the queues in the default queue. However, you can configure collectd to create a filter for each queue. As an alternative, you can use the REST API queue metrics (mapr.rm_queue.<metric_name>
) which are by default set up for filtering by queue.
- Configure the Collectd Service Heap Size
The collectd
service uses an embedded JVM when it gathers metrics from the CLDB, Node Manager, Resource Manager, and Drill. You can edit the Plugin Java section of collectd.conf to configure limits to the collectd virtual memory footprint.
- CPU Metrics
Every 10 seconds, the collectd service uses the cpu plugin to gather the following CPU metrics on each node in the cluster.
- Disk Free Metrics
Every 10 seconds, the collectd service uses the df plugin to gather the following disk free metrics on each node in the cluster.
- Disk Metrics
Every 10 seconds, the collectd service uses the disk plugin to gather the following disk metrics on each node in the cluster.
- Drill Metrics
Every 10 seconds, the collectd service uses the plugin to gather the following Drill metrics on each node in the cluster.
- Hive JMX Metrics
Every 10 seconds, the collectd
service uses the Hive plug-in to gather the following Hive JMX metrics on each node in the cluster. Descriptions for the Hive metrics are not currently available.
- Kafka JMX Metrics
Starting in EEP 9.0.0, you can enable metrics collection for Kafka consumers and producers. When enabled, JMX collects Kafka producer and consumer metrics from client applications. You can view the metrics through the JConsole UI or JMXTerm CLI.
- Load Metrics
Every 10 seconds, the collectd service uses the load plugin to gather the following load metrics on each node in the cluster.
- Alarm Metrics
Every 10 seconds, the collectd service uses a plugin to gather the cluster alarms.
- Cache Metrics
Every 10 seconds, the collectd service uses a plugin to gather the following Data Fabric file system cache metrics on each node in the cluster.
- CLDB Metrics
Every 10 seconds, the collectd service uses a HPE Ezmeral Data Fabric plugin to gather the following CLDB metrics on the primary CLDB node in the cluster.
- HPE Ezmeral Data Fabric Database Metrics
Every 10 seconds, the collectd service uses a plugin to gather HPE Ezmeral Data Fabric Database metrics on each node in the cluster. HPE Ezmeral Data Fabric Database provides both node and table metrics.
- HPE Ezmeral Data Fabric Streams Metrics
Every 10 seconds, the collectd service uses a plugin to gather the following Streams metrics on each node in the cluster.
- file system Metrics
Every 10 seconds, the collectd service uses a HPE Data Fabric plugin to gather the following file system metrics on each node in the cluster.
- Process Metrics
Every 10 seconds, the collectd service uses a plugin to gather the following process metrics on each node in the cluster.
- I/O Metrics
Every 10 seconds, the collectd service uses a plugin to gather the following I/O metrics on each node in the cluster.
- RPC Metric
Every 10 seconds, the collectd service uses a plugin to gather the following RPC metrics on each node in the cluster.
- Spark JMX Metrics
Every 10 seconds, the collectd
service gathers the following Spark JMX metrics on each node in the cluster.
- Topology Metrics
Every 60 seconds, the collectd service uses a plugin to gather the following topology metrics on each node in the cluster. Use these metrics to understand disk utilization across a topology or rack. By default, these metrics include all racks and topologies associated with the cluster. However, you can use tags to specify which rack(s) or topologies(s) to include. Note: Racks and topologies can span multiple nodes and one rack can be associated with multiple topologies.
- Volume Metrics
Every 10 seconds, the collectd service uses a plugin to gather the following volume metrics on each CLDB node in the cluster.
- Virtual Memory Metrics
Every 10 seconds, the collectd service uses the vmem plugin to gather the following memory metrics on each node in the cluster.
- Memory Metrics
Every 10 seconds, the collectd service uses the memory and swap plugins to gather the following memory metrics on each node in the cluster.
- Network Metrics
Every 10 seconds, the collectd service uses the interface plugin to gather network metrics on each node in the cluster.
- Node Manager Metrics
Every 10 seconds, the collectd service uses a plugin to gather the following Node Manager metrics on each node in the cluster.
- Resource Manager Metrics
Every 10 seconds, the collectd
service uses a HPE Ezmeral Data Fabric plugin to gather Resource Manager metrics on the active Resource Manager. Collectd
gathers metrics on the Resource Manager JVM process, YARN applications, and nodes that are managed by the Resource Manager. The method used to gather the metrics differs based on the metric type.