HBase Migration
The HPE Ezmeral Data Fabric Hadoop distribution includes HBase, with a number of Data Fabric-exclusive enhancements.
HBase is the Hadoop database, which provides random, real-time read/write access to very
large datasets. The Data Fabric Hadoop distribution includes
HBase and is fully integrated with Data Fabric enhancements
for speed, usability, and dependability. Data Fabric provides
a volume (normally mounted at /hbase) to store HBase data.
- HBase bulk load jobs: If you currently use HBase bulk load jobs to import
data into HDFS, make sure to load your data into a path under the /hbase volume.
- Compression: The HBase write-ahead log (WAL) writes many tiny records, and compressing it would cause massive CPU load. Before using HBase, turn off Data Fabric compression for directories in the HBase volume.
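
The two checks above can be sketched as a small helper script. This is an illustrative sketch, not part of the product: the function names are hypothetical, the /hbase mount point is the default mentioned above and may differ on your cluster, and the `hadoop mfs -setcompression off <dir>` invocation is assumed to match the Data Fabric CLI in your release, so verify it against your version's command reference before running it.

```python
import posixpath
import subprocess

# Assumed mount point; the HBase volume is normally mounted here.
HBASE_VOLUME = "/hbase"


def is_under_volume(path, volume=HBASE_VOLUME):
    """Return True if `path` is the volume root or lies beneath it."""
    path = posixpath.normpath(path)
    volume = posixpath.normpath(volume)
    return path == volume or path.startswith(volume.rstrip("/") + "/")


def setcompression_command(directory, setting="off"):
    """Build the argument list for `hadoop mfs -setcompression` (assumed CLI)."""
    return ["hadoop", "mfs", "-setcompression", setting, directory]


def disable_compression(directory=HBASE_VOLUME):
    """Turn off compression for a directory in the HBase volume.

    Needs the `hadoop` command on the PATH of a cluster node.
    """
    if not is_under_volume(directory):
        raise ValueError(f"{directory} is not under {HBASE_VOLUME}")
    subprocess.run(setcompression_command(directory), check=True)


if __name__ == "__main__":
    # Hypothetical bulk load target, used only to demonstrate the path check.
    target = "/hbase/staging/hfiles"
    print(target, "is under the HBase volume:", is_under_volume(target))
    print("Would run:", " ".join(setcompression_command(HBASE_VOLUME)))
```

Running the script as shown only prints the path check and the command it would run; call `disable_compression()` explicitly on a cluster node to apply the setting.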