Copying Data from Apache Hadoop to a Data Fabric Cluster
Describes the procedure for copying data from an Apache Hadoop cluster to a Data Fabric cluster.
You can copy data from Apache Hadoop to a Data Fabric cluster using the hdfs:// protocol, the webhdfs:// protocol, or NFS for the HPE Ezmeral Data Fabric.
The following table describes these methods:
Method | Description |
---|---|
hdfs:// protocol | Use the hadoop distcp command with the hdfs:// protocol to copy data from an HDFS cluster into a Data Fabric cluster if the HDFS cluster and the Data Fabric cluster use the same RPC protocol version. For all other scenarios, use the webhdfs:// protocol or NFS for the HPE Ezmeral Data Fabric gateway to copy data to a Data Fabric cluster. |
webhdfs:// protocol | Use the hadoop distcp command with the webhdfs:// protocol to copy data from an HDFS cluster into a Data Fabric cluster. |
NFS | Mount a Data Fabric cluster on an HDFS cluster using NFS for the HPE Ezmeral Data Fabric. Then use the hadoop distcp command to copy data between the two clusters. |
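
The three methods in the table differ mainly in the source and destination URIs passed to hadoop distcp. The following is a minimal dry-run sketch that only prints the command for each method rather than running it. The helper name distcp_cmd, the host nn1.example.com, the ports 8020 and 50070, the path /user/sara, and the mount point /mapr/my.cluster.com are all illustrative placeholders, and the maprfs:// destination scheme and file:// NFS destination are assumptions about a typical Data Fabric setup; substitute the values for your own clusters.

```shell
#!/usr/bin/env bash
# Dry-run sketch: prints a hadoop distcp command for each copy method
# instead of executing it. Every value below is a placeholder (assumption).

SRC_PATH="/user/sara"               # hypothetical source directory on HDFS
DEST_PATH="/user/sara"              # hypothetical destination directory
NFS_MOUNT="/mapr/my.cluster.com"    # hypothetical NFS mount point of the Data Fabric cluster

# distcp_cmd METHOD NAMENODE_HOST PORT
#   METHOD is one of: hdfs, webhdfs, nfs
distcp_cmd() {
  local method="$1" host="$2" port="$3"
  case "$method" in
    hdfs|webhdfs)
      # Source uses the chosen protocol; destination is the Data Fabric file system.
      echo "hadoop distcp ${method}://${host}:${port}${SRC_PATH} maprfs://${DEST_PATH}"
      ;;
    nfs)
      # Source is read over hdfs://; destination is the NFS-mounted path,
      # which must be visible on the nodes that run the copy tasks.
      echo "hadoop distcp hdfs://${host}:${port}${SRC_PATH} file://${NFS_MOUNT}${DEST_PATH}"
      ;;
    *)
      echo "unknown method: ${method}" >&2
      return 1
      ;;
  esac
}

distcp_cmd hdfs    nn1.example.com 8020
distcp_cmd webhdfs nn1.example.com 50070
distcp_cmd nfs     nn1.example.com 8020
```

Review the printed commands first, then run the one that matches your scenario. The NameNode RPC and HTTP ports vary between Hadoop versions and site configurations, so confirm them against the source cluster before copying.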