Copying Data from Apache Hadoop to a Data Fabric Cluster
Describes the procedure to copy data from an Apache Hadoop to a Data Fabric cluster.
You can use the hdfs protocol, webhdfs protocol, or NFS for the HPE Data Fabric to copy data from Apache Hadoop to a Data Fabric cluster.
The following table describes these methods:
| Method | Description |
|---|---|
| hdfs:// protocol | Use the hadoop distcp command with the
hdfs:// protocol to copy data from an HDFS cluster into
a Data Fabric
cluster if the HDFS cluster and the Data Fabric cluster
use the same RPC protocol version. For all other scenarios, use the
webhdfs:// protocol or NFS for the HPE Data Fabric gateway to copy data to a
Data Fabric cluster. |
| webhdfs:// protocol | Use the hadoop distcp command with the
webhdfs:// protocol to copy data from an HDFS cluster
into a Data Fabric cluster. |
| NFS | Mount a Data Fabric cluster to an HDFS cluster using NFS for the
HPE Data Fabric mount. Then use the
hadoop distcp command to copy data between the two
clusters. |