Data Migration
After you migrate your applications to the Data Fabric cluster, you can copy your data from the Apache Hadoop HDFS to the Data Fabric cluster.
Once you have installed and configured your Data Fabric cluster in a test environment and migrated your applications to it, you can begin copying your data from Apache Hadoop HDFS to the Data Fabric cluster.
Use any of the following methods to copy data from an HDFS cluster to a Data Fabric cluster:
Method | Description |
---|---|
hdfs:// protocol | You can use the hadoop distcp command with the hdfs:// protocol to copy data from an HDFS cluster into a Data Fabric cluster. Use this method if the HDFS cluster and the Data Fabric cluster use the same RPC protocol version. For all other scenarios, use the webhdfs:// protocol or NFS gateway to copy data to a Data Fabric cluster. |
webhdfs:// protocol | You can use the hadoop distcp command with the webhdfs:// protocol to copy data from an HDFS cluster into a Data Fabric cluster. |
NFS | You can mount the Data Fabric cluster on the HDFS cluster over NFS and then use the hadoop distcp command to copy data between the two clusters. |
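
The three methods differ mainly in the source and destination URIs passed to hadoop distcp. The following Python sketch shows one way to assemble those command lines; it only builds and prints the commands and does not run them. The helper name build_distcp_command, the host nn1.example.com, the ports, the paths, the maprfs:/// destination URI, and the /mapr/my.cluster.com NFS mount point are illustrative assumptions, not values from this page; substitute the values for your own clusters.

```python
import shlex

# Assumed default NameNode ports; these vary by Hadoop version and site
# configuration, so check your own cluster before relying on them.
DEFAULT_PORTS = {"hdfs": 8020, "webhdfs": 50070}


def build_distcp_command(scheme, namenode, src_path, dest_uri, port=None):
    """Return the argument list for a hadoop distcp copy out of HDFS.

    scheme   -- "hdfs" or "webhdfs" (how the source cluster is read)
    namenode -- host name of the source NameNode (hypothetical here)
    src_path -- absolute path on the source cluster
    dest_uri -- destination URI on the Data Fabric side
    """
    if scheme not in DEFAULT_PORTS:
        raise ValueError(f"unsupported source scheme: {scheme}")
    if not src_path.startswith("/"):
        raise ValueError("src_path must be an absolute path")
    port = port or DEFAULT_PORTS[scheme]
    source = f"{scheme}://{namenode}:{port}{src_path}"
    return ["hadoop", "distcp", source, dest_uri]


if __name__ == "__main__":
    examples = [
        # Both clusters use the same RPC protocol version: read over hdfs://.
        build_distcp_command("hdfs", "nn1.example.com", "/user/sara",
                             "maprfs:///user/sara"),
        # RPC versions differ: read over webhdfs:// instead.
        build_distcp_command("webhdfs", "nn1.example.com", "/user/sara",
                             "maprfs:///user/sara"),
        # NFS: write to the locally mounted Data Fabric file system.
        build_distcp_command("hdfs", "nn1.example.com", "/user/sara",
                             "file:///mapr/my.cluster.com/user/sara"),
    ]
    for argv in examples:
        print(shlex.join(argv))
```

Printing the commands first makes it easy to review the source and destination URIs before starting a long-running copy.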