Data Migration

After you migrate your applications to the Data Fabric cluster, you can copy your data from the Apache Hadoop HDFS to the Data Fabric cluster.

Once you have installed and configured your Data Fabric cluster in a test environment and migrated your applications to the Data Fabric cluster you can begin to copy over your data from the Apache Hadoop HDFS to the Data Fabric cluster.

Use any of the following methods to copy data from an HDFS cluster to a Data Fabric cluster:

Method Description
hdfs:// protocol You can use the hadoop distcp command with the hdfs:// protocol to copy data from an HDFS cluster into a Data Fabric cluster. Use this method if the HDFS cluster and the Data Fabric cluster use the same RPC protocol version. For all other scenarios, use the webhdfs:// protocol or NFS gateway to copy data to a Data Fabric cluster.
webhdfs:// protocol You can use the hadoop distcp command with the webhdfs:// protocol to copy data from an HDFS cluster into a Data Fabric cluster.
NFS You can mount a Data Fabric cluster to an HDFS cluster via NFS mount and then use the hadoop distcp command to copy data between the two clusters.