A DataPlane application for replicating HDFS and Hive data between two clusters along with any associated metadata and security policies. Clusters already registered with DataPlane can be paired, at which point replication policies can be defined, which result in replication jobs running at the selected interval. Supports replicating between HDFS and cloud object storage (with some caveats around replication of security policies), replication of encrypted HDFS data, TLS encryption of replication streams, one to many replication, support for Atlas metadata replication, reporting on and management of replication jobs and HDFS snapshottable directories, with jobs executed by DLM Engine processes on the appropriate cluster. Stated future plans include support for automatic tiering of data between clusters and point in time backup and restore.
Type Sub-Project Parent Project Hortonworks DataPlane Last Updated April 2019 - 1.4
version release date release links release comment 1.0 2017-11-01 Alongside DataPlane 1.0 1.1 unknown What’s New Object store, encryption and automatic snapshot support 1.2 2018-10-16 What’s New Atlas metadata replication; one to many replication 1.3 2018-12-15 What’s New Support for Google Cloud storage; Support for HDP 3.1 1.4 2019-03-27 What’s New Hive External table replication