Data Lifecycle Manager edit   discuss  

A DataPlane application for replicating HDFS and Hive data between two clusters along with any associated metadata and security policies. Clusters already registered with DataPlane can be paired, at which point replication policies can be defined, which result in replication jobs running at the selected interval. Supports replicating between HDFS and cloud object storage (with some caveats around replication of security policies), replication of encrypted HDFS data, TLS encryption of replication streams, one to many replication, support for Atlas metadata replication, reporting on and management of replication jobs and HDFS snapshottable directories, with jobs executed by DLM Engine processes on the appropriate cluster. Stated future plans include support for automatic tiering of data between clusters and point in time backup and restore.

Technology Information

TypeSub-Project
Parent ProjectHortonworks DataPlane
Last UpdatedApril 2019 - 1.4

Release History

versionrelease daterelease linksrelease comment
1.02017-11-01 Alongside DataPlane 1.0
1.1unknownWhat’s NewObject store, encryption and automatic snapshot support
1.22018-10-16What’s NewAtlas metadata replication; one to many replication
1.32018-12-15What’s NewSupport for Google Cloud storage; Support for HDP 3.1
1.42019-03-27What’s NewHive External table replication