Apache Falcon

Data feed management system for Hadoop. Supports the definition, scheduling and orchestration of data processing pipelines (referred to as processes, with support for Oozie, Spark, Hive and Pig jobs), including late data handling and retry policies; the management of the data produced and consumed by these pipelines (referred to as feeds, with support for data in HDFS and Hive); and the generation and visualisation of pipeline lineage information, all across multiple Hadoop clusters. Can also mirror or replicate HDFS and Hive data between clusters, fail over processing between clusters, and import and export data using Sqoop. Provides a web interface, a command line interface and a REST API. An Apache project, originally donated by InMobi in April 2013 and graduating in December 2014. Hasn't yet reached a v1.0 milestone, is seeing very little development activity, and as of HDP 3.0 will no longer be distributed by Hortonworks.

Technology Information

Other Names: Falcon
Vendors: The Apache Software Foundation
Type: Open Source - Quiet
Last Updated: March 2018 - v0.11

Release History

Version | Release Date | Release Links | Release Comment
0.11 | 2018-03-12 | release note |


  • Blog updates via the Hortonworks and Apache blogs

Blog Posts