Elasticsearch-Hadoop

A suite of open-source components for querying and writing documents to Elasticsearch from a range of Hadoop technologies, including MapReduce, Hive, Pig, Spark, Cascading and Storm. Specific functionality includes InputFormat and OutputFormat libraries for MapReduce, a Hive storage handler that allows external tables to be defined over Elasticsearch indexes, read and write functions for Pig, Java and Scala RDD-based libraries for Spark, Spark SQL support, Spark Streaming support, an Elasticsearch Tap for Cascading, and a dedicated Spout and Bolt for Storm. It previously included functionality for writing snapshots of Elasticsearch indexes to HDFS, which is now part of the Snapshot and Restore functionality in Elasticsearch. Certified with CDH, MapR and HDP.
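As an illustration of the Hive storage handler mentioned above, the following is a minimal sketch that only builds the DDL text for an external table over an Elasticsearch index; it does not connect to Hive or Elasticsearch. The helper name `es_external_table_ddl`, the table name `artists`, its columns, and the index name `radio-artists` are hypothetical. The storage handler class and the `es.resource`, `es.nodes` and `es.port` settings are assumed to match the ES-Hadoop release in use.

```python
def es_external_table_ddl(table, columns, index, nodes="localhost", port=9200):
    """Return Hive DDL text for an external table backed by an Elasticsearch index.

    `columns` is a list of (name, hive_type) pairs. Nothing is executed here;
    the returned string would be submitted to Hive separately.
    """
    cols = ", ".join(f"{name} {hive_type}" for name, hive_type in columns)
    # Connector settings passed to the storage handler as table properties.
    props = {
        "es.resource": index,   # target index (hypothetical name in the example)
        "es.nodes": nodes,      # Elasticsearch host(s) to contact
        "es.port": str(port),   # Elasticsearch HTTP port
    }
    prop_sql = ", ".join(f"'{key}' = '{value}'" for key, value in props.items())
    return (
        f"CREATE EXTERNAL TABLE {table} ({cols}) "
        "STORED BY 'org.elasticsearch.hadoop.hive.EsStorageHandler' "
        f"TBLPROPERTIES({prop_sql})"
    )


# Hypothetical table and index names, for illustration only.
ddl = es_external_table_ddl(
    "artists",
    [("id", "BIGINT"), ("name", "STRING")],
    "radio-artists",
)
print(ddl)
```

Once such a table exists in Hive, queries against it would be served from the underlying index, and inserts into it would write documents to that index.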

Technology Information

Other Names: ES-Hadoop
Vendors: Elastic
Type: Commercial Open Source
Last Updated: August 2019 - v7.3

Related Technologies

Add-on to: Elasticsearch

Release History

Version | Release Date | Release Links | Release Comment
5.5 | 2017-07-06 | announcement; release notes | Hadoop 1.x and Elasticsearch on YARN beta deprecated
5.6 | 2017-07-06 | release notes |
6.0 | 2017-11-14 | announcement; release notes | Spark Streaming support and removal of Elasticsearch on YARN beta
6.1 | 2017-12-13 | release notes | One bug fix!
6.2 | 2018-02-06 | announcement; release notes | Custom error handlers
6.3 | 2018-06-13 | announcement; release notes | Spark 2.3 support
6.4 | 2018-08-23 | announcement; release notes |
6.5 | 2018-11-14 | announcement; release notes |
6.6 | 2019-01-29 | release notes |
6.7 | 2019-03-26 | announcement; release notes | Cascading support deprecated; Elasticsearch Kerberos support
6.8 | 2019-05-20 | release notes | Compatibility release
7.0 | 2019-04-10 | announcement; release notes | Java 8 or higher; Cascading deprecation
7.1 | 2019-05-20 | release notes | Compatibility release
7.2 | 2019-06-25 | release notes | Compatibility release
7.3 | 2019-07-31 | announcement | Compatibility release