Apache Kudu edit   discuss  

Columnar storage technology for tables of structured data, supporting low latency reads, updates and deletes by primary key, as well as analytical column/table scans. Provides Java, C++ and Python APIs, is queryable via Impala and Spark SQL, and provides Spark, Flume and MapReduce connectors. Supports cluster deployments (including co-existence with Hadoop), with tables partitioned into tablets (configurable on a per table basis), with tablets then replicated and distributed across the cluster, using the Raft Consensus Algorithm for consistency. Also supports variable column encoding (including bit shuffle, run length, dictionary and prefix encoding) and compression. Includes a web UI for reporting operational information, and metrics available from the command line, via HTTP or via a log file. Started in November 2012, with a initial beta release in September 2015. Donated to the Apache Foundation in December 2015, graduating in July 2016, with a 1.0 release in September 2016. Implemented in C++.

Technology Information

Other NamesKudu
VendorsThe Apache Software Foundation
TypeCommercial Open Source
Last UpdatedMarch 2019 - v1.9

Related Technologies

Is packaged byCloudera CDH

Release History

versionrelease daterelease linksrelease comment
1.42017-06-13summary 
1.52017-09-08summary; blog post 
1.62017-12-07summary; blog post 
1.72018-03-23summary; blog post 
1.82018-10-26summary; blog post 
1.92019-03-12summary; blog post 

News

Blog Posts