Columnar storage technology for tables of structured data, supporting low latency reads, updates and deletes by primary key, as well as analytical column/table scans. Provides Java, C++ and Python APIs, is queryable via Impala and Spark SQL, and provides Spark, Flume and MapReduce connectors. Supports cluster deployments (including co-existence with Hadoop), with tables partitioned into tablets (configurable on a per table basis), with tablets then replicated and distributed across the cluster, using the Raft Consensus Algorithm for consistency. Also supports variable column encoding (including bit shuffle, run length, dictionary and prefix encoding) and compression. Includes a web UI for reporting operational information, and metrics available from the command line, via HTTP or via a log file. Started in November 2012, with a initial beta release in September 2015. Donated to the Apache Foundation in December 2015, graduating in July 2016, with a 1.0 release in September 2016. Implemented in C++.
Other Names Kudu Vendors The Apache Software Foundation Type Commercial Open Source Last Updated March 2019 - v1.9
Is packaged by Cloudera CDH
version release date release links release comment 1.4 2017-06-13 summary 1.5 2017-09-08 summary; blog post 1.6 2017-12-07 summary; blog post 1.7 2018-03-23 summary; blog post 1.8 2018-10-26 summary; blog post 1.9 2019-03-12 summary; blog post