An MPP query engine that supports the execution of SQL queries over in HDFS, HBase, Kudu and S3 based on tables defined in the Hive Metastore. Focus is on analytical (OLAP) use cases, and more specifically on low latency interactive queries (rather than long running batch queries), with some support for batch inserts of data. Supports DDL statements for updating the Hive Metastore, uses (broadly) the same SQL syntax as Hive (including UDFs and a range of aggregate and analytical functions), as well as the same JDBC / ODBC drivers, and is therefore compatible with any Hive query tool (such as Beeline). Supports querying over data in Parquet, Text, Avro, RCFile and SequenceFile formats, with the ability to write Parquet and Text data. Support Kerberos and LDAP authentication, and integration with Apache Sentry for authorisation. Includes a shell (Impala Shell) that supports some shell only commands for tuning performance and diagnosing problems. Created by Cloudera, started in May 2011 and first announced in October 2012, with a 1.0 GA release in May 2013. Donated to the Apache Foundation in December 2015, graduating in November 2017, and is still under active development.
Other Names Impala Vendors The Apache Software Foundation Type Commercial Open Source Last Updated April 2019 - v3.2
Is packaged by Cloudera CDH, MapR Expansion Pack, Cloudera Altus Data Warehouse
version release date release links release comment 2.9 2017-06-17 changelog 2.10 2017-09-15 changelog 2.11 2018-01-18 changelog 2.12 2018-05-01 changelog 3.0 2018-05-09 changelog 3.1 2018-12-06 changelog 3.2 2019-03-29 changelog