Azure HDInsight edit   discuss  

Service for dynamically provisioning Hadoop clusters on Azure Virtual Machines based on a set of pre-defined cluster templates for Hadoop, Spark, HBase, Storm, Hive LLAP, Kafka or Machine Learning. Based on the Hortonworks HDP distribution of Hadoop, with support for Azure Blob Storage and Azure Data Lake Storage (both strongly consistent) but not local HDFS. Supports manual scaling of in-flight clusters, integration with Azure Log Analytics, encryption, use of external SQL database for Hive metadata and script actions (scripts that can be run during or after cluster creation). Comes with an Enterprise Security Package add-on that adds integration with Azure Active Directory, role based access control for Hive and Spark using Apache Ranger and security audit logs. Manageable via the Azure Portal, Powershell, a REST API and integrates with a number of development IDEs (e.g. for interactive development of Spark jobs). Priced at an hourly rate (billed per minute) based on the VM instance types being used, in addition to any Virtual Machine charges.

Technology Information

Other NamesHDInsight
TypeCommercial
Last UpdatedApril 2019 - v4.0

Related Technologies

PackagesHortonworks Data Platform

Bundled Technologies

Note that HDInsight is largely based on HDP releases, however it doesn’t include some components (Atlas, Accumulo, Knox, Solr).

Release History

versionrelease daterelease linksrelease comment
4.02019-blog post; announcementHDP 3.0; Hive LLAP; HBase 2.0

News

See Azure updates

Blog Posts