Apache Flink edit  

Specialised stream processing technology inspired by the Google Data Flow model. Based on a single record (not micro batch) model, with exactly once processing semantics (for supported sources and sinks) via light weight checkpointing, and focusing on high throughput, low latency use cases. Supports both a Java and Scala API, with a fluent DataStream API for working with continuous data flows (including a flexible windowing API that supports both event time and processing time windows and support for out of order or late data), and a DataSet API for working with batch data sets (that uses the same streaming execution engine). Also supports a number of connectors and extra libraries, including experimental support for SQL expressions, a CEP library (FlinkCEP) that can be used to detect complex event patterns, a beta package for running Storm apps on Flink, a graph processing library (Gelly) and a machine learning library (FlinkML). Clustered, with support for YARN, Mesos and Kubernetes as well as standalone clusters. Open sourced by Data Artisans in April 2013, donated to the Apache Foundation in April 2014 before graduating in August 2014. Under active development with a large number of contributors and a range of user case studies. Sold as a hosted managed service (dA Platform) by Data Artisans who also supply training.

Technology Information

Other NamesFlink
VendorsThe Apache Software Foundation, Data Artisans
TypeCommercial Open Source
Last UpdatedApril 2019 - 1.8

Related Technologies

Is packaged byApache Bigtop, Amazon EMR

Release History

versionrelease daterelease linksrelease comment
1.52018-06-05summaryDatanami view
1.62018-08-09summary; Data Artisans view 


Blog Posts