The Mid Week News 05/06/2019

If you’ve been following us on Twitter (@OnDataEng) then you’ll have seen all this news already.

If not - here’s your new weekly news…

Technology updates (details are on the relevant technology pages):

Other technology news:

  • From the code972 blog, thoughts on the AWS Open Distro for Elasticsearch from someone who knows what they’re talking about. See also his posts on why you shouldn’t use AWS Elasticsearch Service, and on running Elasticsearch on Kubernetes - link
  • Amazon Managed Streaming for Apache Kafka is now generally available - link; InfoQ
  • I always thought their tech looked pretty interesting, but it looks like MapR are having financial troubles, although MapR have published a response - Datanami; MapR
  • Are you into or looking at Apache Beam? They’ve got a bunch of new Katas out if you’re looking to learn it - link
  • From Qubole - scaling Apache Hive through the use of a dedicated cluster for running HiveServer2 - link
  • From Datanami - the latest on Snowflake, the cloud native analytical database - link
  • From Qubole, and normally I’d not link to something like this, but this feels like a reasonable stab at the difference between a Data Lake and a Data Warehouse - link
  • From Datanami - how IBM is turning DB2 into an AI database - link
  • From Cloudera, part 2 of their blog on the introduction of attribute based access control to Apache Solr - link
  • From Elastic - how to secure your Elasticsearch cluster using all the security features now available for free - link
  • Snowflake on Google Cloud Platform has been announced - link
  • From Solutions Review, cloud data integrator Matillion has secured a bunch of VC funding. Does anyone have any experience with this, and would you be happy to write us a tech summary? - link