OnDataEngineering is a site focused on how to make the right data available in the right form in the right place at the right time to allow users to efficiently exploit it. This covers a range of use cases (including the acquisition and staging of data, “big data” preparation and data warehousing) to support any type of exploitation (be it reporting, big data analytics, machine learning or any one of a dozen similar capabilities).
This site currently hosts a catalog of technologies that fit within this space, organised alphabetically, by category and by vendor. We also have a blog that covers updates to the site as well as industry news, and a set of forums .
In the future it is planned for this site to also look at key use cases for different technologies within the Data Engineering space along with a set of independent, technical and critical evaluations of various technologies and architectural patterns against these. Please see the introduction blog post for further details.
This is a community owned and authored site. All the content on this site is licensed under a Creative Commons Attribution license (see here for details) and the content of the site is hosted in a public GitHub repository here, with contributions welcomed (see here for details).
For details on how to get new information added to the site delivered straight to you via e-mail, RSS or Twitter, please see here.
To get in contact with the site administrators please e-mail firstname.lastname@example.org