This started out as a page for Hadoop-related stuff, but my feeling is that there’s a lot more interesting stuff out there than “just” Hadoop, so I’ll eventually be adding generic Big Data resources.
TODO: add the remainder of my private list, after filtering it a bit.
|2014-06||Kafka||A distributed messaging system with commit logs|
|2014-04||Druid||A clustered column store able to ingest and query data on the fly|
|PigPen||A Clojure library for orchestrating complex Hadoop jobs.|
|2013-10||H2O||An analytics/machine learning toolkit|