This started out as a page for Hadoop-related stuff, but my feeling is that there’s a lot more interesting stuff out there than “just” Hadoop, so I’ll be adding generic Big Data resources
TODO: add the remainder of my private list, after filtering it a bit.
|Jul 7||Kafka||A distributed messaging system with commit logs|
|Apr 1||Druid||A clustered column store able to ingest and query data on the fly|
|PigPen||A Clojure library for orchestrating complex Hadoop jobs.|
|Oct 1||H2O||An analytics/machine learning toolkit|