This started out as a page for Hadoop-related stuff, but my feeling is that there’s a lot more interesting stuff out there than “just” Hadoop, so I’ll be adding generic Big Data resources

TODO: add the remainder of my private list, after filtering it a bit.

Date Link Notes
Jul 7 Kafka A distributed messaging system with commit logs
Apr 1 Druid A clustered column store able to ingest and query data on the fly
PigPen A Clojure library for orchestrating complex Hadoop jobs.
Oct 1 H2O An analytics/machine learning toolkit