Hadoop

This started out as a page for Hadoop-related stuff, but my feeling is that there’s a lot more interesting stuff out there than “just” Hadoop, so I’ll eventually be adding generic Big Data resources.

TODO: add the remainder of my private list, after filtering it a bit.

Date Link Notes
2014-06 Kafka

A distributed messaging system with commit logs

2014-04 Druid

A clustered column store able to ingest and query data on the fly

PigPen

A Clojure library for orchestrating complex Hadoop jobs.

2013-10 H2O

An analytics/machine learning toolkit

This page is referenced in: