R

R is a free software environment for statistical computing and graphics that I’ve been using for a while for some specific tasks.

I wholeheartedly recommend the awesome RStudio if you want to get to grips with it, and suggest looking into RHadoop if you want to hook it up to your Hadoop cluster (or, better still, SparkR).

Resources:

Category     Date  Link          Notes
Integration  2015  SparkR        Bindings to run R jobs in Spark.
Integration  2012  Rpy           A set of Python bindings for R.
Packages     2014  CausalImpact  An R package for causal inference using Bayesian structural time-series models.
Packages     2012  Slidify       A very nice way to present data from R projects.
Packages           Extrafont     Use external fonts in charts.