Below is an (incomplete!) list of big data tools. You have been assigned one of them; answer the following questions about it:
Does it help with storage, access, processing, modelling or visualization of data?
What does it do? (in words you understand)
What other tools (in this list or elsewhere) is it related to and how?
Is there a corresponding R package?
Provide a link where people could find out more about the tool.
Submit your answers in the text submission box of the Blackboard assignment.
Tools:
Storm - James
JSON - Brintz
NoSQL - Zhang
MLlib - Pahukula
MongoDB - Wei
Weka - Lei
Mahout - Skalland
PMML - Narendra Babu
D3.js - Dong
Flot - Choi
Spark - Zongo
Hive - Guermond
Pig - Kitada
Drill - Shellhammer
BigQuery - Edwards, E.
GraphX - Eng
Blink - Wang
Redshift - Edwards, M.
Shark - Olstad
CouchDB - Bernath
MLBase - Zhuo
Samoa (by Yahoo) - Guyer