MapReduce project on a cluster

Yesterday during my Advanced Databases class, our prof gave us an alternative final project: set up a MapReduce (Hadoop) system on his new 42-node cluster and implement some database algorithms using MapReduce.

I jumped on the opportunity, but I asked if I could use Disco instead of Hadoop. He said that was fine, but warned that the documentation may not be as good. I'm so excited about this!

I'm also hoping I can use erlSim for my project in my Discrete Event System Simulation class.

Written on October 25, 2008