QIZMT
I have been working mostly on Windows for the past 2.5 years, and I badly wanted to get hands-on with a MapReduce framework implementation like Hadoop. For quite some time, I had been thinking of installing and configuring Hadoop on my Windows machine. The introductory tutorials on Hadoop for Windows were daunting, though: every one of them started with a caution that Hadoop is not officially supported on Windows. I still wanted to play with some parallel computing framework. I knew of a few, but I was unsure which one would be handy for trying out a few freaky things. Then I came across MySpace's Qizmt (pronounced "Kiz-Mit"). Qizmt is MySpace's implementation of the MapReduce framework. It is free and open source, licensed under the GNU GPL v3. I installed Qizmt and had everything ready in a matter of a few minutes. I ran my first example within 10 minutes or so.
Overall, the getting-started experience was easy and smooth. I simply love these kinds of toolsets and frameworks that make your life easy rather than leaving you chasing installation or configuration issues. I ran a simple word count program and everything went fine. Qizmt has a very nice feature (I am not sure whether other frameworks readily support it): you can create sample data on the fly and check the correctness of your map and reduce code instantly. All the job definitions are serialized as XML data, including the map and reduce module code. As of now, I am not sure when that code is compiled. The toolset also comes with a decent debugger. It supports viewing the call stack, local variables and debug output, and the immediate window and thread window have been really good. So far I have only tried a single-machine setup; I am yet to do the cluster setup.
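To make the word count idea concrete, here is a minimal sketch in plain Python of what the map and reduce steps do, along with the kind of "check it against sample data on the fly" test described above. This is not Qizmt's API or job format; the names `map_words`, `reduce_counts` and `run_job` are hypothetical, and the grouping step is just an in-memory stand-in for what a real framework would do across machines.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_words(line: str) -> Iterator[Tuple[str, int]]:
    """Map step (hypothetical name): emit a (word, 1) pair per word in the line."""
    for word in line.lower().split():
        yield word, 1


def reduce_counts(word: str, counts: Iterable[int]) -> Tuple[str, int]:
    """Reduce step (hypothetical name): add up all the counts for one word."""
    return word, sum(counts)


def run_job(lines: Iterable[str]) -> Dict[str, int]:
    """In-memory stand-in for the framework: map, group by key, then reduce."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for line in lines:
        for word, count in map_words(line):
            grouped[word].append(count)
    return dict(reduce_counts(word, counts) for word, counts in grouped.items())


# Sample data created on the fly, used to check the map and reduce code.
sample_lines = ["the quick brown fox", "the lazy dog", "The fox"]
word_counts = run_job(sample_lines)
assert word_counts["the"] == 3
assert word_counts["fox"] == 2
print(word_counts)
```

The point of keeping the map and reduce functions separate from `run_job` is that each can be checked on tiny inputs before the job ever touches real data, which is roughly the workflow I liked in Qizmt.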
A tutorial for Qizmt can be found here: http://code.google.com/p/qizmt/