Hadoop

From Michael G. Noll


In my PhD project, I use Hadoop quite a lot. Hadoop is a Yahoo-sponsored open-source framework for distributed computing and data storage, similar to Google's MapReduce and GFS. If you are a passionate software developer, you should definitely give it a try. I know you want it! ;-)

Tutorials

If you are interested in Hadoop, these tutorials will get you started. Enjoy!

Use Cases

I have used Hadoop for a variety of things such as web crawling, creation of research data sets, and straightforward data analysis. Here are some examples from my personal experience. You might also want to check the Hadoop website or the Hadoop user mailing list to find out what other problems can be, and are being, solved with Hadoop.


Tags: articles, tutorials, cabs120k08, development, dmoz100k06, hadoop, programming, python, research