Michael G. Noll

Applied Research. Big Data. Distributed Systems. Open Source.

Tutorials

Hadoop

Apache Hadoop is a free and open source implementation of frameworks for reliable, scalable, distributed computing and data storage. It enables applications to work with thousands of nodes and petabytes of data, and as such is a great tool for research and business operations. Hadoop was inspired by Google’s MapReduce and Google File System (GFS) papers.

I have written the following tutorials related to the Hadoop technology stack:

Storm

Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. It is being used at companies and institutions such as Twitter, Yahoo! and PARC.

I have written the following tutorials related to the Storm:

Mozilla Firefox

If you are a Firefox add-on developer, the tutorials below might come in handy.