Hadoop – Part 1

November 1, 2010 No comments yet

Hadoop is an extremely powerful and popular framework for large-scale application and data distribution. Companies like Amazon, Twitter, Rakuten and Facebook deploy Hadoop across clusters of literally thousands of machines crunching petabytes of data under thousands of processing cores.

Scrape the First Paragraph & Image from a Wikipedia Entry

July 26, 2010 No comments yet

Automate fetching Wikipedia descriptions and images for webpage content. Render content dynamically based on specific keywords.