Hadoop

  • submit to reddit

Hadoop, Mahout, R and Lucene/Solr on One Cloud Platform

Some more cool news has come out of this week's Lucene Revolution conference related to their theme of 'Solr's not just for search, it's for analytics.  In...

0 replies - 7020 views - 05/12/12 by Mitch Pronschinske in Articles

Effective Testing Strategies for MapReduce Applications

Effective Testing Strategies for MapReduce Applications In this article I demonstrate various strategies that I have used to test Hadoop...

0 replies - 5763 views - 05/02/12 by Tim Reardon in Articles

Amazon EMR Tutorial: Running a Hadoop Job Using Custom JAR

IntroductionAmazon EMR is a web service which can be used to easily and efficiently process enormous amounts of data. It uses a hosted Hadoop framework running...

0 replies - 9646 views - 04/23/12 by Muhammad Khojaye in Articles

XpoLog Released New Log Data Analysis for Hadoop and the Cloud

XpoLog released a new log data analysis platform with integration support to HDFS, Google App Engine, Amazon EC2. The new version is immediately...

0 replies - 938 views - 04/22/12 by Haim Ko in Announcements

How to Use Windows Azure Blobs Data w/ Hadoop on Azure CTP

Introduction Avkash Chauhan (@avkashchauhan) and Denny Lee (@dennylee) have written several blog posts about the use of Windows Azure blobs as data...

0 replies - 4204 views - 04/09/12 by Roger Jennings in Articles

A Recipe for Success in Flavor Graph, the Neo4j Heroku Challenge Winner

The content of this article was originally written by Andreas Kollegger on the Neo4j blog. Flavorwocky, an amusing name for a clever idea that highlights...

0 replies - 3612 views - 04/01/12 by Eric Genesky in Articles

Acceleration for Big Data, Hadoop, and Memcached

This video is presented by Dhabaleswar K. Panda, a professor of computer science and engineering at Ohio State University and leader of the Network-Based...

0 replies - 3969 views - 03/22/12 by Eric Genesky in Articles

How to Get a Twitter-esque Architecture Out of the Box

Today, a developer can work on a platform that integrates Hadoop, Cassandra, and Solr on a single cluster…  Hey!  Those technologies are used at another...

0 replies - 6759 views - 03/21/12 by Mitch Pronschinske in Articles

Hadoop Basics - Creating a MapReduce Program

Hadoop is an open source project for processing large datasets in parallel with the use of low level commodity machines.Hadoop is build on two main parts. An...

0 replies - 31998 views - 03/18/12 by Carlo Scarioni in Articles

Getting Started with "Blur" - Search on Top of Hadoop and Lucene.

Blur is a new Apache 2.0 licensed software project that provides a search capability built on top of Hadoop and Lucene. Elastic Search and Solr already...

0 replies - 5889 views - 03/16/12 by Scott Leberknight in Articles

Joins with MapReduce

I have been reading up on Join implementations available for Hadoop for past few days. In this post I recap some techniques I learnt during the process....

0 replies - 7259 views - 03/12/12 by Buddhika Chamith in Articles

Scaling Solr Indexing with SolrCloud, Hadoop and Behemoth

We’ve been doing a lot of work at Lucid lately on scaling out Solr, so I thought I would blog about some of the things we’ve been working on recently...

0 replies - 8298 views - 03/06/12 by Grant Ingersoll in Articles

Hadoop in Practice

...

0 replies - 16198 views - 02/28/12 by Chris Smith in Articles

Enterprise Job Scheduling for Big Data & Hadoop

Businesses of all sizes are looking beyond traditional business intelligence taking a more broader approach to BI that goes beyond the traditional data...

0 replies - 1463 views - 02/14/12 by Rauf Issa in Articles

Visually programming Hadoop with Kettle

More than 6 years ago I announced on Javalobby the open source availability of a data integration tool called Kettle. This tool and the community around it...

0 replies - 2019 views - 01/30/12 by Matt Casters in Articles