
Yesterday we were working on setting up our first Hadoop cluster. Though
there is much online documentation on this even then we faced a few
challenges...
0 replies - 2907 views - 07/09/12 by Abhishek Jain in Articles

There are a lot of different ways to write MapReduce jobs!!!
Sample code for this post https://github.com/joestein/amaunet
I find streaming scripts a good...
1 replies - 4048 views - 07/06/12 by Joe Stein in Articles

Well, we have seen new versions and new releases in software as days roll by. No wonder!Same is the case with Hadoop! Past few months we saw new releases on...
0 replies - 3702 views - 07/05/12 by Swathi Venkatachala in Articles

Hadoop is a great piece of software. It is not original but that
certainly does not take away its glory. It builds on parallel
processing, a concept...
2 replies - 3933 views - 07/01/12 by Tharindu Mathew in Articles

We have added many cool features in GridGain 4.1. One of them is tight integration with Hadoop ecosystem. There are two ways you can integrate with Hadoop. One...
0 replies - 5279 views - 06/28/12 by Dmitriy Setrakyan in Articles

Time to do something meaningful with C#, Azure and Apache Hadoop. In
this post, we’ll explore how to create a Mapper and Reducer in C#, to
analyze...
2 replies - 4429 views - 06/26/12 by Anoop Madhusudanan in Articles

Data growth curve: Terabytes -> Petabytes -> Exabytes ->
Zettabytes -> Yottabytes -> Brontobytes -> Geopbytes. It is
getting more...
1 replies - 8026 views - 06/21/12 by Ravi Kalakota in Articles

Note: This post is the second half of my recent Executing an Elastic MapReduce Hive Workflow from the AWS Management Console
article with a slightly modified...
0 replies - 4108 views - 06/21/12 by Roger Jennings in Articles

I am about to go live with my first production Hadoop job for a client as a proof of concept.
I found that a lot of the documentation out there is quite text...
6 replies - 4743 views - 06/19/12 by Ben Wootton in Articles

Scalability Challenges in Big Data ScienceYesterday I gave a talk on scalability and machine learning at the BerlinBuzzword
conference. I give an overview of...
0 replies - 3812 views - 06/18/12 by Mikio Braun in Articles

0 replies - 13010 views - 06/17/12 by Buddhika Chamith in Book Reviews

Last week, I attended a Hadoop Tutorial presented in Durham, NC by Sarah Sproehnle, the Director of Educational Services at Cloudera. The tutorial offered...
2 replies - 2845 views - 06/05/12 by Eric Genesky in Articles

Companies like DataStax and MapR have found a significant demand for having technologies like Hadoop and Solr easily deployable in one cluster. Jack Norris...
1 replies - 4673 views - 06/02/12 by Mitch Pronschinske in Articles

Background Amazon Web Services (AWS) introduced its Elastic MapReduce (EMR) feature with an Announcing Amazon Elastic MapReduce post by Jeff Barr on April 2,...
1 replies - 4097 views - 05/31/12 by Roger Jennings in Articles

Hadoop was written in Java. So it makes sense that Java will always be Hadoop's best friend. Hadoop streaming is great tool that allows interoperability...
1 replies - 11018 views - 05/21/12 by Mitch Pronschinske in Articles