Apache Hadoop

  • submit to reddit

Accelerating Big Data with Hadoop (HDFS, MapReduce, and HBase)

In this video, D.K. Panda from Ohio State University presents: Accelerating Big Data with Hadoop (HDFS, MapReduce and HBase) and Memcached. "The SuperMUC...

0 replies - 1888 views - 04/02/13 by Eric Genesky in Articles

Hadoop Developer - WordCount tutorial using Maven and NetBeans 7.3RC2

I have adapted the WordCount tutorial to Maven based development as this probably the most popular way to develop in companies. I am not going to...

0 replies - 1207 views - 02/13/13 by Armel Nene in Articles

Hadoop Hangover: Launch a Hadoop Cluster CDH4 Using Apache Whirr

This post is about how-to launch a CDH4 MRv1 or CDH4 Yarn cluster on EC2 instances. It's said that you can launch a cluster with the help of Whirr and in a...

0 replies - 1374 views - 02/12/13 by Swathi Venkatachala in Articles

Testing MapReduce with MRUnit

Testing and debugging multi threaded programs is hard. Now take the same programs and massively distribute them across multiple JVMs deployed on a cluster of...

0 replies - 2211 views - 02/05/13 by Muhammad Ashraf in Articles

Starfish : A Hadoop Performance Tuning Tool

Its been a long time since I've blogged... a lapse of 3-4months or so... :( Well, I thought of writing about an awesome tool for performance tuning in...

0 replies - 2153 views - 11/25/12 by Swathi Venkatachala in Articles

Innovation and Big Data in Corporations: A Roadmap

Big Data is all about technology and business model innovation.  Why? Because, a lot of next generation business models are DATA centric.  Almost all...

0 replies - 2340 views - 09/10/12 by Ravi Kalakota in Articles

Berkeley Researchers Highlight Emergence of In-Memory Processing

Researchers at the University of California, Berkeley released an excellent paper recently, analyzing data from the Hadoop installation at Facebook -- one...

0 replies - 1663 views - 09/05/12 by Nikita Ivanov in Articles

Big Data: Enterprise Hype or the Future of Enterprise?

Of all the myriad of terms that the tech industry throws around at the moment, none is as often subverted for marketing spin as “big data”. So much so...

1 replies - 2221 views - 08/15/12 by Ben Kepes in Articles

Grid Engine an Early Supporter of Hadoop Apps

Sun Microsystem's Grid Engine was recently updated with plenty of new features, including industry first they say.  Grid Engine 6.2 update 5 (SGE 6.2u5) just...

0 replies - 8178 views - 01/14/10 by Mitch Pronschinske in News

Apache Mahout Tackles A.I.

Artificial intelligence is a term frequently associated with science fiction, not software development.  However, A.I. is becoming increasingly viable as a...

3 replies - 11092 views - 11/30/09 by Mitch Pronschinske in News