Hadoop

Hadoop: Data Operating System of the Future?

Stanford's Amr Awadallah argues that Hadoop is the "data operating system of the future."

0 replies - 1062 views - 05/01/13 by Eric Gregory in Articles

So What? – Monitoring Hadoop beyond Ganglia

Over the last couple of months I have been talking to more and more customers who are either bringing their Hadoop clusters into production or...

1 reply - 2725 views - 04/25/13 by Michael Kopp in Articles

Hadoop/R Integration I: Streaming

If you've spent any time with MapReduce frameworks in general, by now you probably know the word-count example is the MapReduce equivalent of "Hello...

0 replies - 5437 views - 03/27/13 by Wayne Adams in Articles

Distributed Graph Computing with Gremlin

The script-step in Faunus’ Gremlin allows for the arbitrary execution of a Gremlin script against all vertices in the Faunus...

0 replies - 429 views - 03/07/13 by Marko Rodriguez in Articles

NoSQL Week in Review #16

This week around the NoSQLsphere, we found an interesting best practices list for Java devs, as well as an informative post by Martin Fowler on...

0 replies - 1258 views - 03/01/13 by Eric Genesky in Articles

Non-Stop NameNode Removes Hadoop’s Single Point of Failure

We’re pleased to announce the release of the WANdisco Non-Stop NameNode, the only 100% uptime solution for Apache Hadoop. Built on our Non-Stop patented...

0 replies - 1597 views - 02/28/13 by Jessica Thornsby in Articles

DZone Links You Don't Want To Miss (2/21/13)

18 API Business Models: The Breakdown. A useful model of 18 different ways to monetize your APIs. See what models Twitter, Facebook, Amazon,...

0 replies - 2877 views - 02/21/13 by Mitch Pronschinske in Articles

DZone Links You Don't Want To Miss (2/19/13)

Interactive Design Trends of 2013. You need to download this slide deck. It's overflowing with advice that will keep you ahead of the curve. Why...

1 reply - 101338 views - 02/19/13 by Mitch Pronschinske in Articles

An Introduction to Hadoop on Azure

At the ØREDEV conference in Sweden, Yaniv Rodenski spoke about Hadoop on Azure, discussing how it works, various storage options, cloud service...

0 replies - 1029 views - 02/08/13 by Eric Gregory in Articles

Lexicographically Sorting Large Files in Linux

When I hear the word “sort” my first thought is usually “Hadoop”! Yes, sorting is one thing that Hadoop does well, but if you’re working with large...

0 replies - 1816 views - 01/24/13 by Alex Holmes in Articles

Controlling User Logging in Hadoop

Imagine that you’re a Hadoop administrator, and to make things interesting you’re managing a multi-tenant Hadoop cluster where data scientists, developers...

0 replies - 1745 views - 01/24/13 by Alex Holmes in Articles

Apache Hadoop: Decreasing Technical Debt through Refactoring

Technical debt is worth nothing if no pragmatic action is taken in the code to control and tackle it. To illustrate the capability to automatically...

2 replies - 3759 views - 01/23/13 by Michael Muller in Articles

Here's How to Build an Optimal Hadoop Cluster

If you're ringing in the New Year by building a Hadoop cluster, then you might want to take a look at Atlantbh's detailed tutorial: Amount of data stored...

1 reply - 2589 views - 01/03/13 by Eric Gregory in Articles

GridGain and Hadoop: Differences and Synergies

GridGain is Java-based middleware for in-memory processing of big data in a distributed environment. It is based on a high-performance in-memory data platform...

0 replies - 1607 views - 12/31/12 by Dmitriy Setrakyan in Articles

MongoDB Sharding

If I had to make some safe bets on the near future, I would choose two: Hadoop and MongoDB. There is a huge demand for both technologies and...

0 replies - 3133 views - 12/26/12 by Moshe Kaplan in Articles