If you have read the paper published by Google’s Jeffrey Dean and Sanjay Ghemawat (MapReduce: Simplied Data Processing on Large Clusters),
0 replies - 8992 views - 08/05/12 by Istvan Szegedi in Articles
Yesterday we were working on setting up our first Hadoop cluster. Though
there are many online documentation on this even then we faced a few
0 replies - 3376 views - 08/02/12 by Abhishek Jain in Articles
We are pleased to announce that Treasure Data Heroku Add-on is now GA on Heroku!
Treasure Data Hadoop Add-on lets Heroku users access our cloud-based...
0 replies - 2882 views - 08/01/12 by Sadayuki Furuhashi in Articles
This post describes a prototype implementation of a
simple PAAS built on the Hadoop YARN framework and the key findings from the
experiment. While there...
0 replies - 5071 views - 07/31/12 by Jaigak Song in Articles
I’ve been looking at how it might be possible to bring data from Twitter into SQL Server.
You might ask, Why ????
Well, why not ? It’s more an exercise...
1 replies - 4168 views - 07/25/12 by Nick Haslam in Articles
Episode #8 of the Podcast is a talk with Arun C. Murthy.
We talked about Hortonworks HDP1, the first release from Hortonworks, Apache Hadoop 2.0, NextGen...
0 replies - 5264 views - 07/25/12 by Joe Stein in Articles
I have noticed often that the check Hadoop uses to calculate usage
for the data nodes causes a fair amount of wait io on them driving up
0 replies - 2873 views - 07/23/12 by Joe Stein in Articles
Spring for Apache Hadoop
is a Spring project to support writing applications that can benefit of
the integration of Spring Framework and Hadoop. This...
0 replies - 4210 views - 07/18/12 by Istvan Szegedi in Articles
Episode #6 of the Podcast is a talk with Todd Lipcon from Cloudera discussing HBase. We talked about NoSQL and how it should stand for “Not Only SQL” and...
0 replies - 2241 views - 07/17/12 by Joe Stein in Articles
Before I get to the book review, I wanted to mention a basic note
about book reviews. In the past I have reviewed books in a less than
0 replies - 3808 views - 07/17/12 by Robert Diana in Articles
So if you are looking for a good NoSQL read of HBase vs. Cassandra you can check out...
1 replies - 7762 views - 07/16/12 by Joe Stein in Articles
HBase is a NoSQL database. It is based on Google’s Bigtable distributed storage system
– as it is described in Google research paper; “A Bigtable is a...
0 replies - 7240 views - 07/14/12 by Istvan Szegedi in Articles
This post was originally authored by Wayne Citrin on the JNBridge Labs page.The Apache Hadoop framework enables distributed processing of very
0 replies - 4893 views - 07/12/12 by Mitch Pronschinske in Articles
Interesting article at GigaOm: http://bit.ly/OINpfr I won’t repeat the main points - but basically it says that since Hadoop is disk/ETL/batch based it...
3 replies - 6465 views - 07/12/12 by Nikita Ivanov in Articles
A few months back we started to endeavor on a new Hadoop cluster @ medialets
We have been live with Hadoop in production since April 2010 and we
0 replies - 8071 views - 07/11/12 by Joe Stein in Articles