Did you know? DZone has great portals for Python, Cloud, NoSQL, and HTML5!

Hadoop

  • submit to reddit

Visually programming Hadoop with Kettle

More than 6 years ago I announced on Javalobby the open source availability of a data integration tool called Kettle. This tool and the community around it has grown tremendously since then. From a one-person project it grew to have over 60 committers on...

0 replies - 403 views - 01/30/12 by Matt Casters in Announcements

Big Data Predictions for 2012

Well 2011 has been a great year for Hadoop and its supporting ecosystem. There is a growing base of sub projects evolving to fill the many niches in and around Hadoop and there are companies coming out of the wood work to claim their piece of the pie. Not to...

0 replies - 511 views - 01/07/12 by Rauf Issa in Announcements

Clustered Services With Apache Whirr: From Dev Up

Apache Whirr is an incubator project focused on simplifying management of distributed services such as Hadoop, ElasticSearch, and Cassandra. Using Whirr,...

0 replies - 2682 views - 12/25/11 by Mitchell Pronsc... in Videos

Clustered Services With Apache Whirr: From Ops Down

Apache Whirr is an incubator project focused on simplifying management of distributed services such as Hadoop, ElasticSearch, and Cassandra. Using Whirr,...

1 replies - 2607 views - 12/24/11 by Mitchell Pronsc... in Videos

Hadoop Meets JobServer - Big Data Job Scheduling

Grand Logic is pleased to announce the release of JobServer 3.4. This release delivers support for Hadoop allowing Hadoop customers to use JobServer as their central access point to schedule, track and report on their Hadoop jobs and environment. Hadoop is...

0 replies - 479 views - 12/10/11 by Rauf Issa in Announcements

SOAfaces Powers up Hadoop

soafaces has always been a very flexible open source toolkit for building GWT clients and server components. Now with the release of v2.6.0, soafaces enables easier integration and management with third party systems like Hadoop by adding key features needed...

0 replies - 438 views - 12/10/11 by Rauf Issa in Announcements

How DataSift is Datamining 120K Tweets Per Second

Attention architectural gurus!  Get ready to learn about how one company puts together its amazing datamining architecture, and hopefully you'll also walk away with some ideas of your own after reading Todd Hoff's new post on High Scalability.  His post...

0 replies - 4567 views - 11/30/11 by Mitchell Pronsc... in Articles

Using Lucene and Cascalog for Fast Text Processing at Scale

This post explains text processing and analytics techniques used at the startup Yieldbot.  Their technology uses open source tools including Cascalog, Lucene, Hadoop, and Clojure's Java Interop.  The following post was authored by Soren Macbeth, a Data...

0 replies - 3299 views - 11/09/11 by Mitchell Pronsc... in Articles

Exponential Growth Reported for Hadoop and Puppet Job Needs - Java is Also Surging

Some stats reported by Dice.com (mainly IT jobs) and Indeed.com show that the April to June timespan from this year had 3x as many postings about Puppet skills requirements and over double the postings for Hadoop skills.  Although the Indeed.com fastest...

0 replies - 4529 views - 11/02/11 by Mitchell Pronsc... in News

Video: Search + Big Data: It's (still) All About the User

Thanks in no small part to Lucene, quality keyword search is easily obtainable. Likewise, tools like Apache Hadoop and its ecosystem have made it easier to store and process large quantities of data.

0 replies - 2801 views - 10/29/11 by Grant Ingersoll in Articles

Hadoop lets you store everything; with Lucene/Solr and more

This month’s Wired Magazine features a story on the roots of Hadoop at Yahoo and the three companies vying to drive its commercial frontiers farther forward faster: Hortonworks (Apache Lucene Eurocon Barcelona Keynote Video now available, see below),...

0 replies - 4591 views - 10/28/11 by David Fishman in Articles

Solr and Hadoop 'HUG' it Out

This talk on using Hadoop and Solr together for a NoSQL-like result was given by Ken Krugler, a friend of DZone who wrote the amazingly popular article, Solr +...

0 replies - 3577 views - 10/28/11 by Mitchell Pronsc... in Videos

Video: DevOps & BigData at Massive Scale

The MIT TR35 presentations hosted an awesome session today with Jeff Hammerbacher, a Data specialist who worked at Facebook, and Jesse Robbins, the CEO of Opscode (the makers of Chef).  Jeff talks in the first half of this video about his time at Facebook,...

0 replies - 2925 views - 10/18/11 by Mitchell Pronsc... in News

Daily Dose: Cloudera And Dell Set To Deliver "Complete" Hadoop Solution

Cloudera and Dell have partnered to deliver the industry's first total Apache Hadoop solution, which will combine Dell servers and networking components with Cloudera's Distribution, including Apache Hadoop, management tools, training and support. Customers...

0 replies - 16170 views - 08/07/11 by Jim Moscater in Daily Dose

Daily Dose: Google Enters the Social Network Ring with Google+

The long-awaited social networking project from Google was revealed yesterday, dubbed Google+. It is not clear from whether or not the social network is aimed to take on Facebook, but Google+ does have some compelling features:   

0 replies - 15494 views - 06/29/11 by Ross Jernigan in Daily Dose