Pig

  • submit to reddit

Cassandra Bulk CDC Extract

ProblemOur client was not able to achieve the performance they needed from their traditional RDBMS persistence platform.  Their requirement was to have...

0 replies - 1728 views - 06/14/13 by Todd Homa in Articles

Herding Apache Pig: Using Pig with Perl and Python

The past week or so we got some new data that we had to process quickly. There are quite a few technologies out there to quickly churn map/reduce jobs on...

0 replies - 1577 views - 03/05/13 by Arnon Rotem-gal-oz in Articles

This Quick Pig Overview Brings You Up to Speed Line by Line

This twenty minute tutorial from Dan Morrill explains a simple Pig script line by line. Concise and useful:

0 replies - 1536 views - 02/15/13 by Eric Gregory in Articles

Hadoop, Pig, and Broken Dreams of Environment Variables

Okay, this post isn't nearly as melodramatic as its title – I’m doing some log analysis with Hadoop and Pig. As the logs are coming from a webserver and...

0 replies - 3016 views - 09/12/12 by Oliver Hookins in Articles

The Search for a Better BIG Data Analytics Pipeline

"Big Data Analytics" has recently been one of the hottest buzzwords.  It is a combination of "Big Data" and "Deep...

0 replies - 3899 views - 08/14/12 by Ricky Ho in Articles

An Introduction to Apache Bigtop / Installing Hive, HBase, and Pig

In the previous post we learned how easy it was to install Hadoop with Apache Bigtop! We know its not just Hadoop and there are sub-projects around the...

0 replies - 4770 views - 07/15/12 by Swathi Venkatachala in Articles

How to Develop Map/Reduce with Reduced Assumptions

It all started with this odd bug…  One of our teams is writing a service, that among other things, runs map/reduce jobs built as pig scripts with Java...

0 replies - 4240 views - 05/10/12 by Arnon Rotem-gal-oz in Articles

Pig / Cassandra: binary operator expected

If you are trying to run Pig on Cassandra and you encounter: "binary operator expected"You are most likely running pig_cassandra against the latest...

0 replies - 3705 views - 01/01/12 by Brian O' Neill in Articles

Hadoop Study Reveals Usage Stats, Benefits, and Challenges

A new survey on Hadoop suggests that companies using the Apache project's utilities (which include Hadoop Commons, ZooKeeper, HDFS, Hive, MapReduce, etc.) are...

0 replies - 8707 views - 10/21/10 by Mitch Pronschinske in Articles