Big Data

  • submit to reddit

Integrated Data as the Perfect Zinger

There’s a classic episode of the American television show Seinfeld called “The Comeback” where a main character, George Costanza, is...

0 replies - 893 views - 03/26/13 by Christopher Taylor in Articles

Data Analysis Training via Coursera

I recently finished the coursera course "Data Analysis," which immediately followed and somewhat overlapped "Computing for Data Analysis," also from...

1 replies - 3968 views - 03/26/13 by Wayne Adams in Articles

A Year of Blogging Analyzed with R

Today it’s exactly one year ago that I published my first blogpost on branchandbound.net. During this year I’ve written 12 posts, including this one....

0 replies - 1042 views - 03/25/13 by Sander Mak in Articles

An Introduction to Hadoop Distributed File System

Sameer Farooqui discusses the Hadoop Distributed File System:

0 replies - 1548 views - 03/25/13 by Eric Gregory in Articles

Large-Scale Data Processing with MapReduce and PHP

This PHPDay talk from David Zuelke explores data processing with PHP and MapReduce: The MapReduce framework promises to make computing of large sets of...

1 replies - 2880 views - 03/22/13 by Eric Gregory in Articles

How Cloud Computing is Revolutionizing Pharma and Genomics

Yesterday I attended an event hosted by Booz Allen/Amazon around Big Data and Cloud Computing for life sciences. It was a fascinating event that brought...

0 replies - 2593 views - 03/22/13 by Doug Turnbull in Articles

BigQuery Gets 'Big JOIN' and More New Features

Google recently announced some major new features for its BigQuery analytics tool, including SQL-esque join and aggregate functionality, native TIMESTAMP...

0 replies - 2092 views - 03/21/13 by Eric Gregory in Articles

Tennis, Scala, and Expectation Propagation Bayesian Inference

Here's a tutorial on modelling skills of tennis players with TrueSkill rating model in...

0 replies - 645 views - 03/21/13 by Daniel Korzekwa in Articles

Data and Connectedness Create a Reputation Wild West

Welcome to 24 x 7 x 365 connectedness and the challenges that come with an always-on world. That world is generating data, loads of data. And the more we’re...

0 replies - 1446 views - 03/21/13 by Christopher Taylor in Articles

Surveillance States, How Open Source Threatens SAS, and More Data Links

A very interesting post discovered a few days ago,Internet: “our surveillance state is efficient beyond the wildest dreams of...

0 replies - 1910 views - 03/21/13 by Arthur Charpentier in Articles

MongoDB 2.4 is Out!

Today MongoDB has seen a new big release, 2.4. Between the features that have been added we can see hash-based sharding, capped arrays, and a brand new Text...

0 replies - 8444 views - 03/19/13 by Giorgio Sironi in Articles

Hadoop Will Not Mow Your Lawn

"The best minds of my generation are thinking about how to make people click ads." - Jeff Hammerbacher ex- Facebook Architect It turns out...

0 replies - 4144 views - 03/19/13 by Chris Keene in Articles

On Schneier's Survelliance State

I’ll say this up front before continuing. I absolutely love reading Bruce Schneier and thoroughly respect his opinion on damn near everything that has to do...

0 replies - 794 views - 03/19/13 by Jason Whaley in Articles

Solr Unleashed: Mission Accomplished

This past Wednesday and Thursday (March 13th and 14th) OpenSource Connections held an on-site 2-day Solr training course called Solr Unleashed. We covered...

0 replies - 878 views - 03/18/13 by John Berryman in Articles

Getting Started Quickly with Hadoop and MapReduce

So here’s the problem: You’ve finally found a block of time to set down and get your head around Hadoop and MapReduce. You do a quick Google search for a...

0 replies - 1643 views - 03/17/13 by John Berryman in Articles