Big Data

  • submit to reddit

HBase Schema Introduction for Programmers

Schema design in NoSQL is very different from schema design in a RDBMS. Once you get something like HBase up and running, you may find yourself staring...

0 replies - 2120 views - 04/30/13 by Chase Seibert in Articles

Open Data Backed by Source Control

Source-control backing is a decade-long obsession of mine. Now I'm thinking about “open data.” If something can be represented by a textual document, is...

0 replies - 3902 views - 04/30/13 by Paul Hammant in Articles

Understanding Solr Soft Commits And Data Durability

I ran into an interesting problem today. I was working with the first project where we legitimately needed Solr soft commits and in testing my configuration I...

0 replies - 1404 views - 04/29/13 by John Berryman in Articles

Visualisation – the key that unlocks data’s value?

As the Big Data hype machine continues its relentless attempt to gobble everything in its path, new business units and entire new domains buying into the...

0 replies - 2138 views - 04/29/13 by Paul Miller in Articles

LinkedIn TechTalk: Machine Learning Basics

Via LinkedIn TechTalks, Rob Bekkerman delves into the basics of machine learning:

0 replies - 542 views - 04/28/13 by Eric Gregory in Articles

Data News: Spreadsheet Errors, Jane Austen as Games Theorist, and More

Some extremely interesting posts, this week, again on the Reinhart-Rogoffing story (I do mention many posts and articles related to that story, because I think...

0 replies - 968 views - 04/27/13 by Arthur Charpentier in Articles

Big Data Beyond Apache Hadoop – Integrating All Your Data with Apache Camel and Talend

Slides from my talk "Big Data beyond Apache Hadoop – How to integrate ALL your data" at NoSQLmatters 2013 in Cologne are online.Here the abstract:Big data...

0 replies - 1984 views - 04/26/13 by Kai Wähner in Articles

An Introduction to Machine Learning

This eight-minute tutorial acts as both an introduction to machine learning and a comparison/contrast with data mining:

2 replies - 6378 views - 04/26/13 by Eric Gregory in Articles

Understanding Bayes Theorem with Mario Kart

Trying to understand Bayes' theorem? Here, Luigi uses it to analyze banana-related kart accidents: And for another quick and concise take on the...

0 replies - 1687 views - 04/26/13 by Eric Gregory in Articles

Make Yourself a Data Scientist

Troy Sadkowsky runs through some common challenges in becoming a data scientist, how to overcome them, and his own professional story:

0 replies - 2953 views - 04/25/13 by Eric Gregory in Articles

Data News: Dangerous Predictions, Killing Jargon, and More

Following previous posts on this blog (# 46 and 47), a couple of articles that are worth reading,The end of the Reinhart-Rogoffing story,...

0 replies - 787 views - 04/24/13 by Arthur Charpentier in Articles

Being a Data Scientist at Tumblr and Kickstarter

Data scientists from Tumblr, Kickstarter, and other sites discuss leveraging big data in a startup situation, in this panel from DataGotham 2012:

0 replies - 1325 views - 04/24/13 by Eric Gregory in Articles

Realtime Analytics for Big Data: A Facebook Case Study

This deep dive into analytics at Facebook explores their choice of HBase over Cassandra, and how to learn from Facebook's choices.

0 replies - 2763 views - 04/24/13 by Eric Gregory in Articles

Privacy is Dead. Time to Prepare.

Personal privacy is over.The world knows more about you than you do and soon it will know even more.We can keep fighting the battle to secure our privacy or we...

1 replies - 2976 views - 04/24/13 by John Sonmez in Articles

My New Forecasting Book is Finally Finished

My new online fore­cast­ing book (writ­ten with George Athana­sopou­los) is now com­pleted. I pre­vi­ously described it on this...

0 replies - 1756 views - 04/23/13 by Rob J Hyndman in Articles