
Schema design in NoSQL is very different from schema design in an RDBMS. Once you get something like HBase up and running, you may find yourself staring...
0 replies - 2120 views - 04/30/13 by Chase Seibert in Articles

Source-control backing is a decade-long obsession of mine. Now I'm thinking about “open data.” If something can be represented by a textual document, is...
0 replies - 3902 views - 04/30/13 by Paul Hammant in Articles

I ran into an interesting problem today. I was working with the first project where we legitimately needed Solr soft commits and in testing my configuration I...
0 replies - 1404 views - 04/29/13 by John Berryman in Articles

As the Big Data hype machine continues its relentless attempt to gobble everything in its path, new business units and entire new domains buying into the...
0 replies - 2138 views - 04/29/13 by Paul Miller in Articles

Via LinkedIn TechTalks, Rob Bekkerman delves into the basics of machine learning:
0 replies - 542 views - 04/28/13 by Eric Gregory in Articles

Some extremely interesting posts, this week, again on the Reinhart-Rogoffing story (I do mention many posts and articles related to that story, because I think...
0 replies - 968 views - 04/27/13 by Arthur Charpentier in Articles

Slides from my talk "Big Data beyond Apache Hadoop – How to integrate ALL your data" at NoSQLmatters 2013 in Cologne are online. Here is the abstract: Big data...
0 replies - 1984 views - 04/26/13 by Kai Wähner in Articles

This eight-minute tutorial acts as both an introduction to machine learning and a comparison/contrast with data mining:
2 replies - 6378 views - 04/26/13 by Eric Gregory in Articles

Trying to understand Bayes' theorem? Here, Luigi uses it to analyze banana-related kart accidents:
And for another quick and concise take on the...
0 replies - 1687 views - 04/26/13 by Eric Gregory in Articles

Troy Sadkowsky runs through some common challenges in becoming a data scientist, how to overcome them, and his own professional story:
0 replies - 2953 views - 04/25/13 by Eric Gregory in Articles

Following previous posts on this blog (#46 and 47), a couple of articles that are worth reading, The end of the Reinhart-Rogoffing story,...
0 replies - 787 views - 04/24/13 by Arthur Charpentier in Articles

Data scientists from Tumblr, Kickstarter, and other sites discuss leveraging big data in a startup situation, in this panel from DataGotham 2012:
0 replies - 1325 views - 04/24/13 by Eric Gregory in Articles

This deep dive into analytics at Facebook explores their choice of HBase over Cassandra, and how to learn from Facebook's choices.
0 replies - 2763 views - 04/24/13 by Eric Gregory in Articles

Personal privacy is over. The world knows more about you than you do and soon it will know even more. We can keep fighting the battle to secure our privacy or we...
1 reply - 2976 views - 04/24/13 by John Sonmez in Articles

My new online forecasting book (written with George Athanasopoulos) is now completed. I previously described it on this...
0 replies - 1756 views - 04/23/13 by Rob J Hyndman in Articles