
Is the inevitable Big Data shakeout coming? If you are an enterprise customer, how do you prepare for this? What strategies do you adopt to take...
0 replies - 1455 views - 01/16/13 by Ravi Kalakota in Articles

This post covers the pattern of secondary sorting, found in chapter 3 of Data-Intensive Text Processing with MapReduce. While Hadoop...
0 replies - 981 views - 01/16/13 by Bill Bejeck in Articles

I was at the Web Directions South conference the other day and you know what really struck me? There is a lot of very cool, very connected stuff...
0 replies - 3205 views - 01/15/13 by Troy Hunt in Articles

I’ve thought about making a personal FAQ page. If I do, one of the questions would be what elementary statistics book I recommend. Unfortunately, I don’t...
0 replies - 1565 views - 01/15/13 by John Cook in Articles

I’ve been wondering for a few years now, why it’s so hard to get companies to prioritize the work that I feel is important. I mean, I’m telling you how...
0 replies - 2249 views - 01/15/13 by Aaron Nichols in Articles

Amazon is taking another step at disrupting an existing market. This time they have their sight set on the Datawarehouse market. Amazon is currently running a...
0 replies - 502 views - 01/15/13 by Maarten Ectors in Articles

Every week, we check in with a new developer/blogger from the DZone community to find out what they're working on now and what's coming next. This week...
0 replies - 3298 views - 01/15/13 by Eric Gregory in Articles

When I started following along the devops movement often times the phrase “delivering value” would appear in conversation. That made me ponder even harder...
0 replies - 1035 views - 01/15/13 by Spike Morelli in Articles

Optimization is a very common problem in data analytics. Given a set of variables (which one has control), how to pick the right value such that...
0 replies - 959 views - 01/15/13 by Ricky Ho in Articles

I recently tweeted an article about a professor who posed a question
to his student. The student gave a simple solution to the professors
problem. He...
0 replies - 2016 views - 01/14/13 by Mahdi Yusuf in Articles

I recently had a converation with a friend that could be paraphrased as follows:Friend: Whoever defined nano as the default crontab editor for Ubuntu deserves...
0 replies - 1408 views - 01/14/13 by Jason Whaley in Articles

As mentioned in the Appendix of Modern Actuarial Risk Theory, “R (and S) is the ‘lingua franca’ of data analysis and statistical computing, used in...
0 replies - 1347 views - 01/14/13 by Arthur Charpentier in Articles

This article is by Stephen Mouring Jr, appearing courtesy of Scott Leberknight.Working with large datasets in Hadoop / Hive works is difficult when you have an...
0 replies - 898 views - 01/14/13 by Scott Leberknight in Articles

As discussed in the previous post about Twitter’s Storm, Hadoop is a batch oriented solution that has a lack of support for ad-hoc, real-time...
0 replies - 1031 views - 01/14/13 by Istvan Szegedi in Articles

Rubytune has put together a reference sheet of DevOps-y command-line tips dealing with process basics, memory, disk/files, network, and more. Want to get...
0 replies - 736 views - 01/14/13 by Eric Gregory in Articles