Tools & Methods

  • submit to reddit

Is the Big Data Shakeout Coming in 2013?

Is the inevitable Big Data shakeout coming?  If you are an enterprise customer, how do you prepare for this? What strategies do you adopt to take...

0 replies - 1455 views - 01/16/13 by Ravi Kalakota in Articles

MapReduce Algorithms – Secondary Sorting

This post covers the pattern of secondary sorting, found in chapter 3 of Data-Intensive Text Processing with MapReduce.  While Hadoop...

0 replies - 981 views - 01/16/13 by Bill Bejeck in Articles

Inviting Hackers into Your Automated Home

I was at the Web Directions South conference the other day and you know what really struck me? There is a lot of very cool, very connected stuff...

0 replies - 3205 views - 01/15/13 by Troy Hunt in Articles

What Should You Read to Learn Elementary Statistics?

I’ve thought about making a personal FAQ page. If I do, one of the questions would be what elementary statistics book I recommend. Unfortunately, I don’t...

0 replies - 1565 views - 01/15/13 by John Cook in Articles

Continuous Deployment: Are You Afraid It Might Work?

I’ve been wondering for a few years now, why it’s so hard to get companies to prioritize the work that I feel is important. I mean, I’m telling you how...

0 replies - 2249 views - 01/15/13 by Aaron Nichols in Articles

Disrupting the Datawarehouse Market with Redshift

Amazon is taking another step at disrupting an existing market. This time they have their sight set on the Datawarehouse market. Amazon is currently running a...

0 replies - 502 views - 01/15/13 by Maarten Ectors in Articles

Dev of the Week: Swathi Venkatachala

Every week, we check in with a new developer/blogger from the DZone community to find out what they're working on now and what's coming next. This week...

0 replies - 3298 views - 01/15/13 by Eric Gregory in Articles

Delivering Value: Engineers and the End User

When I started following along the devops movement often times the phrase “delivering value” would appear in conversation. That made me ponder even harder...

0 replies - 1035 views - 01/15/13 by Spike Morelli in Articles

Optimization in R

Optimization is a very common problem in data analytics.  Given a set of variables (which one has control), how to pick the right value such that...

0 replies - 959 views - 01/15/13 by Ricky Ho in Articles

Debugging is Twice as Hard

I recently tweeted an article about a professor who posed a question to his student. The student gave a simple solution to the professors problem. He...

0 replies - 2016 views - 01/14/13 by Mahdi Yusuf in Articles

Let's Put an End to Unix Editor Snobbery

I recently had a converation with a friend that could be paraphrased as follows:Friend: Whoever defined nano as the default crontab editor for Ubuntu deserves...

0 replies - 1408 views - 01/14/13 by Jason Whaley in Articles

R for Actuarial Science

As mentioned in the Appendix of Modern Actuarial Risk Theory, “R (and S) is the ‘lingua franca’ of data analysis and statistical computing, used in...

0 replies - 1347 views - 01/14/13 by Arthur Charpentier in Articles

Limiting Joins in Apache Hive

This article is by Stephen Mouring Jr, appearing courtesy of Scott Leberknight.Working with large datasets in Hadoop / Hive works is difficult when you have an...

0 replies - 898 views - 01/14/13 by Scott Leberknight in Articles

Cloudera Impala – Fast, Interactive Queries with Hadoop

As discussed in the previous post about Twitter’s Storm, Hadoop is a batch oriented solution that has a lack of support for ad-hoc, real-time...

0 replies - 1031 views - 01/14/13 by Istvan Szegedi in Articles

Here, Have a Rails DevOps Cheatsheet

Rubytune has put together a reference sheet of DevOps-y command-line tips dealing with process basics, memory, disk/files, network, and more. Want to get...

0 replies - 736 views - 01/14/13 by Eric Gregory in Articles