Tools & Methods

Some Notes on Git

This set of notes covers the main things that I think you need to know about working with git. It is not comprehensive and mainly serves as a reminder for...

3 replies - 6812 views - 02/07/13 by Rob Allen in Articles

Four Hours of Concentration

As I’ve blogged about before, and mentioned again in my previous post, the great mathematician and physicist Henri Poincaré put in two hours of...

0 replies - 3458 views - 02/07/13 by John Cook in Articles

What is a Data Scientist?

Scott and I ventured out of the office yesterday evening to check out a new group starting up: Charlottesville’s Big Data Group. The most exciting...

0 replies - 8722 views - 02/07/13 by Doug Turnbull in Articles

The Merits System

It is clear that the annual bonus system doesn't work. A tremendous amount of research says that it demotivates people, destroys collaboration, and causes...

0 replies - 1714 views - 02/07/13 by Jurgen Appelo in Articles

Betweenness Centrality

In Network Analysis the identification of important nodes is a common task. We have various centrality measures that we can use and in this post we...

0 replies - 1407 views - 02/07/13 by Giuseppe Vettigli in Articles

Big Data, Statistics, and Computer Science

“Today, software and hardware together provide far more powerful factories than most statisticians realize, factories that many of today’s most able...

0 replies - 1640 views - 02/07/13 by Arthur Charpentier in Articles

HBase Error: Region is not online: -ROOT-,,0

If you are running HBase and commands are giving you an error that looks like this: Fri Oct 05 21:45:02 UTC 2012,...

0 replies - 1723 views - 02/07/13 by George London in Articles

MessQ: A Simple Message Queue for Socket-Based Message Enqueue/Dequeue Facility

I've spent a good amount of time setting up message-based infrastructures, so I decided to make a tool that would allow me to set up localhost-friendly,...

0 replies - 1279 views - 02/07/13 by Abhishek Kumar in Articles

Breaking the Build is Not a Crime

For years I've been taught that breaking the continuous integration build is something that should be avoided under all circumstances. Let me first quote a few...

3 replies - 6675 views - 02/06/13 by Tomasz Nurkiewicz in Articles

Check Out check_graphite

During my Puppetcamp Gent talk last week, I explained how to get alerts based on trends from graphite. A number of people asked me how to do that. First let's...

0 replies - 1819 views - 02/06/13 by Kris Buytaert in Articles

Taking a Random Walk

Consider the following time series. What does this look like? I know, it's a stupid game, but I keep using it in my time series courses. It does...

0 replies - 1463 views - 02/06/13 by Arthur Charpentier in Articles

Developing Your Own Solr Filter - Part 2

In the previous entry “Developing Your Own Solr Filter” we’ve shown how to implement a simple filter and how to use it in Apache Solr. Recently, one of...

0 replies - 1538 views - 02/06/13 by Rafał Kuć in Articles

Day to Day Tools Used in Agile Projects

Here is the list of tools being used in an Agile environment that were discussed in this forum post. I have added some based on my previous experience as...

0 replies - 2776 views - 02/06/13 by Venkatesh Kris... in Articles

In Search of Organizational Consciousness

What does it mean to be conscious? Flying across the country earlier this month, I read a recent issue of the Atlantic and in particular an article on...

0 replies - 1639 views - 02/05/13 by Eric Minick in Articles

Overdispersion with Different Exposures

In actuarial science, and insurance ratemaking, taking into account the exposure can be a nightmare (in datasets, some clients have been here for a...

0 replies - 1039 views - 02/05/13 by Arthur Charpentier in Articles