Big Data

Real-Time Hadoop Queries Will Be a Reality in 2013

Real-time Hadoop queries will be a reality in 2013 thanks to two new projects from Cloudera: Impala and Trevni. Impala is the open source...

3 replies - 4854 views - 12/06/12 by Maarten Ectors in Articles

Eight Fallacies of Declarative Computing

Erik Meijer listed eight fallacies of declarative programming in his keynote address at YOW in Melbourne this morning: Exceptions do not...

0 replies - 1773 views - 12/06/12 by John Cook in Articles

Search Solutions 2012: Highlights and Reflections

It’s around this time of year that I re-acquaint myself with a familiar sequence of events: an initial period of doubt as to whether it’s all really worth...

0 replies - 1905 views - 12/05/12 by Tony Russell-Rose in Articles

Datapoint: Weather API from the MetOffice

I'm working on a little hobby project which needs to know what the weather is going to be. I had a look around and noticed that the MetOffice had...

0 replies - 2484 views - 12/05/12 by Lorna Mitchell in Articles

DataMaps - A JavaScript Library for Interactive Maps in Data Visualizations

DataMaps is a JavaScript library for interactive maps in data visualizations. DataMaps is bundled into a single JavaScript file. Features: Uses D3.js...

0 replies - 3609 views - 12/04/12 by Hirvesh Munogee in Articles

MapReduce's Founding Documents

MapReduce is an incredibly powerful algorithm, especially when used to process large amounts of data using distributed systems of commodity hardware. It...

0 replies - 3734 views - 12/04/12 by Mike Miller in Articles

jStat - A JavaScript Statistical Library

jStat is a statistical library written in JavaScript that allows you to perform advanced statistical operations without the need for a dedicated...

0 replies - 3640 views - 12/03/12 by Hirvesh Munogee in Articles

How to Save R Objects

Yesterday, Christopher asked me how to store an R object to save some time when working on a project. First, download the CSV file for searches...

0 replies - 2148 views - 12/03/12 by Arthur Charpentier in Articles

Building a Data Warehouse, Part 5: Application Development Options

See also: Part I: When to build your data warehouse; Part II: Building a new schema; Part III: Location of your data warehouse; Part IV: Extraction,...

0 replies - 3137 views - 12/02/12 by John Cook in Articles

Winston Churchill, Bessie Braddock, and Python

Last night I was talking with someone about the pros and cons of various programming languages and frameworks for data analysis. One of the pros of Python is...

0 replies - 2082 views - 12/01/12 by John Cook in Articles

Building a Data Warehouse, Part 4: Extraction, Transformation, and Load

See also: Part I: When to build your data warehouse; Part II: Building a new schema; Part III: Location of your data warehouse. In Part I we looked...

1 reply - 2845 views - 11/30/12 by Stephen Forte in Articles

Equivalent Form of the Riemann Hypothesis

The famous Riemann hypothesis is equivalent to the following not-so-famous conjecture: For...

0 replies - 2441 views - 11/30/12 by John Cook in Articles

Building a Data Warehouse, Part 3: Location of Your Data Warehouse

See also: Part I: When to build your data warehouse; Part II: Building a new schema. In Part I we looked at the advantages of building a data warehouse...

0 replies - 2588 views - 11/29/12 by Stephen Forte in Articles

This Sample iPad App Works with a JSON-Based Machine Learning System

GoSmarter is a JSON-based machine learning system that, given a natural language (NLP) query, returns a list of recommended products. This Machine...

0 replies - 3029 views - 11/29/12 by Krishna Prasad in Articles

Building a Data Warehouse, Part 2: Building a New Schema

See also: Part I: When to build your data warehouse. In Part I we looked at when you should build your data warehouse and concluded that you should build it...

0 replies - 3591 views - 11/28/12 by Stephen Forte in Articles