Here's what I came up with to compare word counts in two pieces of text.
If you've got any ideas, I'd love to learn about alternatives!## a function that...
0 replies - 6206 views - 11/29/13 by Kay Cichini in Articles
Seth Godin wrote recently
that cell phones repel UFO’s. He meant that people carrying digital
cameras in their pockets every day take copious pictures of...
0 replies - 4315 views - 11/28/13 by Christopher Taylor in Articles
As a response to my last post, people mentioned mlbase
to me as a potential candidate for bringing scalability and machine
learning closer together. I took...
0 replies - 7243 views - 11/28/13 by Mikio Braun in Articles
(picture via http://blogs.wsj.com/photojournal/2013/…)
Some writings worth reading
“It’s a pretty subtle distinction, but if you’re...
0 replies - 5049 views - 11/27/13 by Arthur Charpentier in Articles
This recent tutorial from Tom Hanlon at Hortonworks demonstrates how to use non-Java languages - R, in particular - to work with Hadoop data through MapReduce...
0 replies - 6347 views - 11/27/13 by Alec Noller in Articles
Cloudera Impala supports low-latency, interactive queries on Hadoop data sets either stored in Hadoop Distributed File System (HDFS) or HBase,...
1 replies - 9779 views - 11/26/13 by Istvan Szegedi in Articles
This session will discuss the transformation of the most widely distributed cable TV network in the United States, building on one of the world's most...
0 replies - 4518 views - 11/26/13 by Mitch Pronschinske in Articles
This set of slides from David Chiu at Trend Micro presents an introduction to machine learning with R. It covers the strong points of R as a language, the...
0 replies - 7766 views - 11/25/13 by Alec Noller in Articles
In order to work on Big Data Analytics (ClickStream, Sentiment, RealTime), it's very important to work with PowerBI (PowerQuery & PowerMap) using Office...
0 replies - 3312 views - 11/25/13 by Anindita Basak in Articles
Make sure you didn't miss anything with this list of the Best of the
Week in the Big Data Zone (Nov. 15 to Nov. 21). Here they are, in order
0 replies - 5516 views - 11/24/13 by Alec Noller in Articles
Nowadays, Python is probably the programming language of choice
(besides R) for data scientists for prototyping, visualization, and
running data analyses...
0 replies - 10027 views - 11/22/13 by Mikio Braun in Articles
Some writings worth reading,
“What If Obesity Is Nobody’s Fault?” http://nautil.us/issue/7/waste/t …“Statistics is the least important part of...
0 replies - 5712 views - 11/22/13 by Arthur Charpentier in Articles
I have started a new open source project - https://github.com/stealthly/scala-cassandra - that is a Scala wrapper for CQL, specifically a wrapper of the...
0 replies - 5594 views - 11/22/13 by Joe Stein in Articles
This blog post from Markus Winand gives some context for one of his Tweets:
MongoDB seems to be as bad for NoSQL as MySQL is for SQL.
0 replies - 3296 views - 11/21/13 by Alec Noller in Articles
Those of you who work with R (or Java) might be interested in FastR, an R virtual machine written in Java. You can find a full description in this PDF, but it...
0 replies - 4983 views - 11/21/13 by Alec Noller in Articles
Attached to this blog entry is the zip of my slides and demos from my Solr presentation at the CF Summit.
During my preparation for this...
0 replies - 4266 views - 11/21/13 by Raymond Camden in Articles
animated video shows how Red Hat JBoss Fuse, an enterprise service bus (ESB)
integration platform, exchanges data between diverse applications,...
0 replies - 250 views - 11/20/13 by Kristine Kelly in Uncategorized
this short video to learn the productivity benefits you will gain by using
JBoss Enterprise Application Platform 6.
0 replies - 202 views - 11/20/13 by Kristine Kelly in Uncategorized
does it mean to say that Red Hat JBoss Enterprise Application Platform is fast?
What does it mean to say that Red Hat JBoss Enterprise Application...
0 replies - 258 views - 11/20/13 by Kristine Kelly in Uncategorized
In my last post,
I wrote about how I compiled a US Social Security Agency data set into
something usable in R, and mentioned some issues scaling it up to...
0 replies - 3103 views - 11/20/13 by Matthew Dubins in Articles