Big Data New Age: Hadoop vs Spark
Over the past few years, Data Science has matured. With this maturity,…
100 Petabytes of Data in Poop?
University of California computer scientist Dr. Larry Smarr is a man on…
What Is a Data Scientist (and What Isn’t)?
The perception among organizations over the past five years is that more…
Ring in the New Year with New Data Products
For web-based businesses, and of course, those with a web presence (which…
Map and Reduce in MapReduce: a SAS Illustration
In last post, I mentioned Hadoop, the open source implementation of Google’s…
How to Program MapReduce Jobs in Hadoop with R
MapReduce is a powerful programming framework for efficiently processing very large amounts…
The concept of non-relational analytics
There is a lot of talk these days about relational vs. non-relational…
Amazon Elastic MapReduce, and other stuff I don’t have time to grok yet
Lots of good stuff have been coming to my attention lately.Amazon just…