Zero Latency: The Next Arms Race
In the near future, your company may be competing with a computer. In fact, companies with the fastest computers, most…
Good Data Warehouse DBAs are Hard to Find
As a consultant I’m often asked about how roles and responsibilities should be delegated or identified within the IT organization…
Reality Mining – Too Much Personalization?
What does your mobile phone usage say about you? Probably a lot more than you think. Mobile phone operators are…
It’s data, Jim, but not as we know it – Part 1: What the echo of the Big Bang tells us about the nature of information
Possibly I am just turning into a grumpy old man in my middle-age, but there are two words that when…
#18: Here’s a thought…
An occasional series in which a review of recent posts on SmartData Collective reveals the following nuggets:Less is moreWe live…
HadoopDB discussion with Daniel Abadi
I spoke to Daniel Abadi a few days ago about his HadoopDB announcement that came out recently. I am sure…
Predicting the next Viral Tweet
It is time to use Twitter data for another reason: Can Predictive Analytics be used to identify which tweets have…
Getting Your Data Freq On
One of the most basic features of a data profiling tool is the ability to generate statistical summaries and frequency…
Slides from OSCON
OSCON 2009 has been a blast so far, and I've really been enjoying the presentations and meeting people from different…
Scraping data from the Web with R
Sometimes the data we need isn't packaged up nicely into a simple comma-separated file or database. It's out there, but…