Interactive stock visualizations with R
Jeroen Ooms, who recently completed his Master's in Statistics at Utrecht University, has created an outstanding web-based drag-and-drop application for…
#21: Here’s a thought…
An occasional series in which a review of recent posts on SmartData Collective reveals the following nuggets: They just don’t get…
HCIR: Better Than Magic!
I’m a big fan of using machine learning and automated information extraction to improve search performance and generally support information…
To Parse or Not To Parse
“To Parse, or Not To Parse,—that is the question: Whether 'tis nobler in the data to suffer The slings and…
Considering the Data Diet
I had the unique pleasure of meeting with Pete Fader and Eric Bradlow from the Wharton Interactive Media Initiative last…
Great Series of Posts on Medical Literature Search
Gene Golovchinsky at FXPAL has written a great series of posts on medical literature search, specifically looking at how MeSH…
Finding, Locating, Discovering
Thanks to Tony Hollingsworth for alerting me to a post by Alex Campbell entitled “Stark realisation: I no longer depend…
Free as in Freebase
It’s been a while since I’ve blogged about Freebase, the semantic web database maintained by Metaweb. But I recently had…
Adventures in Data Profiling (Part 5)
In Part 4 of this series: You went totally postal... shifting your focus to postal address by first analyzing the…
New to Data Quality Analysis? Try These “9+1 Things To Do”!
Did you just get moved over from one data warehouse support group to another? Do you know nothing or very…