Bing Visual Search Beta
Bing launched a Visual Search beta today that is fun to play with. The name may be a bit misleading…
I’ll show you mine if you show me yours…
Analysts don't usually quote predictive model performance. Data Mining within each industry is different, and even within the telecommunications industry…
Micro vs. Macro Information Retrieval
The Probably Irrelevant blog has been quiet for a while, but I was happy to see a new post there…
The Map is not the Territory
“The word is not the thing, the map is not the territory” is a key principle of General Semantics and…
Machine Learning in R, in a nutshell
Josh Reich has created a concise R script demonstrating various machine-learning techniques in R with simple, self-contained examples. For example,…
#21: Here’s a thought…
An occasional series in which a review of recent posts on SmartData Collective reveals the following nuggets:They just don’t get…
HCIR: Better Than Magic!
I’m a big fan of using machine learning and automated information extraction to improve search performance and generally support information…
To Parse or Not To Parse
“To Parse, or Not To Parse,—that is the question: Whether 'tis nobler in the data to suffer The slings and…
Considering the Data Diet
I had the unique pleasure of meeting with Pete Fader and Eric Bradlow from the Wharton Interactive Media Initiative last…
Great Series of Posts on Medical Literature Search
Gene Golovchinsky at FXPAL has written a great series of posts on medical literature search, specifically looking at how MeSH…