Posts by Sandro Saitta

Data Mining Research Interview: Stuart Shulman

Today on Data Mining Research, Stuart Shulman is answering our questions regarding his tool DiscoverText and his company Texifter. Stuart, thanks for sharing your work and taking some time to answer Data Mining Research questions. Data Mining...

Posted January 30, 2011    

Data Mining Research Awards 2010

Like music or cinema, Data Mining Research has its own awards. I think it is important to congratulate people for what they do, especially if they do it well. The following awards have been given in a subjective way: they only reflect...

Posted December 30, 2010    

Data preprocessing for clustering: survey

The University of Jena (Germany) is currently working on preprocessing for clustering. It is part of a bigger project (WhiBo). They have prepared a survey that takes around 10 minutes to fill out. Here is a brief explanation of their survey: “...

Posted November 28, 2010    

Data Mining Book Review: Business intelligence success factors

I will be clear from the beginning: this book is not about business intelligence. At least not the way most people understand BI. You will not read words such as databases, data warehouse or data mart. That being said, we can start the review....

Posted November 13, 2010    

Automating Analytics

Information Management has recently published an interesting article about automating analytics. Written by Bill Franks and Scott VanValkenburgh, it explains when automating analytics works and when it doesn’t. I think automating data mining is in...

Posted November 6, 2010    

Auto-correlation for time series analysis

Recently, I was reading the EPFL magazine and was surprised to see an article in which they interviewed my master's thesis adviser, Francois Fleuret. He explained data mining and gave an example about our project. The goal of my master project was to...

Posted October 23, 2010    

Data Mining: A new weapon in the fight against Medicaid fraud

On Friday, July 16, US Attorney General Eric Holder and Secretary of Health and Human Services Kathleen Sebelius arrived in Miami to announce that 94 people had been charged with defrauding Medicaid, the US government’s health system. As the...

Posted August 2, 2010    

Worst Practices in Data Mining

I recently read the article “Worst practices in business forecasting” by Michael Gilliland and Udo Sglavo. It was published in the July/August issue of Analytics Magazine, which is, by the way, an excellent journal about analytics. In their...

Posted July 27, 2010    

Guest post: Why Google TV Could Destroy Nielsen’s Data

It’s my pleasure to welcome Daniel Cawrey for this guest post on Data Mining Research. He has written an interesting post about Google TV and its data mining possibilities. I hope you will enjoy it. There has been a lot of hype surrounding Google...

Posted July 8, 2010    

The amount of digital data created in 2010 will equal…

If you want to impress your colleagues/friends with some huge numbers, simply use the funny comparisons made by Information Management in their article “Are You Prepared to Store All This Data?”. Here is an excerpt: “The amount of digital...

Posted July 5, 2010    

Online and offline become 1: a new era has begun (part 2)

This is the second part of the post “Online and offline become 1: a new era has begun”. In this post, I discuss the second article, by David M. Raab: “Bridging the Gap Between Online and Database Marketing”. Raab starts his article with the following...

Posted June 30, 2010    

Online and offline become 1: a new era has begun (part 1)

I recently came across two interesting articles that are closely related to our Customer Online Targeting (COT) tool. Both are from Information Management. The first one, “Online Analytics in Action” by Roman Lenzen, deals with web data and how to...

Posted June 23, 2010