Our network

data matching

A Record Named Duplicate

July 29, 2010 by Jim Harris
with 612 views
0

Although The Rolling Forecasts recently got the band back together for the Data Rock Star World Tour, the tour scheduling (as well as its funding and corporate sponsorship) has encountered some unexpected delays.  For now, please enjoy the following lyrics from another one of our greatest hits—this one reflects our country music... [read more]

The Very True Fear of False Positives

July 18, 2009 by Jim Harris
with 121 views
0

Data matching is commonly defined as the comparison of two or more records in order to evaluate if they correspond to the same real world entity (i.e. are duplicates) or represent some other data relationship (e.g. a family household). The need for data matching solutions is one of the primary reasons that companies invest in data... [read more]

The Two-Headed Monster of Data Matching

June 3, 2009 by Jim Harris
with 104 views
0

Data matching is commonly defined as the comparison of two or more records in order to evaluate if they correspond to the same real world entity (i.e. are duplicates) or represent some other data relationship (e.g. a family household). Data matching is commonly plagued by what I refer to as The Two-Headed Monster: False Negatives -... [read more]

Identifying Duplicate Customers

March 26, 2009 by Jim Harris
with 109 views
0

I just finished publishing a five part series of articles on data matching methodology for dealing with the common data quality problem of identifying duplicate customers.  The article series was published on Data Quality Pro, which is the leading data quality online magazine and free independent community resource dedicated... [read more]