• Jul 03, 2017 News!Good News! Since 2017, IJMLC has been indexed by Scopus!
  • Nov 14, 2017 News!Vol.7, No.5 has been published with online version.   [Click]
  • Aug 15, 2017 News![CFP] 2017 the annual meeting of IJMLC Editorial Board, ACMLC 2017, will be held in Singapore, December 8-10, 2017   [Click]
Search
General Information
Editor-in-chief
Dr. Lin Huang
Metropolitan State University of Denver, USA
It's my honor to take on the position of editor in chief of IJMLC. We encourage authors to submit papers concerning any branch of machine learning and computing.
IJMLC 2012 Vol.2(5): 614-617 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2012.V2.200

The Review of Fields Similarity Estimation Methods

Mahsa Sabbagh Nobarian and Mohammad Reza Feizi Derakhshi

Abstract—Accuracy and consistency are the most important factors in any databases but increasing size of data has become a great challenge in this area. Detecting duplicate records is an important and very difficult process in huge databases containing millions of records. Field matching is a major process for duplicated record detection. In this paper, an attempt is made to provide a brief survey of field matching techniques and their efficiency.

Index Terms—Duplicate detection, character based similarity metrics, edit distance, Jaro distance, Q-Grams.

Mohammad Reza Feizi Derakhshi is with Department of Computer, University of Tabriz, Tabriz, Iran (e-mail: mfeizi@tabrizu.ac.ir)
Mahsa Sabbagh Nobarian is with Department of Computer, Islamic Azad University, Shabestar Branch, Shabestar, Iran (e-mail:msn.sabbagh@yahoo.com)

[PDF]

Cite: Mahsa Sabbagh Nobarian and Mohammad Reza Feizi Derakhshi, "The Review of Fields Similarity Estimation Methods," International Journal of Machine Learning and Computing vol. 2, no. 5, pp. 614-617, 2012.

Copyright © 2008-2015. International Journal of Machine Learning and Computing. All rights reserved.
E-mail: ijmlc@ejournal.net