Optimizing Massive Distance Computations in Pattern Recognition

Home > Archive > 2014 > Volume 4 Number 1 (Feb. 2014) >

IJMLC 2014 Vol.4(1): 114-119 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2014.V4.397

Andras Farago

Abstract—It is a common task in pattern recognition to evaluate the similarity of large data objects. These are often represented by high dimensional vectors. A frequently used mathematical model for evaluating their similarity is to view them as points (vectors) in a high dimensional space, and compute their distances from each other. The “distance,” however, can be defined in a very complicated way, it may be much more complex than the well known Euclidean distance. Therefore, the algorithmic bottleneck often becomes the number of distance computations that need to be carried out. We consider the case when we have to compute all the distances between n objects, where n is large. Without any shortcuts it takes n (n − 1)/2 = O( n²) distance computations. In those applications where the distances are complicated, being defined by sophisticated algorithms (such as in speech and image recognition), a quadratically growing number of distance computations becomes a severe bottleneck. We prove the following general result that can help eliminating the bottleneck: for a large and general class of distances it is possible to obtain a very close approximation of each of the O(n²) pairwise distances of n objects by doing only a linear number distance computations, which is optimal with respect to the order of magnitude. Moreover, the approximation factor can be made arbitrarily close to 1, making the approximation error negligible. The needed side computations to achieve this reduction can also be done in polynomial time.

Index Terms—Pattern recognition, approximate distance computation, metric space, normed space.

Andras Farago is with the Department of Computer Science, The University of Texas at Dallas, Richardson, TX 75080, USA (e-mail: farago@utdallas.edu).

[PDF]

Cite:Andras Farago, "Optimizing Massive Distance Computations in Pattern Recognition," International Journal of Machine Learning and Computing vol.4, no. 1, pp. 114-119, 2014.

PREVIOUS PAPER

Development of Artificial Neural Network Architecture for Face Recognition in Real Time

NEXT PAPER

Last page

General Information

E-ISSN: 2972-368X
Abbreviated Title: Int. J. Mach. Learn.
Frequency: Quaterly
DOI: 10.18178/IJML
Editor-in-Chief: Dr. Lin Huang
Executive Editor: Ms. Cherry L. Chen
Abstracing/Indexing: Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals Library, CNKI.
E-mail: ijml@ejournal.net

Home

About IJML

Editorial Board

Author Guideline

Editor Guideline

Reviewer Guideline

Special Issues

Archive

Home > Archive > 2014 > Volume 4 Number 1 (Feb. 2014) >

Optimizing Massive Distance Computations in Pattern Recognition

General Information

Article Metrics in Dimensions