Novel Approximate Statistical Algorithm for Large Complex Datasets

Home > Archive > 2012 > Volume 2 Number 5 (Oct. 2012) >

IJMLC 2012 Vol.2(5): 720-724 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2012.V2.222

Yohei Takeuchi, Momoyo Ito, and Minoru Fukumi

Abstract—In pattern recognition, principal component analysis (PCA) is one of the best-known feature extraction methods for reducing the dimensionality of high-dimensional datasets. Simple-PCA (SPCA), a faster version of PCA, learns effectively through iterative operations. However, SPCA may be inefficient when the input data are distributed in a complex manner, because it learns without using the class information in the dataset. SPCA therefore cannot be considered optimal as a feature extraction method for classification. In this study, we propose a new learning algorithm that uses the class information in the dataset. The eigenvectors spanning the eigenspace of the dataset are produced by calculating the data variations within each class. We present the proposed algorithm and discuss the results of experiments on UCI datasets that compare it with SPCA.

Index Terms—Pattern recognition, principal component analysis, supervised learning.
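
The abstract does not spell out the SPCA update rule, so the following is only a minimal sketch of the iterative learning it refers to, assuming the commonly described Simple-PCA scheme: each sample is flipped to point the same way as the current estimate, the flipped samples are summed and normalized, and the found direction is then removed from the data before the next component is learned. The function name `simple_pca` and its parameters are hypothetical, and this is not the class-aware algorithm proposed in the paper.

```python
import math
import random


def _dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))


def simple_pca(data, n_components, n_iter=50, seed=0):
    """Approximate leading principal directions (assumed Simple-PCA update)."""
    rng = random.Random(seed)
    dim = len(data[0])
    # Center the data so directions describe variation around the mean.
    mean = [sum(row[i] for row in data) / len(data) for i in range(dim)]
    residual = [[row[i] - mean[i] for i in range(dim)] for row in data]

    components = []
    for _ in range(n_components):
        # Start from a random unit vector.
        a = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        norm = math.sqrt(_dot(a, a))
        a = [v / norm for v in a]
        for _ in range(n_iter):
            # Sum the samples, flipping those that point away from the estimate.
            total = [0.0] * dim
            for x in residual:
                sign = 1.0 if _dot(a, x) >= 0.0 else -1.0
                for i in range(dim):
                    total[i] += sign * x[i]
            norm = math.sqrt(_dot(total, total))
            if norm == 0.0:
                break  # no variation left; keep the current estimate
            a = [v / norm for v in total]
        components.append(a)
        # Deflation: remove the found direction from every sample.
        deflated = []
        for x in residual:
            proj = _dot(a, x)
            deflated.append([x[i] - proj * a[i] for i in range(dim)])
        residual = deflated
    return components
```

Calling `simple_pca(rows, n_components=2)` on a list of numeric rows returns two unit-length direction vectors. Note that no class labels enter the computation, which is the limitation the abstract attributes to SPCA.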

Y. Takeuchi is a doctoral course student with the Graduate School of Advanced Technology and Science, the University of Tokushima, Tokushima, 770-8506 Japan (e-mail: takeuchi-yohei@is.tokushima-u.ac.jp).
M. Ito is an Assistant Professor with the Department of Information Science and Intelligent Systems, the University of Tokushima, Tokushima, 770-8506 Japan (e-mail: momoito@is.tokushima-u.ac.jp).
M. Fukumi is a Professor with the Department of Information Science and Intelligent Systems, the University of Tokushima, Tokushima, 770-8506 Japan (e-mail: fukumi@is.tokushima-u.ac.jp).

Cite: Yohei Takeuchi, Momoyo Ito, and Minoru Fukumi, "Novel Approximate Statistical Algorithm for Large Complex Datasets," International Journal of Machine Learning and Computing, vol. 2, no. 5, pp. 720-724, 2012.

General Information

E-ISSN: 2972-368X
Abbreviated Title: Int. J. Mach. Learn.
Frequency: Quarterly
DOI: 10.18178/IJML
Editor-in-Chief: Dr. Lin Huang
Executive Editor: Ms. Cherry L. Chen
Abstracting/Indexing: Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals Library, CNKI.
E-mail: ijml@ejournal.net
