• Dec 12, 2017 News!Good News! All papers from Volume 7, Number 1 to Volume 7, Number 5 have been indexed by Scopus!   [Click]
  • Jul 03, 2017 News!Good News! Since 2017, IJMLC has been indexed by Scopus!
  • Nov 14, 2017 News!Vol.7, No.5 has been published with online version.   [Click]
Search
General Information
Editor-in-chief
Dr. Lin Huang
Metropolitan State University of Denver, USA
It's my honor to take on the position of editor in chief of IJMLC. We encourage authors to submit papers concerning any branch of machine learning and computing.
IJMLC 2015 Vol. 5(4): 277-282 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2015.V5.520

Keyword Clustering for Comparing Documents in Different Languages

J. Tae and D. Shin
Abstract—The objective of this study was to complement natural language processing of a content-based retrieval system by applying keyword clustering. We focused on comparing documents in two languages. To evaluate the performance of this approach, we clustered keywords using the features of documents and performed document clustering using the results of keyword clustering. The purity and the entropy of document clustering revealed that keyword clustering resulted in improvements in the quality of document clustering and allowed us to measure similarities between documents in different languages.

Index Terms—Keyword clustering, dictionary, document clustering, purity, entropy, export control.

The authors are with the Korea Institute of Nuclear nonproliferation and control (KINAC), 1534 Yuseong-daero, Yuseong-gu, Daejeon, 305-348, Republic of Korea (e-mail: ttjjww@postech.ac.kr, nucleo@kinac.re.kr).

[PDF]

Cite: J. Tae and D. Shin, "Keyword Clustering for Comparing Documents in Different Languages," International Journal of Machine Learning and Computing vol. 5, no. 4, pp. 277-282, 2015.

Copyright © 2008-2015. International Journal of Machine Learning and Computing. All rights reserved.
E-mail: ijmlc@ejournal.net