IJMLC 2015 Vol. 5(4): 277-282 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2015.V5.520

Keyword Clustering for Comparing Documents in Different Languages

J. Tae and D. Shin
Abstract—The objective of this study was to complement natural language processing of a content-based retrieval system by applying keyword clustering. We focused on comparing documents in two languages. To evaluate the performance of this approach, we clustered keywords using the features of documents and performed document clustering using the results of keyword clustering. The purity and the entropy of document clustering revealed that keyword clustering resulted in improvements in the quality of document clustering and allowed us to measure similarities between documents in different languages.

Index Terms—Keyword clustering, dictionary, document clustering, purity, entropy, export control.
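
The abstract reports purity and entropy as the measures of document clustering quality but does not define them. The sketch below uses the common textbook definitions, which may differ from the exact variant used in the paper: purity is the fraction of documents that belong to the majority class of their cluster, and entropy is the size-weighted Shannon entropy (in bits, an assumption) of the class distribution within each cluster. The function name `purity_and_entropy` and the sample data are hypothetical and serve only as illustration.

```python
from collections import Counter, defaultdict
from math import log2


def purity_and_entropy(cluster_ids, class_labels):
    """Return (purity, entropy) of a clustering against reference labels."""
    if len(cluster_ids) != len(class_labels) or not cluster_ids:
        raise ValueError("inputs must be non-empty and of equal length")

    # Count how often each reference class occurs inside each cluster.
    clusters = defaultdict(Counter)
    for cluster, label in zip(cluster_ids, class_labels):
        clusters[cluster][label] += 1

    total = len(cluster_ids)
    majority_sum = 0
    weighted_entropy = 0.0
    for counts in clusters.values():
        size = sum(counts.values())
        # Purity credits each cluster with its most frequent class.
        majority_sum += max(counts.values())
        # Shannon entropy (bits) of the class distribution in this cluster.
        h = -sum((n / size) * log2(n / size) for n in counts.values())
        weighted_entropy += (size / total) * h

    return majority_sum / total, weighted_entropy


# Hypothetical data: six documents, two clusters, labeled by topic.
clusters = [0, 0, 0, 1, 1, 1]
topics = ["export", "export", "nuclear", "nuclear", "nuclear", "nuclear"]
purity, entropy = purity_and_entropy(clusters, topics)
```

Higher purity and lower entropy both indicate clusters that agree more closely with the reference classes, which is the sense in which the abstract describes an improvement in clustering quality.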

The authors are with the Korea Institute of Nuclear Nonproliferation and Control (KINAC), 1534 Yuseong-daero, Yuseong-gu, Daejeon, 305-348, Republic of Korea (e-mail: ttjjww@postech.ac.kr, nucleo@kinac.re.kr).

Cite: J. Tae and D. Shin, "Keyword Clustering for Comparing Documents in Different Languages," International Journal of Machine Learning and Computing, vol. 5, no. 4, pp. 277-282, 2015.
