IJMLC 2013 Vol. 3(1): 164-167 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2013.V3.294

An Audio Retrieval Algorithm Based on Audio Shot and Inverted Index

Xueyuan Zhang and Qianhua He

Abstract—An efficient audio indexing and retrieval algorithm is proposed to locate similar audio segments in a database. A new boundary detection technique based on audio shots is proposed for audio segmentation. A new method, which uses a self-learning audio shot dictionary, then converts the audio shot sequence into an audio word sequence. We also borrow the idea of the inverted file from text retrieval to locate candidates efficiently. Furthermore, a similarity measure combining content matching and temporal order matching is proposed. Experimental results show a retrieval precision of 94.70% with an average response time of 6.344 seconds.

Index Terms—Audio retrieval, audio word, inverted file, temporal similarity.
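
The retrieval stage outlined in the abstract (inverted-file candidate lookup followed by a similarity score that mixes content and temporal order) can be illustrated with a minimal Python sketch. Everything here is an assumption for illustration only: the abstract does not give the paper's formulas, so audio words are represented as plain integers, content similarity is stood in for by Jaccard overlap, temporal similarity by a normalised longest common subsequence, and the weight `alpha` and all function names are hypothetical.

```python
from collections import defaultdict


def build_inverted_index(database):
    """Map each audio word to the set of clip ids containing it."""
    index = defaultdict(set)
    for clip_id, words in database.items():
        for word in words:
            index[word].add(clip_id)
    return index


def find_candidates(index, query, min_shared=1):
    """Return clip ids sharing at least `min_shared` distinct words with the query."""
    votes = defaultdict(int)
    for word in set(query):
        for clip_id in index.get(word, ()):
            votes[clip_id] += 1
    return {clip_id for clip_id, count in votes.items() if count >= min_shared}


def content_similarity(query, clip):
    """Jaccard overlap of the two word sets (order is ignored). Assumed measure."""
    a, b = set(query), set(clip)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def temporal_similarity(query, clip):
    """Longest common subsequence length divided by query length. Assumed measure."""
    if not query:
        return 0.0
    prev = [0] * (len(clip) + 1)
    for q in query:
        curr = [0]
        for j, c in enumerate(clip, 1):
            if q == c:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1] / len(query)


def retrieve(database, index, query, alpha=0.5):
    """Rank candidate clips by a weighted mix of content and temporal similarity."""
    scored = []
    for clip_id in find_candidates(index, query):
        clip = database[clip_id]
        score = (alpha * content_similarity(query, clip)
                 + (1 - alpha) * temporal_similarity(query, clip))
        scored.append((clip_id, score))
    # Highest score first; ties broken by clip id for a stable order.
    return sorted(scored, key=lambda item: (-item[1], item[0]))
```

In this sketch the inverted index restricts scoring to clips that share at least one audio word with the query, so the more expensive order-sensitive comparison runs only on candidates. Two clips with the same words in a different order receive the same content score but different temporal scores, which is the motivation for combining the two.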

The authors are with the School of Electronic and Information Engineering, South China University of Technology, Guangzhou, 510640, China (e-mail: zhang.xueyuan@mail.scut.edu.cn, eeqhhe@scut.edu.cn).


Cite: Xueyuan Zhang and Qianhua He, "An Audio Retrieval Algorithm Based on Audio Shot and Inverted Index," International Journal of Machine Learning and Computing, vol. 3, no. 1, pp. 164-167, 2013.

General Information

  • ISSN: 2010-3700 (Online)
  • Abbreviated Title: Int. J. Mach. Learn. Comput.
  • Frequency: Bimonthly
  • DOI: 10.18178/IJMLC
  • Editor-in-Chief: Dr. Lin Huang
  • Executive Editor: Ms. Cherry L. Chen
  • Abstracting/Indexing: Scopus (since 2017), Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals Library.
  • E-mail: ijmlc@ejournal.net