Home > Archive > 2014 > Volume 4 Number 6 (Dec. 2014) >
IJMLC 2014 Vol. 4(6): 538-542 ISSN: 2010-3700
DOI: 10.7763/IJMLC.2014.V6.469

Similarity/Dissimilarity of DNA Sequences Based on a New Condensed Curve Representation

Qianjun Xiao

Abstract—Based on a 3-D graphical representation, Bo Liao et al. [B. Liao et al., J. Molec. Struct. (THEOCHEM) 717 (2005) 199] made a comparison for the coding sequences of the first exon of β-globin gene of 11 different species. However, some results in the Tables IV of Liao's were somewhat rational because the main information focus on the cumulative occurrence numbers Si of base A, G, C, T. In this paper, we propose another 3D graphical representation by converting the Si into 1-1/Si. Based on the mathematic invariants S2, the results of comparison for the coding sequences used in Liao's are improved greatly and the examination of similarities among the full coding sequences shows our graphical representation method is more effective to the comparative study of DNA sequences. Furthermore, our graphical curves are compact and the complexities of computation are very small especially for long sequences.

Index Terms—DNA Sequences, graphical representation, numerical characterization, S2, similarity.

Qianjun Xiao is now with Hunan Vocational Institute of Technology, Xiangtan 411104, China (tel.: +86-731-52720616; e-mail: xqjxt@126.com, qjxiao1978@126.com).

[PDF]

Cite: Qianjun Xiao, "Similarity/Dissimilarity of DNA Sequences Based on a New Condensed Curve Representation," International Journal of Machine Learning and Computing vol. 4, no. 6, pp. 538-542, 2014.

General Information

  • E-ISSN: 2972-368X
  • Abbreviated Title: Int. J. Mach. Learn.
  • Frequency: Quaterly
  • DOI: 10.18178/IJML
  • Editor-in-Chief: Dr. Lin Huang
  • Executive Editor:  Ms. Cherry L. Chen
  • Abstracing/Indexing: Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals LibraryCNKI.
  • E-mail: ijml@ejournal.net


Article Metrics in Dimensions