Comparison of Random Forest and SVM for Raw Data in Drug Discovery: Prediction of Radiation Protection and Toxicity Case Study

Home > Archive > 2016 > Volume 6 Number 2 (Apr. 2016) >

IJMLC 2016 Vol.6(2): 145-148 ISSN: 2010-3700
DOI: 10.18178/ijmlc.2016.6.2.589

Atsushi Matsumoto, Shin Aoki, and Hayato Ohwada

Abstract—This paper compares random forest and SVM for raw data in drug discovery. Both machine-learning methods are often applied in drug discovery. We should select our methods depending on the problem. This is very important. SVM is suitable for virtual screening when the target protein is known. In contrast, random forest is suitable for virtual screening when the target protein is not decided uniquely or unknown, because random forest can find good combinations of features from many features. Therefore, random forest is thus more effective for problems including many unknown parts. Incidentally, selecting the good features is important in both methods. In particular, we must narrow the features using importance calculations if we lack sufficient biochemical knowledge. In this study, we predicted the radiation protection function and toxicity for radioprotectors targeting p53 as a case study. When predicting the radiation protection function the target protein is known. In contrast, when predicting toxicity, the target protein is not decided uniquely or is unknown. We evaluated each experiment based on its AUC score. As a result, we found that when predicting the radiation protection function, SVM was better than random forest. By contrast, when predicting toxicity, random forest was better than SVM.

Index Terms—SVM, random forest, drug discovery, radioprotector.

Atsushi Matsumoto, Shin Aoki, and Hayato Ohwada are with the Tokyo University of Science, Noda-shi, Chiba-ken, Japan (e-mail: 7415617@ed.tus.ac.jp, shinaoki@rs.tus.ac.jp, ohwada@rs.tus.ac.jp).

[PDF]

Cite: Atsushi Matsumoto, Shin Aoki, and Hayato Ohwada, "Comparison of Random Forest and SVM for Raw Data in Drug Discovery: Prediction of Radiation Protection and Toxicity Case Study," International Journal of Machine Learning and Computing vol.6, no. 2, pp. 145-148, 2016.

PREVIOUS PAPER

Muscular Dystrophy Disease Classification Using Relative Synonymous Codon Usage

NEXT PAPER

Predicting Online Doctor Ratings from User Reviews Using Convolutional Neural Networks

General Information

E-ISSN: 2972-368X
Abbreviated Title: Int. J. Mach. Learn.
Frequency: Quaterly
DOI: 10.18178/IJML
Editor-in-Chief: Dr. Lin Huang
Executive Editor: Ms. Cherry L. Chen
Abstracing/Indexing: Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals Library, CNKI.
E-mail: ijml@ejournal.net

Home

About IJML

Editorial Board

Author Guideline

Editor Guideline

Reviewer Guideline

Special Issues

Archive

Home > Archive > 2016 > Volume 6 Number 2 (Apr. 2016) >

Comparison of Random Forest and SVM for Raw Data in Drug Discovery: Prediction of Radiation Protection and Toxicity Case Study

General Information

Article Metrics in Dimensions