Gradient Masking Is a Type of Overfitting

Home > Archive > 2018 > Volume 8 Number 3 (Jun. 2018) >

IJMLC 2018 Vol.8(3): 203-207 ISSN: 2010-3700
DOI: 10.18178/ijmlc.2018.8.3.688

Yusuke Yanagita and Masayuki Yamamura

Abstract—Neural networks have recently been attracting attention again as classifiers with high accuracy, so called “deep learning,” which is applied in a wide variety of fields. However, this advanced machine learning algorithms are vulnerable to adversarial perturbations. Although they cannot be recognized by humans, these perturbations deliver a fatal blow to the estimation ability of classifiers. Thus, while humans perceive perturbed examples as being the same as the original natural examples, sophisticated classifiers identify them as completely different examples. Although several defensive measures against such adversarial examples have been suggested, they are known to fail in undesirable phenomena, gradient masking. Gradient masking can neutralize the useful gradient for adversaries, but adversarial perturbations tend to transfer across most models, and these models can be deceived by adversarial examples crafted based on other models, which is called a black-box attack. Therefore, it is necessary to develop training methods to withstand black-box attacks and conduct studies to investigate the weak points of current NN training. This paper argues that no special defensive measures are necessary for NN to fall into gradient masking, and it is sufficient to slightly change the initial learning rate of Adam from the recommended value. Moreover, our experiment implies that gradient masking is a type of overfitting.

Index Terms—Adam, adversarial examples, gradient masking, machine learning, neural network.

The authors are with the Department of Computer Science, School of Computer, Tokyo Institute of Technology, Yokohama, 226-8503, Japan (e-mail: y_yanagita@ali.c.titech.ac.jp, my@c.titech.ac.jp).

[PDF]

Cite: Yusuke Yanagita and Masayuki Yamamura, "Gradient Masking Is a Type of Overfitting," International Journal of Machine Learning and Computing vol. 8, no. 3, pp. 203-207, 2018.

PREVIOUS PAPER

Speech Emotion Recognition Based on SVM and ANN

NEXT PAPER

Data- and Algorithm-Hybrid Approach for Imbalanced Data Problems in Deep Neural Network

General Information

E-ISSN: 2972-368X
Abbreviated Title: Int. J. Mach. Learn.
Frequency: Quaterly
DOI: 10.18178/IJML
Editor-in-Chief: Dr. Lin Huang
Executive Editor: Ms. Cherry L. Chen
Abstracing/Indexing: Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals Library, CNKI.
E-mail: ijml@ejournal.net

Home

About IJML

Editorial Board

Author Guideline

Editor Guideline

Reviewer Guideline

Special Issues

Archive

Home > Archive > 2018 > Volume 8 Number 3 (Jun. 2018) >

Gradient Masking Is a Type of Overfitting

General Information

Article Metrics in Dimensions