Unifying adversarial training algorithms with data gradient regularization (Neural Computation, 2017)




Alexander G. Ororbia II, Daniel Kifer, C. Lee Giles. 2017. "Unifying Adversarial Training Algorithms with Data Gradient Regularization." Neural ComputationApril 2017. 29(4): 867-87. doi:10.1162/NECO_a_00928



Many previous proposals for adversarial training of deep neural nets have included directly modifying the gradient, training on a mix of original and adversarial examples, using contractive penalties, and approximately optimizing constrained adversarial objective functions. In this article, we show that these proposals are actually all instances of optimizing a general, regularized objective we call DataGrad. Our proposed DataGrad framework, which can be viewed as a deep extension of the layerwise contractive autoencoder penalty, cleanly simplifies prior work and easily allows extensions such as adversarial training with multitask cues. In our experiments, we find that the deep gradient regularization of DataGrad (which also has L1 and L2 flavors of regularization) outperforms alternative forms of regularization, including classical L1, L2, and multitask, on both the original data set and adversarial sets. Furthermore, we find that combining multitask optimization with DataGrad adversarial training results in the most robust performance.