论文标题
通过空间动量增强对抗性示例的可传递性
Enhancing Transferability of Adversarial Examples with Spatial Momentum
论文作者
论文摘要
许多对抗性攻击方法在白色盒子设置下达到了令人满意的攻击成功率,但是在攻击其他DNN模型时,它们通常显示出差的可传递性。基于动量的攻击是提高可转移性的一种有效方法。它将动量术语集成到迭代过程中,该过程可以通过添加每个像素的梯度的时间相关性来稳定更新方向。我们认为,只有这种时间动量还不够,即图像中空间域的梯度,即来自目标像素以稳定为中心的上下文像素的梯度也很重要。为此,我们提出了一种名为“空间动量迭代FGSM攻击(SMI-FGSM)”的新方法,该方法通过考虑来自图像中不同区域的上下文信息,引入了动量积累的机制。然后将SMI-FGSM与时间动量集成在一起,以同时稳定梯度的更新方向。广泛的实验表明,我们的方法确实进一步增强了对抗性可传递性。它可以为多个主流未防御和辩护的模型达到最佳的可传递性成功率,这平均超过了最新的攻击方法,平均差距为10 \%。
Many adversarial attack methods achieve satisfactory attack success rates under the white-box setting, but they usually show poor transferability when attacking other DNN models. Momentum-based attack is one effective method to improve transferability. It integrates the momentum term into the iterative process, which can stabilize the update directions by adding the gradients' temporal correlation for each pixel. We argue that only this temporal momentum is not enough, the gradients from the spatial domain within an image, i.e. gradients from the context pixels centered on the target pixel are also important to the stabilization. For that, we propose a novel method named Spatial Momentum Iterative FGSM attack (SMI-FGSM), which introduces the mechanism of momentum accumulation from temporal domain to spatial domain by considering the context information from different regions within the image. SMI-FGSM is then integrated with temporal momentum to simultaneously stabilize the gradients' update direction from both the temporal and spatial domains. Extensive experiments show that our method indeed further enhances adversarial transferability. It achieves the best transferability success rate for multiple mainstream undefended and defended models, which outperforms the state-of-the-art attack methods by a large margin of 10\% on average.
