Offense and defence against adversarial sample: A reinforcement learning method in energy trading market

Li, Donghe and Yang, Qingyu and Ma, Linyue and Peng, Zhenhua and Liao, Xiao (2023) Offense and defence against adversarial sample: A reinforcement learning method in energy trading market. Frontiers in Energy Research, 10. ISSN 2296-598X

[thumbnail of pubmed-zip/versions/2/package-entries/fenrg-10-1071973-r1/fenrg-10-1071973.pdf] Text
pubmed-zip/versions/2/package-entries/fenrg-10-1071973-r1/fenrg-10-1071973.pdf - Published Version

Download (6MB)

Abstract

The energy trading market that can support free bidding among electricity users is currently the key method in smart grid demand response. Reinforcement learning is used to formulate optimal strategies for them to obtain optimal strategies. Non-etheless, the security problem raised by artificial intelligence technology has been paid more and more attention. For example, the neural network has been proved to be able to resist adversarial example attacks, thus affecting its training results. Considering that reinforcement learning is also widely used for training by neural networks, the security problem can not be ignored, especially in scenarios with high security requirements such as smart grids. To this end, we study the security issues in reinforcement learning-based bidding strategy method facing by the adversarial example. First of all, regarding to the electric vehicle double auction market, we formalize the bidding decision problem of EVs into a Markov Decision Process, so that reinforcement learning is used to solve this problem. Secondly, from the perspective of attackers, we have designed a local Fast Gradient Sign Method which affects the environment and the results of reinforcement learning by changing its own bidding. Then, from the perspective of the defender, we designed a reinforcement learning training network containing an attack identifier based on the deep neural network, so as to identify malicious injection attacks to resist against adversarial attacks. Finally, comprehensive simulations are conducted to verify our proposed method. The results shows that, our proposed attack method will reduce the auction profit by influencing reinforcement learning algorithm, and the protect method will be able to completely resist such attacks.

Item Type: Article
Subjects: Librbary Digital > Energy
Depositing User: Unnamed user with email support@librbarydigit.com
Date Deposited: 29 Apr 2023 07:02
Last Modified: 01 Aug 2024 10:24
URI: http://info.openarchivelibrary.com/id/eprint/508

Actions (login required)

View Item
View Item