Multi-agent Q-learning Algorithm Based on Consensus
CUI Haoyan, ZHANG Zhen, ZHAO Dejing, LIAO Dengyu
Control Engineering of China. 2024, (7): 1169-1177.