基于数据驱动的多智能体交互式动态影响图算法研究与应用

基本信息

批准号：61562033

项目类别：地区科学基金项目

资助金额：39.00

负责人：潘颖慧

学科分类：

依托单位：江西财经大学

批准年份：2015

结题年份：2019

起止时间：2016-01-01 - 2019-12-31

项目状态：已结题

项目参与者：曾一锋,杜宾,肖泉,谭亮,钟华,柯有伟

关键词：

多智能体系统影响图序贯决策问题

结项摘要

Individual decision making model provides a general framework for solving sequential multiagent decision making problems. By modeling other agents’ decision making process, a subject agent optimizes its decisions through solving the model. Since the true model of other agents is often unknown to the subject agent, most of the current research assumes a large number of candidate modes of other agents that can be developed by a domain expert. This leads to difficulty in solving complex decision models. This project aims to develop a data-driven multiagent decision model by exploiting available data of agents’ interactions. Instead of manually building models of other agents, this project learns policies of other agents by adapting probabilistic automata inference methods and extends the well-known decision model, namely Interactive Dynamic Influence Diagram (I-DID). It is the first time that this project focuses on online solutions to the extended I-DID models and proposes active learning based techniques to identify the true behavior of other agents. In addition, this project will investigate practical applications of the proposed solutions in multiple experimental platforms. In summary, this project will significantly improve the sequential multiagent decision making techniques and provide an example of integrating machine learning techniques into multiagent decision making research. The research outcomes can also be generalized to other multiagent decision models and inspire new solutions to complex sequential multiagent decision making problems.

从个体决策的角度研究多智能体序贯决策问题是一种普遍适用的方法。通过建模其他智能体的决策过程，主体智能体优化本身的决策。由于主体智能体不知道其他智能体的真实模型，研究主要依赖于其他智能体决策过程模型的建立，并假设存在数目众多的其他智能体候选模型。这造成了繁杂的相互建模过程，导致模型难于求解。本项目利用大量存在的多智能体交互数据，借助概率机器模型推理技术自动学习其他智能体决策行为，建立基于数据驱动的多智能体序贯决策模型。通过拓展交互式动态影响图（I-DID），本项目首次提出模型在线求解技术，采用主动学习策略迅速而有效地确定其他智能体的真实决策行为，并建立多个仿真试验平台以评估研究的实际应用价值。本项目的研究将全面提高多智能体序贯决策模型的求解技术，是将机器学习技术无缝嵌入到多智能体决策研究的典范。该研究成果也可以被广泛地应用到其他多智能体决策模型，为解决复杂多智能体决策问题提供新的思路。

项目摘要

从个体决策的角度研究多智能体序贯决策问题时，主体智能体通常通过建模其他智能体的决策过程优化本身的决策。由于主体智能体不知道其他智能体的真实模型，研究主要依赖于其他智能体决策过程模型的建立并，假设存在数目众多的其他智能体候选模型。这造成了繁杂的相互建模过程，导致模型难于求解。本项目利用大量存在的多智能体交互数据，借助概率机器模型推理技术自动学习其他智能体决策行为，建立基于数据驱动的多智能体序贯决策模型。通过拓展交互式动态影响图，本项目提出模型在线求解技术，采用主动学习策略迅速而有效地确定其他智能体的真实决策行为，并建立多个仿真试验平台以评估研究的实际应用价值。

项目成果

DOI：{{i.doi}}

发表时间：{{i.publish_year}}

暂无此项成果

数据更新时间：2023-05-31

其他相关文献

DOI：10.16368/j.issn.1674-8999.2018.12.569

发表时间：2018

DOI：10.16796/j.cnki.1000-3770.2022.03.003

发表时间：2022

DOI：10.12354/j.issn.1000-8179.2021.20201763

发表时间：2021

DOI：

发表时间：2021

DOI：10.12202/j.0476-0301.2022178

发表时间：2022

潘颖慧的其他基金

批准号：61806089

批准年份：2018

资助金额：27.00

项目类别：青年科学基金项目

相似国自然基金

基于多Agent的通信交互式动态影响图研究及应用

批准号：60975052

批准年份：2009

负责人：罗键

学科分类：F0304

资助金额：31.00

项目类别：面上项目

基于多群体融合与数据驱动的群体智能算法研究

批准号：61673193

批准年份：2016

负责人：宋威

学科分类：F0307

资助金额：59.00

项目类别：面上项目

基于值等价的交互式动态影响图的求解方法研究与应用

批准号：61772442

批准年份：2017

负责人：曾一锋

学科分类：F06

资助金额：58.00

项目类别：面上项目

大规模动态点云数据多尺度交互式实时绘制算法研究

批准号：60873130

批准年份：2008

负责人：万旺根

学科分类：F0209

资助金额：30.00

项目类别：面上项目

基于数据驱动的多智能体交互式动态影响图算法研究与应用

{{i.achievement_title}}

暂无此项成果

其他相关文献

肥胖型少弱精子症的发病机制及中医调体防治

EBPR工艺运行效果的主要影响因素及研究现状

外泌体在胃癌转移中作用机制的研究进展

基于铁路客流分配的旅客列车开行方案调整方法

复杂系统科学研究进展

潘颖慧的其他基金

具有可解释性的竞争对手建模技术研究及其应用

相似国自然基金