Research and Application of Value-Equivalence-Based Solution Methods for Interactive Dynamic Influence Diagrams

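Basic Information

Approval number: 61772442

Project category: General Program

Funding amount: 58.00

Principal investigator: Zeng Yifeng (曾一锋)

Discipline classification:

Host institution: Xiamen University

Year approved: 2017

Year concluded: 2021

Project period: 2018-01-01 - 2021-12-31

Project status: Concluded

Project participants: Luo Ye (罗晔), Chen Bilian (陈碧连), Cao Langcai (曹浪财), Ma Biyang (马碧阳), Sun Ting (孙婷), Wang Pengpeng (王鹏鹏), Yu Shenbao (余深宝), Shen Gaoyang (沈杲阳)

Keywords:

multi-agent learning; multi-agent planning; multi-agent reasoning; multi-agent systems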

Final Report Abstract

Multiagent systems are one of the most important areas of artificial intelligence research and play a significant role in many applications. Interactive dynamic influence diagrams (I-DIDs) are well recognized as a general technique for solving multiagent sequential decision-making problems under uncertainty. Building on a large body of I-DID research from past years, this project aims to complement existing I-DID solutions by proposing a new type of solution developed from the principle of value equivalence. Unlike previous I-DID solutions, which are based entirely on behavioral equivalence, the new solutions can significantly reduce solution complexity. More importantly, they provide a theoretical guarantee on the quality of agents' optimal policies, which improves the reliability of applying I-DIDs in practical settings. The project will develop the solutions using recent advances in artificial intelligence, including submodular function optimization, Monte Carlo tree search, active learning, and generative adversarial networks. It aims to guarantee the correctness of the new I-DID solutions theoretically and to ensure their effectiveness in practice. The research outcomes will benefit the further development of artificial intelligence research and applications.

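Project Abstract

Multiagent systems are an important area of artificial intelligence research and play a major role in many application domains. The interactive dynamic influence diagram (I-DID) is a generally applicable technique for solving multiagent sequential decision-making problems under uncertainty, and it is widely recognized in multiagent systems research. Building on a large body of earlier I-DID work, this project further improved I-DID solution methods and proposed a solution framework based on value equivalence. Unlike traditional I-DID solution methods based on the behavioral equivalence criterion, the new technique greatly reduces the complexity of solving the model and provides a rigorous theoretical guarantee on the quality of an agent's optimal decisions, which improves the reliability of I-DIDs in practical applications. The project used recent artificial intelligence techniques, including submodular function optimization, Monte Carlo tree search, and generative adversarial networks, to develop a set of I-DID solution algorithms that are theoretically correct and effective in practice.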

Project Outcomes

No outcomes of this type are available.

Data last updated: 2023-05-31

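Other Grants Held by Zeng Yifeng (曾一锋)

Learning Unknown Opponent Models Based on Interactive Dynamic Influence Diagrams

Approval number: 61375070

Year approved: 2013

Funding amount: 76.00

Project category: General Program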

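Similar NSFC Grants

Research and Application of Multi-Agent-Based Communicative Interactive Dynamic Influence Diagrams

Approval number: 60975052

Year approved: 2009

Principal investigator: Luo Jian (罗键)

Discipline classification: F0304

Funding amount: 31.00

Project category: General Program

Research and Application of Data-Driven Multiagent Interactive Dynamic Influence Diagram Algorithms

Approval number: 61562033

Year approved: 2015

Principal investigator: Pan Yinghui (潘颖慧)

Discipline classification: F06

Funding amount: 39.00

Project category: Fund for Less Developed Regions

Learning Unknown Opponent Models Based on Interactive Dynamic Influence Diagrams

Approval number: 61375070

Year approved: 2013

Principal investigator: Zeng Yifeng (曾一锋)

Discipline classification: F0305

Funding amount: 76.00

Project category: General Program

Research on Operation Control of Photovoltaic-Storage Microgrids Based on Interactive Dynamic Influence Diagrams

Approval number: 61703091

Year approved: 2017

Principal investigator: Li Bo (李波)

Discipline classification: F0302

Funding amount: 23.00

Project category: Young Scientists Fund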
