基于强化学习的分布参数系统数据驱动控制

基本信息

批准号：61503377

项目类别：青年科学基金项目

资助金额：21.00

负责人：罗彪

学科分类：

依托单位：中国科学院自动化研究所

批准年份：2015

结题年份：2018

起止时间：2016-01-01 - 2018-12-31

项目状态：已结题

项目参与者：阎鹏飞,李超,徐延才,马宏文,石光,唐冲,林汉权

关键词：

强化学习数据驱动控制偏微分方程分布参数系统

结项摘要

Most of practical industrial processes are distributed parameter system (DPS), which are essentially described by a set of complex nonlinear partial differential equations. Due to the infinite-dimensional nature of the DPSs, the direct application of the control theories and methods for lumped parameter systems to them is impossible. Moreover, with the fast developments of science technologies, many industrial processes become more and more complicated due to their large scale and complex manufacturing techniques, equipments and procedures. Therefore, the accurately modeling and identification of these processes are often costly to conduct, or the established models are too complicated to support controller design. To overcome these difficulties, this project aims at studying reinforcement learning methods for data-driven control problem of the DPSs, and establishing its theories for performance and stability analysis. The effectiveness and the practical feasibility of the methods will be overified with computer simulations. Through the research of this project, some novel and effective methods and theories will be provided for control design of DPSs, which are extremely important for the development of data-driven control theories and meaningful in both scientific researches and real engineering applications.

大部分实际工业过程均为分布参数系统(DPS)，它们本质上由复杂的非线性偏微分方程描述，DPS具有无穷维自由度的特征，所以现有针对集中参数系统的控制理论与方法无法直接用于DPS。而且，随着科学技术的快速发展，工业系统的规模越来越大，生产过程越来越复杂，导致精确建立DPS数学模型的代价非常大，或是模型非常复杂而无法用于控制器设计。为解决这一困难，本项目拟引入强化学习的思想，研究DPS数据驱动控制问题，建立相应的性能分析与稳定性理论，并通过计算机仿真，验证方法的有效性，和探讨实际应用的可行性。通过对本项目的研究，将为DPS的控制设计提供一些新的、有效的方法和理论依据，促进数据驱动控制理论的发展，具有重要的科学意义和应用价值。

项目摘要

大部分实际工业过程均为分布参数系统(DPS)，它们本质上由复杂的非线性偏微分方程描述，DPS具有无穷维自由度的特征，所以现有针对集中参数系统的控制理论与方法无法直接用于DPS。而且，随着科学技术的快速发展，工业系统的规模越来越大，生产过程越来越复杂，导致精确建立DPS数学模型的代价非常大，或是模型非常复杂而无法用于控制器设计。为解决这一困难，本项目引入强化学习的思想，研究了数据驱动最优控制问题，提出了一系列基于强化学习的控制方法及理论。相关成果发表了SCI期刊论文16篇，国际学术会议论文3篇，包括领域顶级期刊：IEEE Transactions on Cybernetics, IEEE Transactions on Neural Networks and Learning Systems, IEEE Transactions on Industrial Electronics。

项目成果

DOI：{{i.doi}}

发表时间：{{i.publish_year}}

暂无此项成果

数据更新时间：2023-05-31

其他相关文献

DOI：10.16796/j.cnki.1000-3770.2022.03.003

发表时间：2022

DOI：10.12202/j.0476-0301.2022178

发表时间：2022

DOI：10.13197/j.eeev.2019.05.95.fuwq.009

发表时间：2019

DOI：10.13199/j.cnki.cst.2020.07.010

发表时间：2020

DOI：10.16383/j.aas.c180673

发表时间：2021

罗彪的其他基金

批准号：61873350

批准年份：2018

资助金额：63.00

项目类别：面上项目

批准号：70802058

批准年份：2008

资助金额：13.50

项目类别：青年科学基金项目

批准号：71272064

批准年份：2012

资助金额：50.00

项目类别：面上项目

相似国自然基金

复杂动态系统数据驱动的强化学习控制研究

批准号：61573052

批准年份：2015

负责人：李大字

学科分类：F0301

资助金额：65.00

项目类别：面上项目

基于数据驱动建模的分布参数系统预测控制策略研究

批准号：60604017

批准年份：2006

负责人：邹涛

学科分类：F0301

资助金额：23.00

项目类别：青年科学基金项目

分布参数系统的迭代学习控制及其应用

批准号：61364006

批准年份：2013

负责人：戴喜生

学科分类：F0301

资助金额：45.00

项目类别：地区科学基金项目

基于数据驱动的非高斯随机分布系统优化设定控制

批准号：61573190

批准年份：2015

负责人：殷利平

学科分类：F0301

资助金额：64.00

项目类别：面上项目

基于强化学习的分布参数系统数据驱动控制

{{i.achievement_title}}

暂无此项成果

其他相关文献

EBPR工艺运行效果的主要影响因素及研究现状

复杂系统科学研究进展

基于被动变阻尼装置高层结构风振控制效果对比分析

智能煤矿建设路线与工程实践

二维FM系统的同时故障检测与控制

罗彪的其他基金

离散时间系统的脱策强化学习鲁棒优化控制

企业集团总部对异质子公司的滚动过程绩效管理方法研究

集团交互控制系统：行为自适应性与动态演化机制

相似国自然基金