面向动态数据的翻译模型更新方法研究

基本信息

批准号：61806065

项目类别：青年科学基金项目

资助金额：24.00

负责人：卜晨阳

学科分类：

依托单位：合肥工业大学

批准年份：2018

结题年份：2021

起止时间：2019-01-01 - 2021-12-31

项目状态：已结题

项目参与者：何进,王博岩,嵇圣硙,单迎春,蒋婷婷,李娇,盛绍静

关键词：

知识表示学习知识图谱进化计算分布式表示动态优化

结项摘要

As a representative knowledge graph embedding method, the translation-based model (TransE) aims to embed the semantic information of a knowledge base into low-dimensional vector spaces. Because real-world data usually change over time, it is of great significance to study the online updating problem of TransE. However, existing studies mainly focus on static data, and the online updating of TransE has not drawn much attention. This proposal intends to study the online updating of translation-based models from the perspective of dynamic optimization. More specifically, main research contents include: (1) the translation-based models for dynamic data, including the study of optimization functions that do not depend on negative samples to reduce the impact of the expired data on the model; (2) the incremental processing of dynamic data from the perspective of dynamic optimization to avoid re-training the model; (3) the grouping strategy of large-scale optimization problem, so as to decompose the updating problem of translation-based models into several sub-problems to further save the updating time; and (4) constructing a prototype system for dynamic social networks to provide in-depth studies and further improvement of these research contents. This proposal provides a new solution to the updating of the translation-based models, and can contribute to the research on evolutionary dynamic optimization algorithms with practical values.

翻译模型是代表性的知识表示学习模型，旨在将知识库中的语义信息映射到低维的向量空间中。由于实际数据通常具有动态特征，因此研究模型的在线更新问题具有重要意义。然而，已有工作主要面向静态数据，针对动态数据的研究尚处于起步阶段。本项目拟从动态优化的角度研究翻译模型的在线更新问题，主要研究内容包括：（1）面向动态数据的翻译模型，包括研究不依赖于负样本的优化函数以减少数据过期对模型的影响，研究面向动态数据的复杂关系模型，以及研究如何降低模型的更新频率；（2）基于演化算法，从动态优化的角度研究动态数据的增量式处理方法，避免重新训练模型；（3）研究大规模优化问题的分组策略，以将翻译模型的更新问题分解为若干子问题，使得可以进一步减少模型的更新时间；（4）以动态社交网络为例构建原型系统并以此完善和深化研究工作。本项目为面向动态数据的知识表示学习提供新的理论探索，并为研发具有实用价值的演化算法作出有意义的尝试。

项目摘要

知识表示学习旨在将知识库中的语义信息映射到低维的向量空间中。由于实际数据通常具有动态特征，因此研究面向动态数据的模型更新问题具有重要意义。然而，已有工作主要面向静态数据，针对动态数据的研究尚处于起步阶段。本项目以翻译模型这一代表性的知识表示学习模型为切入点，从动态优化的角度，研究面向动态数据的复杂模型。具体研究内容包括：面向动态数据的知识表示学习模型建模，基于演化算法的表示学习模型更新方法，问题的分解策略研究，以及知识表示学习应用研究。在本项目的支持下，项目负责人以第一作者或者学生第一、本人第二/通讯发表学术论文11篇，包括IEEE/ACM Transactions论文2篇，Pattern Recognition、Knowledge-Based Systems、Journal of Database Management 论文各1篇，CCF推荐的国际学术会议论文3篇，以及CCF推荐的A类中文期刊论文1篇；并且申请专利9项。本项目为面向动态数据的知识表示学习提供了新的方法探索，并为研发具有实用价值的演化算法做出了有意义的尝试。

项目成果

DOI：{{i.doi}}

发表时间：{{i.publish_year}}

暂无此项成果

数据更新时间：2023-05-31

其他相关文献

DOI：10.1051/jnwpu/20213920292

发表时间：2021

DOI：10.13199/j.cnki.cst.2020.07.010

发表时间：2020

DOI：

发表时间：2021

DOI：10.1360/SSM-2020-0035

发表时间：2020

DOI：10.3969/j.issn.1004-132X.2020.03.001

发表时间：2020

卜晨阳的其他基金

相似国自然基金

面向辅助翻译的多模型融合方法研究

批准号：61402478

批准年份：2014

负责人：汪昆

学科分类：F0211

资助金额：26.00

项目类别：青年科学基金项目

面向动态数据认知的知识发现理论模型与方法

批准号：61876201

批准年份：2018

负责人：张清华

学科分类：F0605

资助金额：62.00

项目类别：面上项目

面向领域的多粒度动态海量数据挖掘理论模型与方法

批准号：61073146

批准年份：2010

负责人：王国胤

学科分类：F06

资助金额：33.00

项目类别：面上项目

面向大数据、少资源、跨领域汉葡机器翻译方法研究与实现

批准号：61672555

批准年份：2016

负责人：黃輝

学科分类：F0211

资助金额：63.00

项目类别：面上项目

面向动态数据的翻译模型更新方法研究

{{i.achievement_title}}

暂无此项成果

其他相关文献

一种基于多层设计空间缩减策略的近似高维优化方法

智能煤矿建设路线与工程实践

药食兼用真菌蛹虫草的液体发酵培养条件优化

现代优化理论与应用

机电控制无级变速器执行机构动态响应特性仿真研究

卜晨阳的其他基金

相似国自然基金