Research on Mechanisms for General-Purpose Image Semantic Understanding under Limited Supervision from the Internet

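Basic Information

Grant number: 61472276

Project type: General Program

Funding amount: 80.00

Principal investigator: 韩亚洪

Discipline classification:

Host institution: Tianjin University (天津大学)

Approval year: 2014

Completion year: 2018

Duration: 2015-01-01 to 2018-12-31

Project status: Completed

Participants: 孙美君, 张鹏, 杨雅君, 张长青, 张建光, 韦星星, 刘彦镔, 郭强, 范柏翔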

Keywords: limited supervision; cross-domain learning; image semantic understanding; dictionary learning; general-purpose

Final Report Abstract

With the rise of social media and mobile internet applications, the volume of web images has grown explosively. Their social characteristics and ever-increasing scale pose a great challenge for image semantic understanding. Although user comments and tags provide additional semantic cues for image analysis, these annotations are often noisy and only weakly labeled. The available supervision is therefore very limited relative to the huge semantic output space. Moreover, negative and test examples are drawn from a theoretically unbounded semantic space, so no effective prior knowledge about their semantics can be obtained. This proposal aims to build a general-purpose framework for web image semantic understanding under limited supervision. Drawing on recent advances and research focuses in multimedia, computer vision, machine learning, and natural language processing, it addresses four key issues: large-scale dictionary learning supervised by a semantic taxonomy; semi-supervised heterogeneous domain adaptation; structured semantic prediction and semantic description generation for images; and efficient algorithms together with their consistency analysis. By integrating the resulting research outcomes and related technologies, the project will release an image retrieval system based on ontology semantics. The resulting technologies and demonstration platform are intended to support real-world applications of image semantic understanding in web multimedia search, regulation, and services.

Project Abstract

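In web image semantic understanding, user comments and tags provide additional semantic cues for image analysis, but these annotations are often noisy and only weakly labeled. This project studied large-scale dictionary learning supervised by semantic hierarchies, semi-supervised heterogeneous cross-domain learning, structured semantic prediction and semantic description generation for images, and efficient algorithm solving with consistency analysis. It also built an image semantic understanding dataset and a prototype image retrieval system, which were used to validate the effectiveness of the algorithms and the framework. The project produced 43 papers in major domestic and international journals and conferences, including 5 IEEE Trans. papers, 8 CCF-A conference papers, and 6 CCF-B conference papers. Five national invention patent applications were filed for the related technologies, and the related algorithms achieved excellent results in domestic and international evaluations such as MSR Video-to-Language. These outcomes provide technical support and a demonstration platform for practical applications of image semantic understanding in web media search, regulation, and services.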

Project Outcomes

No outcomes are listed for this project.

Data last updated: 2023-05-31

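Other grants held by 韩亚洪

Grant number: 61876130

Approval year: 2018

Funding amount: 64.00

Project type: General Program

Grant number: 61202166

Approval year: 2012

Funding amount: 25.00

Project type: Young Scientists Fund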

Similar NSFC grants

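Research on Semantic Analysis Mechanisms for Internet Video under Weak Supervision

Grant number: 61702165

Approval year: 2017

Principal investigator: 张建光

Discipline classification: F0605

Funding amount: 29.00

Project type: Young Scientists Fund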

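Cross-Media Semantic Understanding of Internet Community Images

Grant number: 61372148

Approval year: 2013

Principal investigator: 刘宏哲

Discipline classification: F0116

Funding amount: 78.00

Project type: General Program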

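Context-Aware Semantic Understanding of Internet Community Images

Grant number: 61272352

Approval year: 2012

Principal investigator: 郎丛妍

Discipline classification: F0210

Funding amount: 80.00

Project type: General Program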

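Research on Weakly Supervised Image Semantic Segmentation in Noisy Environments

Grant number: 61573363

Approval year: 2015

Principal investigator: 卢志武

Discipline classification: F0604

Funding amount: 66.00

Project type: General Program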

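Other grants held by 韩亚洪 (project titles)

Theory and Methods of Adversarial Machine Learning for Image Semantic Understanding

High-Dimensional Structured Sparse Feature Selection and Image Semantic Understanding Mechanisms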
