With the rapid development of social networks, collaborative retrieval has become a research focus in the field of information retrieval, with important practical significance for improving the accuracy and efficiency of Web search within communities. Communities change constantly, so they must be identified dynamically, but three characteristics of the search process become bottlenecks: ① data sparsity, ② high dimensionality, and ③ frequently updated (dynamic) data. This project consists of three parts. ⑴ Probabilistic methods are used to express the relevance among users, queries, and documents. ⑵ To handle sparse, high-dimensional data, we extend information-theoretic co-clustering, originally used only to analyze two-dimensional contingency tables, so that it is suitable for three-dimensional data. ⑶ For dynamic data, we start from the dimension that needs updating and incrementally renew the three-dimensional relevance and the co-clustering results, which improves update efficiency. We also analyze and discuss a recommendation method based on collaboration and real-time properties. The project is based on probabilistic relevance in the three-dimensional space, focuses on identifying communities dynamically, balances theoretical analysis with experimental verification, and provides a new idea for further research on and implementation of collaborative retrieval methods.
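With the rapid development of social networks, collaborative retrieval has become a research focus in the field of information retrieval, and it has important practical significance for improving the accuracy and efficiency of retrieval by users within a community. Communities change constantly, so community information must be updated continuously, but the characteristics of the retrieval process make this very difficult: ① the data are sparse; ② the feature space is high-dimensional; ③ the data are updated frequently. This project studies these three characteristics and comprises: ⑴ a three-dimensional relevance model, which builds a three-dimensional space made up of users, queries, and documents and uses probabilistic methods to quantify the relevance among the three dimensions; ⑵ a co-clustering-based method for determining communities dynamically, which addresses characteristics ① and ② by extending information-theoretic co-clustering, originally used only to analyze two-dimensional contingency tables, so that it applies to three-dimensional problems and can then determine user communities dynamically; ⑶ an incremental learning mechanism, which addresses characteristic ③ by starting from the dimension in which the data lie and incrementally updating the three-dimensional probabilistic relations and the co-clustering results. The project is based on probabilistic relations in the three-dimensional space, focuses on the dynamic updating of communities, balances theoretical analysis with practical verification, and provides a new idea for further research on and application of collaborative retrieval methods.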
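To make the three-dimensional relevance model more concrete, here is a minimal sketch, not the project's actual method. It assumes relevance is estimated from raw (user, query, document) co-occurrence events, and that the 3D analogue of the mutual-information loss minimized by two-dimensional information-theoretic co-clustering is the loss in multi-information (total correlation). All function names and the toy events are hypothetical.

```python
from collections import Counter
from math import log2

def normalize(counts):
    """Turn raw (user, query, document) counts into a joint distribution p(u, q, d)."""
    total = sum(counts.values())
    return {key: n / total for key, n in counts.items()}

def marginal(p, axis):
    """Marginal distribution along one axis (0 = user, 1 = query, 2 = document)."""
    m = Counter()
    for key, prob in p.items():
        m[key[axis]] += prob
    return m

def multi_information(p):
    """I(U;Q;D) = sum of p(u,q,d) * log2(p(u,q,d) / (p(u) p(q) p(d))), in bits."""
    pu, pq, pd = (marginal(p, axis) for axis in range(3))
    return sum(prob * log2(prob / (pu[u] * pq[q] * pd[d]))
               for (u, q, d), prob in p.items())

def coarsen(p, user_map, query_map, doc_map):
    """Aggregate p(u,q,d) over a hard clustering of each of the three dimensions."""
    out = Counter()
    for (u, q, d), prob in p.items():
        out[(user_map[u], query_map[q], doc_map[d])] += prob
    return dict(out)

# Hypothetical toy log: each event is one (user, query, clicked document) triple.
counts = Counter([("u1", "q1", "d1"), ("u2", "q2", "d2")])
p = normalize(counts)
full_info = multi_information(p)  # 2.0 bits for this toy log

# Merging both users into one cluster discards part of the dependence.
coarse = coarsen(p, {"u1": 0, "u2": 0}, {"q1": 0, "q2": 1}, {"d1": 0, "d2": 1})
loss = full_info - multi_information(coarse)  # 1.0 bit lost

# Incremental step: fold new events into the counts, then renormalize,
# instead of rebuilding the counts from the full log.
counts.update([("u1", "q1", "d1")])
p_updated = normalize(counts)
```

A co-clustering procedure in this spirit would search for the three cluster maps that keep `loss` small. The sketch only evaluates a given clustering and does not perform that search.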
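Motivated by the sparsity, high-dimensional feature space, and frequent updates of social network data, this project studied fuzzy clustering algorithms and made three contributions. First, based on information bottleneck theory, it proposed the fuzzy co-clustering algorithm ibFCC, which achieved high clustering quality. Second, it extended the idea of fuzzy clustering from two dimensions to three and proposed two fuzzy three-dimensional co-clustering algorithms, FTC and ibFTC, which cluster along all three dimensions simultaneously. Finally, for continuously updated streaming data, it proposed the information-bottleneck-based fuzzy incremental clustering algorithms spFCM-IB and oFCM-IB, which can handle large-scale datasets effectively. The research is based on probabilistic relations in the three-dimensional space, focuses on the dynamic updating of communities, balances theoretical analysis with practical verification, and provides a new idea for further research on and application of collaborative retrieval methods.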
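For readers unfamiliar with fuzzy clustering, the sketch below shows the standard Euclidean fuzzy c-means updates with per-point weights. It is not ibFCC, FTC, ibFTC, spFCM-IB, or oFCM-IB, whose details are not given here. The function names, the fuzzifier `m = 2.0`, and the toy data are assumptions for illustration only.

```python
def memberships(points, centers, m=2.0, eps=1e-12):
    """Return u[k][i], the degree to which point k belongs to cluster i (rows sum to 1)."""
    power = 2.0 / (m - 1.0)
    rows = []
    for x in points:
        # Euclidean distance to every center, floored at eps to avoid division by zero.
        dists = [max(sum((a - b) ** 2 for a, b in zip(x, c)) ** 0.5, eps)
                 for c in centers]
        rows.append([1.0 / sum((d_i / d_j) ** power for d_j in dists)
                     for d_i in dists])
    return rows

def update_centers(points, weights, u, m=2.0):
    """Each center is the mean of the points weighted by weight * membership**m."""
    dim = len(points[0])
    centers = []
    for i in range(len(u[0])):
        coeffs = [w * row[i] ** m for w, row in zip(weights, u)]
        denom = sum(coeffs)
        centers.append(tuple(sum(c * x[j] for c, x in zip(coeffs, points)) / denom
                             for j in range(dim)))
    return centers

def weighted_fcm(points, weights, centers, m=2.0, iters=50):
    """Alternate membership and center updates for a fixed number of iterations."""
    for _ in range(iters):
        u = memberships(points, centers, m)
        centers = update_centers(points, weights, u, m)
    return centers, memberships(points, centers, m)

# Hypothetical one-dimensional data with two obvious groups.
points = [(0.0,), (0.2,), (5.0,), (5.2,)]
weights = [1.0, 1.0, 1.0, 1.0]
centers, u = weighted_fcm(points, weights, [(0.0,), (5.0,)])
```

The `weights` argument is there because chunk-based fuzzy clustering commonly summarizes earlier data as weighted centroids and clusters them together with newly arrived points. Whether the project's incremental algorithms follow exactly this pattern is an assumption.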
Data last updated: 2023-05-31
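Application of collaborative-representation-based graph embedding discriminant analysis to face recognition
Multi-space interactive collaborative filtering recommendation
Construction of three-level silicon-based fillers and their effect on the properties of dental composite resins
Research progress on efficient, high-precision separation methods for blended-acquisition seismic data
Does environmental information disclosure affect analysts' earnings forecasts?
Research on community-based distributed information retrieval methods in peer-to-peer networks
Research on collaborative recommendation methods for mobile e-commerce virtual communities in a big-data environment
Research on lung CT image retrieval methods oriented to imaging manifestations
Research on hashing methods for large-scale video data retrieval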