Chinese information processing indubitably need to utilize Chinese grammar roperties. However, the system is basically copied from English. In addition, it is not suit computers but people who are studying Chinese and it is not suite the word neighborship processing but structure analysis. To improve performance of Chinese processing applications, we must change this situation, thoroughly think of the linguistics foundation of Chinese processing, and work over grammar that suits to process word neighborship in Chinese by computers. That is why we are studying Chinese linear grammar..We did research on the part-of-speech systems that are currently influential in the Mainland and Taiwan, and investigated the shortages in intelligent interface system of language information due to statistic dada sparseness. We tagged meta information of texts in a large scale Chinese novel corpus containing about one hundred million characters, developed the first retriever in the world for word attributes from Chinese raw corpus, improved and expanded the capacity of the general purpose word segmentation system GPWS, and tried to discover which word attributes required in the applications related to language processing. Based on all these works, as an original innovation, we brought forward the accidence design of linear grammar system based on word attributes. Research on this grammar system will lead to reestablishment of the linguistics foundation of the Chinese information processing and bring the distinct improvement in performance of the applications..
本项目研究面向汉语语言信息智能接口的线性方法,特别是适合于经性文法的汉语词类系统,以及使用词和词类为语言单位和统计语言模型。本项目的研究成果将缓解线性文法中数据稀疏的,提高汉语语言信息智能输入输出系统的准确性,推动我国计算机和网络的普及,推进我国经济和信息化的进程
{{i.achievement_title}}
数据更新时间:2023-05-31
粗颗粒土的静止土压力系数非线性分析与计算方法
中国参与全球价值链的环境效应分析
基于公众情感倾向的主题公园评价研究——以哈尔滨市伏尔加庄园为例
基于细粒度词表示的命名实体识别研究
F_q上一类周期为2p~2的四元广义分圆序列的线性复杂度
线性文法及其在智能信息处理中的应用
面向自然语言智能处理的信息理论
新智能机接口中语言、图形、图象识别与理解
面向智能仿生假肢的光遗传学上行接口研究