Chinese information processing indubitably need to utilize Chinese grammar roperties. However, the system is basically copied from English. In addition, it is not suit computers but people who are studying Chinese and it is not suite the word neighborship processing but structure analysis. To improve performance of Chinese processing applications, we must change this situation, thoroughly think of the linguistics foundation of Chinese processing, and work over grammar that suits to process word neighborship in Chinese by computers. That is why we are studying Chinese linear grammar..We did research on the part-of-speech systems that are currently influential in the Mainland and Taiwan, and investigated the shortages in intelligent interface system of language information due to statistic dada sparseness. We tagged meta information of texts in a large scale Chinese novel corpus containing about one hundred million characters, developed the first retriever in the world for word attributes from Chinese raw corpus, improved and expanded the capacity of the general purpose word segmentation system GPWS, and tried to discover which word attributes required in the applications related to language processing. Based on all these works, as an original innovation, we brought forward the accidence design of linear grammar system based on word attributes. Research on this grammar system will lead to reestablishment of the linguistics foundation of the Chinese information processing and bring the distinct improvement in performance of the applications..
本项目研究面向汉语语言信息智能接口的线性方法,特别是适合于经性文法的汉语词类系统,以及使用词和词类为语言单位和统计语言模型。本项目的研究成果将缓解线性文法中数据稀疏的,提高汉语语言信息智能输入输出系统的准确性,推动我国计算机和网络的普及,推进我国经济和信息化的进程
{{i.achievement_title}}
数据更新时间:2023-05-31
一种基于多层设计空间缩减策略的近似高维优化方法
带有滑动摩擦摆支座的500 kV变压器地震响应
基于腔内级联变频的0.63μm波段多波长激光器
二维FM系统的同时故障检测与控制
具有随机多跳时变时延的多航天器协同编队姿态一致性
线性文法及其在智能信息处理中的应用
面向自然语言智能处理的信息理论
新智能机接口中语言、图形、图象识别与理解
面向智能仿生假肢的光遗传学上行接口研究