基因表达调控分析的非参数回归模型

基本信息
批准号:39900126
项目类别:青年科学基金项目
资助金额:12.00
负责人:陈长生
学科分类:
依托单位:中国人民解放军第四军医大学
批准年份:1999
结题年份:2002
起止时间:2000-01-01 - 2002-12-31
项目状态: 已结题
项目参与者:阎玉霞,李文潮,宇传华,张俊杰,尚磊,李鹏,韩炯
关键词:
非参数回归模型基因表达基因组信息学
结项摘要

A nonparametric regression model do relax the strict assumptions of classical regression models, and serve any form distribution data. It does not choose model form, especially, relaxing the assumption of linear relationship between the responses and the explanatory variables. Therefore, it extends linear models and strengthen model adaptability. In order to improve on LS estimate, the penalized sum of squares is set up. The penalized least squares estimator for regression function by minimizing the penalized sum of squares can be obtained, this estimator compromise between goodness of fit and smoothness.There are few quantitative indices measuring level of gene expression. Distributions of these indices are unknown, and patterns of dependent relationship between level of gene expression and influencing factors are indefinite. So, some strict assumptions supporting the classical theory of linear models are not satisfied. If data do not meet these conditions of classical statistical approaches, statistical inferences drawn from classical approaches would be, to different extent, influenced in negative direction and even erroneous conclusions would be drawn.Therefore, nonparametric regression models would help us solve statistical problems of genome nformatics..This project aims at establishing non-parametric regression models for analyzing gene expression regulation networks. Based on cubic spline and roughness penalty approach, a set of theories and algorithms of nonparametric regression models are proposed for various cases of nonparametric regression analysis. We explore smoothing spline, weighted nonparametric regression model, semiparametric regression model and multidimensional nonparametric regression model in consideration of weights, ties and covariables. We provide cross-validation (CV) score function and generalized cross-validation (GCV) score function. The best design of interest parameters can be obtained by a module form search method. Various nonparametric regression models are verified and assessed by statistical simulations and examples. The computational method measuring codon usage bias is proposed, and codon usage frequencies for two known yeasts are analyzed by using Relative Synonymous Codon Usage (RSCU). Thus highly expressed optimal codons are determined. RSCU-based quantitative statistic, Codon Adaptation Index (CAI), is proposed to measure level of gene expression. The regression relationship between CAI for yeast and such factors as codon usage bias, third base composition and linear correlation of codon usage with tRNA abundance. A proper software for nonparametric regression models is compiled.

本项目研究建立一套适合分析基因表达调控网络的非参数回归模型。从理论上阐述模型的特性,用粗糙度惩罚方法构造出模型的目标函数并证明出密码子不同位置上的碱基组成及其相关性与基因表达水平间的回归关系,提出用于基因表达调控分析的一些统计量。在模型中考虑协变量,控制非调节因素对模型核心参估计值的干扰,并建立高维非参数回归模型。

项目摘要

项目成果
{{index+1}}

{{i.achievement_title}}

{{i.achievement_title}}

DOI:{{i.doi}}
发表时间:{{i.publish_year}}

暂无此项成果

数据更新时间:2023-05-31

其他相关文献

1

DeoR家族转录因子PsrB调控黏质沙雷氏菌合成灵菌红素

DeoR家族转录因子PsrB调控黏质沙雷氏菌合成灵菌红素

DOI:10.3969/j.issn.1673-1689.2021.10.004
发表时间:2021
2

粗颗粒土的静止土压力系数非线性分析与计算方法

粗颗粒土的静止土压力系数非线性分析与计算方法

DOI:10.16285/j.rsm.2019.1280
发表时间:2019
3

正交异性钢桥面板纵肋-面板疲劳开裂的CFRP加固研究

正交异性钢桥面板纵肋-面板疲劳开裂的CFRP加固研究

DOI:10.19713/j.cnki.43-1423/u.t20201185
发表时间:2021
4

基于LASSO-SVMR模型城市生活需水量的预测

基于LASSO-SVMR模型城市生活需水量的预测

DOI:10.19679/j.cnki.cjjsjj.2019.0538
发表时间:2019
5

低轨卫星通信信道分配策略

低轨卫星通信信道分配策略

DOI:10.12068/j.issn.1005-3026.2019.06.009
发表时间:2019

相似国自然基金

1

半参数/非参数回归模型的变量选择

批准号:10741002
批准年份:2007
负责人:左国新
学科分类:A0403
资助金额:4.00
项目类别:专项基金项目
2

非参数半参数分位数回归模型及其应用

批准号:11271241
批准年份:2012
负责人:程业斌
学科分类:A0402
资助金额:60.00
项目类别:面上项目
3

方差分量模型中的Bayes分析及非参数回归极值点的研究

批准号:19971085
批准年份:1999
负责人:韦来生
学科分类:A0403
资助金额:8.00
项目类别:面上项目
4

非参数回归模型变点的监测方法研究

批准号:11426160
批准年份:2014
负责人:齐培艳
学科分类:A0402
资助金额:3.00
项目类别:数学天元基金项目