基于优化片段搜索和残基接触预测的全新蛋白质结构从头预测算法设计

基本信息

批准号：31670723

项目类别：面上项目

资助金额：60.00

负责人：龚海鹏

学科分类：

依托单位：清华大学

批准年份：2016

结题年份：2020

起止时间：2017-01-01 - 2020-12-31

项目状态：已结题

项目参与者：熊大鹏,王童,毛闻志,孙芮宁

关键词：

三级结构预测蛋白质从头预测残基接触预测片段组装法

结项摘要

In order to perform their physiological roles, macromolecules like proteins have to fold into their unique native conformations, which renders the importance of the structural studies of proteins. However, the experimental protein structural determinations, albeit well developed, are lagging far behind the derivation of amino acid sequences of proteins. Protein structure prediction, the computational technique that utilizes the amino acid sequences to predict the tertiary structures, can effectively fill the gap between sequencing and structural determination. Unlike the relatively mature protein structure prediction techniques including the homologous modeling and threading, the ab initio prediction methods predict the tertiary structures from the amino acid sequences purely based on first principles and are thus independent of the presence of homologous templates in the structural database. Unfortunately, none of the available ab initio prediction algorithms can predict the protein tertiary structures reliably. In this project, we plan to improve the accuracy and reliability of the present ab initio methods. On one hand, we will optimize the searching of fragments with low homology, which may greatly improve the efficiency of the subsequent fragment assembly that predict the protein structures by randomly assembling the identified fragments. On the other hand, we will further improve the present algorithm for predicting the contacts between amino acid residues, by effectively combining the respective information derived from the structure and sequence databases. The predicted residue contacts can further facilitate the protein structure prediction by reducing the sampling space. Combining the above two improvements, we will develop a novel algorithm for the ab initio protein structure prediction.

蛋白质需要折叠到其天然态构象行使生理功能，因此蛋白质的结构研究非常重要。虽然实验测定蛋白质结构的方法发展很快，但是仍远远落后于氨基酸序列测定的速度。蛋白质结构预测通过理论计算，根据氨基酸序列预测三级结构，因此能有效地填补结构和序列测定间的鸿沟。在蛋白质结构预测方法中，不同于较为成熟的同源建模法和穿线法，从头预测法完全根据物理化学规律进行预测，因此不依赖于结构数据库中是否存在同源模板。但是，目前没有一种从头预测法能可靠地预测蛋白质结构。本项目中，我们计划提高从头预测法的准确度和可靠性。一方面，我们优化远同源片段的搜索，进一步提高使用片段组装法通过拼接这些片段模板来预测蛋白质结构的效率。另一方面，我们通过有效结合得自于结构和序列数据库的信息，优化氨基酸残基间接触的预测。预测所得的残基接触信息可以缩减采样空间，从而进一步辅助结构预测。结合以上改进，我们计划发展一种全新的蛋白质结构从头预测算法。

项目摘要

蛋白质需要折叠到其天然态构象行使生理功能，因此蛋白质的结构研究非常重要。虽然实验测定蛋白质结构的方法发展很快，但是仍远远落后于氨基酸序列测定的速度。蛋白质结构预测通过理论计算，根据氨基酸序列预测三级结构，因此能有效地填补结构和序列测定间的鸿沟。在蛋白质结构预测方法中，不同于较为成熟的同源建模法和穿线法，从头预测法完全根据物理化学规律进行预测，因此不依赖于结构数据库中是否存在同源模板。但是，目前少有从头预测法能可靠地预测蛋白质结构。本项目中，我们开发了多种算法从多个角度提升从头预测方法的准确率和可靠性。特别是DeepFragLib、AmoebaContact+GDFold和GANProDist等三种算法或流程，不仅在算法的设计思路上具有高度的创新性，而且在性能上也至少达到了与领域内主流算法的持平的效果。

项目成果

DOI：{{i.doi}}

发表时间：{{i.publish_year}}

暂无此项成果

数据更新时间：2023-05-31

其他相关文献

DOI：10.15957/j.cnki.jjdl.2016.12.031

发表时间：2016

DOI：10.16606/j.cnki.issn0253-4320.2022.10.026

发表时间：2022

DOI：10.19679/j.cnki.cjjsjj.2019.0538

发表时间：2019

DOI：

发表时间：2018

DOI：

发表时间：2015

龚海鹏的其他基金

批准号：31170674

批准年份：2011

资助金额：60.00

项目类别：面上项目

批准号：31470033

批准年份：2014

资助金额：30.00

项目类别：面上项目

相似国自然基金

基于残基特异性力场的蛋白质结构从头预测

批准号：21573009

批准年份：2015

负责人：蒋帆

学科分类：B0302

资助金额：67.00

项目类别：面上项目

利用深度学习进行从头开始的蛋白质结构预测和蛋白质设计

批准号：61902335

批准年份：2019

负责人：李镇

学科分类：F0213

资助金额：30.00

项目类别：青年科学基金项目

基于蛋白质分类和残基定义优化的蛋白质-蛋白质相互作用位点预测

批准号：U1404307

批准年份：2014

负责人：邱智军

学科分类：C0504

资助金额：30.00

项目类别：联合基金项目

蛋白质残基间相互作用预测算法研究及其在三级结构预测中的应用

批准号：31770775

批准年份：2017

负责人：卜东波

学科分类：C0504

资助金额：60.00

项目类别：面上项目

基于优化片段搜索和残基接触预测的全新蛋白质结构从头预测算法设计

{{i.achievement_title}}

暂无此项成果

其他相关文献

演化经济地理学视角下的产业结构演替与分叉研究评述

氟化铵对CoMoS /ZrO_2催化4-甲基酚加氢脱氧性能的影响

基于LASSO-SVMR模型城市生活需水量的预测

基于多模态信息特征融合的犯罪预测算法研究

城市轨道交通车站火灾情况下客流疏散能力评价

龚海鹏的其他基金

MFS家族膜转运蛋白转运机理的研究

使用分子模拟研究电压门控钠离子通道的分子机理

相似国自然基金