Predicting Protein Function Based on Graph Clustering
Guo Jinwen (郭金文); Lin Jie (林劼)
Abstract:
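The method exploits circular relationships among protein sequences: a circular matching algorithm preprocesses the data to obtain a set of related proteins, and this set is used to construct a protein network graph. A graph clustering algorithm then clusters the proteins related to the protein whose function is to be predicted and partitions them into subgroups. For each subgroup, a z-score is computed to derive the protein functions reported as the prediction. In experiments, the method shows a clear improvement in F1-measure, the final evaluation metric, over other recent methods.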
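As a rough illustration of two building blocks named in the abstract, the sketch below checks whether one sequence is an exact circular permutation (rotation) of another and computes the F1-measure over sets of function labels. It is not the authors' algorithm: the function names, the exact-rotation test (a doubled-string check standing in for the paper's circular matching algorithm), and the use of plain label sets are all assumptions made for illustration.

```python
def is_circular_match(a: str, b: str) -> bool:
    """Return True if b is an exact rotation of a.

    Assumption: a simple doubled-string test; the paper's circular
    matching algorithm is not specified here and may be approximate.
    """
    return len(a) == len(b) and b in a + a


def f1_measure(predicted: set, actual: set) -> float:
    """Standard F1 = 2PR / (P + R) over predicted vs. annotated labels."""
    true_positives = len(predicted & actual)
    if true_positives == 0:
        return 0.0  # also covers empty predicted or actual sets
    precision = true_positives / len(predicted)
    recall = true_positives / len(actual)
    return 2 * precision * recall / (precision + recall)
```

For example, `is_circular_match("MKVLA", "VLAMK")` holds because `"VLAMK"` occurs in `"MKVLAMKVLA"`, and `f1_measure` can be applied to any two sets of hypothetical function labels, such as predicted versus annotated terms for one protein.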
Keywords: protein function prediction; circular relationship; graph clustering; protein domain; F1-measure
Foundation: National Natural Science Foundation of China (61472082); Natural Science Foundation of Fujian Province (2014J01220)