Genomic functional annotation using co-evolution profiles of gene clusters
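[Editor's note] Predicting genome function from co-evolution together with the relative positions of genes is a popular approach today. This paper is an exemplar of research in this area.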
Background
The current speed of sequencing already exceeds the capability of annotation, creating a potential bottleneck. A large proportion of the genes in microbial genomes remain uncharacterized. Here we propose a new method for functional annotation using the conservation patterns of gene clusters. If several gene clusters show the same coevolution pattern across different genomes, it is reasonable to infer that they are functionally related. The gene cluster phylogenetic profile integrates chromosomal proximity information and phylogenetic profile information, and allows us to infer functional dependences between gene clusters even when they lie at a great distance from each other on the chromosome.

Results
As a proof of concept, we applied our method to the genome of Escherichia coli strain K12. Our method establishes functional relationships among 176 gene clusters, comprising 738 E. coli genes. The accuracy of the gene-pair phylogenetic profiles was compared with that of the single-gene phylogenetic profiles and was shown to be higher. As a result, we are able to suggest functional roles for several previously unknown genes or unknown genomic regions in E. coli. We also examined the robustness of coevolution signals across a larger set of genomes and suggest a possible upper limit of accuracy for the phylogenetic profile methods.

Conclusions
Higher-order phylogenetic profiles, such as the gene-pair phylogenetic profiles, can detect functional dependences that are missed when using the conventional single-gene phylogenetic profile or the chromosomal proximity method alone. We show that the gene-pair phylogenetic profile is more accurate than the single-gene phylogenetic profile.
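
To make the difference between a single-gene profile and a gene-pair profile concrete, the following is a minimal Python sketch on toy data. It is not the authors' implementation: the genome contents, the adjacency threshold `max_gap`, the function names, and the use of Jaccard similarity to compare profiles are all assumptions made for illustration, since the abstract does not specify how proximity or profile similarity is measured.

```python
# Hypothetical toy data: each genome is an ordered list of ortholog IDs,
# where list position stands in for position on the chromosome.
GENOMES = {
    "g1": ["a", "b", "x", "c", "d"],
    "g2": ["c", "d", "y", "a", "b"],
    "g3": ["a", "x", "b", "c", "y", "d"],
    "g4": ["x", "y"],
}


def single_profile(gene, genomes):
    """Single-gene profile: 1 where the gene is present in a genome, else 0."""
    return tuple(int(gene in order) for order in genomes.values())


def pair_profile(g1, g2, genomes, max_gap=1):
    """Gene-pair profile: 1 only where both genes are present AND lie within
    max_gap positions of each other (max_gap is an assumed threshold)."""
    profile = []
    for order in genomes.values():
        if g1 in order and g2 in order:
            close = abs(order.index(g1) - order.index(g2)) <= max_gap
        else:
            close = False
        profile.append(int(close))
    return tuple(profile)


def jaccard(p, q):
    """Jaccard similarity of two binary profiles (an assumed similarity measure)."""
    both = sum(1 for a, b in zip(p, q) if a and b)
    either = sum(1 for a, b in zip(p, q) if a or b)
    return both / either if either else 0.0


ab = pair_profile("a", "b", GENOMES)
cd = pair_profile("c", "d", GENOMES)
print(ab, cd, jaccard(ab, cd))
```

In this toy data the pairs (a, b) and (c, d) are never adjacent to each other, yet each pair is kept together in the same genomes and split apart in the same genome, so their pair profiles match. That is the kind of long-range dependence between clusters that the abstract describes; presence or absence alone would not show that the two clusters are conserved as clusters in the same genomes.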
1999-2005 Bioinformatics Center, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences