Paper published about the database of results from the Genome Comparison project


A paper about the ProteinWorldDB database, created with the results of the Genome Comparison project, was published in the proceedings of the 7th Brazilian Symposium on Bioinformatics, BSB 2012, Campo Grande, Brazil, August 15-17, 2012.



Paper Title:

“Design and Implementation of ProteinWorldDB”

Lay Person Abstract:

The paper describes the design of and future enhancements to the ProteinWorldDB data base, containing the results of the Genome Comparison project run on World Community Grid. The paper was presented at the 7th Brazilian Symposium on Bioinformatics at Campo Grande, Brazil, August 15-17, 2012.

Technical Abstract:

This work involves the comparison of protein information in a genomic scale. The main goal is to improve the quality and interpretation of biological data, besides our understanding of biological systems and their interactions. Stringent comparisons were obtained after the application of the Smith-Waterman algorithm in a pair wise manner to all predicted proteins encoded in both completely sequenced and unfinished genomes available in the public database RefSeq. Comparisons were run through a computational grid and the complete result reaches a volume of over 900 GB. Consequently, the database system design is a critical step in order to store and manage the information from comparisons’ results. This paper describes database conceptual design issues for the creation of a database that represents a data set of protein sequence cross-comparisons. We show that our conceptual schema and its relational mapping enables users to extract relevant information, from simple to complex queries integrating distinct data sources.

Access to Paper:

To view the paper, please click here.