GenomeLKPG: A comprehensive proteome sequencedatabase for taxonomy studies
2008 (English)Article in journal (Refereed) Submitted
Background: In order to perform taxonomically unbiased analyses of protein relationships, there is a need ofcomplete proteomes rather than databases with bias towards well characterized protein families. However, nocomprehensive resource of completed proteomes is currently available. Instead, the proteomes need to be down-loaded manually from di®erent servers, all using different filename conventions and fasta header formats.
Results: We have developed a semi-automatic algorithm that retrieves complete proteomes from multiple FTP-servers and maps the species-speci¯c sequence entries to the NCBI taxonomy. The compiled data is provided ina sequence database named genomeLKPG.
Conclusions: The usefulness of genomeLKPG is proven in several published taxonomical studies.
Place, publisher, year, edition, pages
IdentifiersURN: urn:nbn:se:liu:diva-52933OAI: oai:DiVA.org:liu-52933DiVA: diva2:285932