PEDANT covers all complete RefSeq genomes

Mathias C Walter, Thomas Rattei, Roland Arnold, Ulrich Güldener, Martin Münsterkötter, Karamfilka Nenova, Gabi Kastenmüller, Patrick Tischler, Andreas Wölling, Andreas Volz, Norbert Pongratz, Ralf Jost, Hans-Werner Mewes, Dmitrij Frishman

Research output: Contribution to journal › Article › peer-review

80 Citations (Scopus)

Abstract

The PEDANT genome database provides exhaustive annotation of nearly 3000 publicly available eukaryotic, eubacterial, archaeal and viral genomes with more than 4.5 million proteins by a broad set of bioinformatics algorithms. In particular, all completely sequenced genomes from the NCBI's Reference Sequence collection (RefSeq) are covered. The PEDANT processing pipeline has been sped up by an order of magnitude through the utilization of precalculated similarity information stored in the similarity matrix of proteins (SIMAP) database, making it possible to process newly sequenced genomes immediately as they become available. PEDANT is freely accessible to academic users at http://pedant.gsf.de. For programmatic access Web Services are available at http://pedant.gsf.de/webservices.jsp.

Original language: English
Pages (from-to): D408-11
Journal: Nucleic Acids Research
Volume: 37
Issue number: Database issue
Publication status: Published - Jan 2009

Keywords

  • Databases, Genetic
  • Genome
  • Genomics
  • Internet
  • Proteins
  • Journal Article
