PAT: predictor for structured units and its application for the optimization of target molecules for the generation of synthetic antibodies

Research output: Contribution to journalArticle

Authors

  • Jouhyun Jeon
  • Fateh Singh
  • Joan Teyra
  • Tatjana Braun
  • Philip M Kim

Colleges, School and Institutes

External organisations

  • Terrence Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, M5S 3E1, ON, Canada.
  • Department of Computer Science, University of Toronto, Toronto, M5S 3E1, ON, Canada. pi@kimlab.org.

Abstract

BACKGROUND: The identification of structured units in a protein sequence is an important first step for most biochemical studies. Importantly for this study, the identification of stable structured region is a crucial first step to generate novel synthetic antibodies. While many approaches to find domains or predict structured regions exist, important limitations remain, such as the optimization of domain boundaries and the lack of identification of non-domain structured units. Moreover, no integrated tool exists to find and optimize structural domains within protein sequences.

RESULTS: Here, we describe a new tool, PAT ( http://www.kimlab.org/software/pat ) that can efficiently identify both domains (with optimized boundaries) and non-domain putative structured units. PAT automatically analyzes various structural properties, evaluates the folding stability, and reports possible structural domains in a given protein sequence. For reliability evaluation of PAT, we applied PAT to identify antibody target molecules based on the notion that soluble and well-defined protein secondary and tertiary structures are appropriate target molecules for synthetic antibodies.

CONCLUSION: PAT is an efficient and sensitive tool to identify structured units. A performance analysis shows that PAT can characterize structurally well-defined regions in a given sequence and outperforms other efforts to define reliable boundaries of domains. Specially, PAT successfully identifies experimentally confirmed target molecules for antibody generation. PAT also offers the pre-calculated results of 20,210 human proteins to accelerate common queries. PAT can therefore help to investigate large-scale structured domains and improve the success rate for synthetic antibody generation.

Details

Original languageEnglish
Article number150
JournalBMC Bioinformatics
Volume17
Publication statusPublished - 1 Apr 2016

Keywords

  • Amino Acid Sequence, Antibodies, Area Under Curve, Databases, Protein, Humans, Internet, Peptide Library, Protein Structure, Tertiary, Proteins, ROC Curve, User-Computer Interface, Journal Article, Research Support, Non-U.S. Gov't