Issues in high-throughput comparative modelling: A case study using the ubiquitin E2 conjugating enzymes

Peter Winn; K Schleinkofer; A Banerjee; RC Wade

doi:10.1002/prot.20318

Biosciences

Research output: Contribution to journal › Article

7 Citations (Scopus)

Abstract

Sequences of the ubiquitin-conjugating enzyme (UBC or E2) family were used as a test set to investigate issues associated with the high-throughput comparative modelling of protein structures. A semi-automatic method was initially developed with particular emphasis on producing models of a quality suitable for structural comparison. Structural and sequence features of the E2 family were used to improve the sequence alignment and the quality of the structural templates. Initially, failure to correct for subtle structural inconsistencies between templates led to problems in the comparative analysis of the UBC electrostatic potentials. Modelling of known UBC structures using Modeller 4.0 showed that multiple templates produced, on average, no better models than the use of just one template, as judged by the root-mean-squared deviation between the comparative model and crystal structure backbones. Using four different quality-checking methods, for a given target sequence, it was not possible to distinguish the model most similar to the experimental structure. The UBC models were thus finally modelled using only the crystal structure template with the highest sequence identity to the target to be modelled, and producing only one model solution. Quality checking was used to reject models with obvious structural anomalies (e.g., bad side-chain packing). The resulting models have been used for a comparison of UBC structural features and of their electrostatic potentials. The work was extended through the development of a fully automated pipeline that identifies E2 sequences in the sequence databases, aligns and models them, and calculates the associated electrostatic potential.
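The template-selection rule described in the abstract (use only the crystal structure template with the highest sequence identity to the target) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the function names, the toy alignments, and the choice to compute identity over columns where neither sequence has a gap are all assumptions made for illustration.

```python
from typing import Dict, Tuple


def percent_identity(aligned_target: str, aligned_template: str) -> float:
    """Percent identity over alignment columns where both sequences have a residue.

    Assumption: gap columns ('-') are excluded from the denominator; other
    conventions (e.g. dividing by the target length) are equally common.
    """
    if len(aligned_target) != len(aligned_template):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_target, aligned_template):
        if a == "-" or b == "-":
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a.upper() == b.upper():
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def pick_template(alignments: Dict[str, Tuple[str, str]]) -> Tuple[str, float]:
    """Return (template_id, identity) for the template with the highest identity.

    `alignments` maps a template id to (aligned target, aligned template).
    """
    if not alignments:
        raise ValueError("no candidate templates")
    scored = {tid: percent_identity(t, s) for tid, (t, s) in alignments.items()}
    best = max(scored, key=scored.get)
    return best, scored[best]


# Hypothetical toy alignments, not real E2 sequences.
candidates = {
    "template_A": ("MKT-LLVA", "MKTALLIA"),
    "template_B": ("MKTLLVA", "MRSLIVG"),
}
best_id, best_identity = pick_template(candidates)
print(best_id, round(best_identity, 1))
```

In a real workflow the pairwise alignments would come from an alignment program, and the selected template would then be passed to the modelling software; both steps are outside this sketch.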

Original language	English
Pages (from-to)	367-375
Number of pages	9
Journal	Proteins: Structure, Function, and Bioinformatics
Volume	58
Issue number	2
DOIs	https://doi.org/10.1002/prot.20318
Publication status	Published - 1 Feb 2005

Cite this

@article{830165e028d7499597d9273958f7b97e,

title = "Issues in high-throughput comparative modelling: A case study using the ubiquitin E2 conjugating enzymes",

abstract = "Sequences of the ubiquitin-conjugating enzyme (UBC or E2) family were used as a test set to investigate issues associated with the high-throughput comparative modelling of protein structures. A semi-automatic method was initially developed with particular emphasis on producing models of a quality suitable for structural comparison. Structural and sequence features of the E2 family were used to improve the sequence alignment and the quality of the structural templates. Initially, failure to correct for subtle structural inconsistencies between templates led to problems in the comparative analysis of the UBC electrostatic potentials. Modelling of known UBC structures using Modeller 4.0 showed that multiple templates produced, on average, no better models than the use of just one template, as judged by the root-mean-squared deviation between the comparative model and crystal structure backbones. Using four different quality-checking methods, for a given target sequence, it was not possible to distinguish the model most similar to the experimental structure. The UBC models were thus finally modelled using only the crystal structure template with the highest sequence identity to the target to be modelled, and producing only one model solution. Quality checking was used to reject models with obvious structural anomalies (e.g., bad side-chain packing). The resulting models have been used for a comparison of UBC structural features and of their electrostatic potentials. The work was extended through the development of a fully automated pipeline that identifies E2 sequences in the sequence databases, aligns and models them, and calculates the associated electrostatic potential.",

author = "Winn, Peter and Schleinkofer, K. and Banerjee, A. and Wade, R. C.",

year = "2005",

month = feb,

day = "1",

doi = "10.1002/prot.20318",

language = "English",

volume = "58",

pages = "367--375",

journal = "Proteins: Structure, Function, and Bioinformatics",

issn = "1097-0134",

publisher = "Wiley",

number = "2",

}

TY - JOUR

T1 - Issues in high-throughput comparative modelling: A case study using the ubiquitin E2 conjugating enzymes

AU - Winn, Peter

AU - Schleinkofer, K

AU - Banerjee, A

AU - Wade, RC

PY - 2005/2/1

Y1 - 2005/2/1

AB - Sequences of the ubiquitin-conjugating enzyme (UBC or E2) family were used as a test set to investigate issues associated with the high-throughput comparative modelling of protein structures. A semi-automatic method was initially developed with particular emphasis on producing models of a quality suitable for structural comparison. Structural and sequence features of the E2 family were used to improve the sequence alignment and the quality of the structural templates. Initially, failure to correct for subtle structural inconsistencies between templates led to problems in the comparative analysis of the UBC electrostatic potentials. Modelling of known UBC structures using Modeller 4.0 showed that multiple templates produced, on average, no better models than the use of just one template, as judged by the root-mean-squared deviation between the comparative model and crystal structure backbones. Using four different quality-checking methods, for a given target sequence, it was not possible to distinguish the model most similar to the experimental structure. The UBC models were thus finally modelled using only the crystal structure template with the highest sequence identity to the target to be modelled, and producing only one model solution. Quality checking was used to reject models with obvious structural anomalies (e.g., bad side-chain packing). The resulting models have been used for a comparison of UBC structural features and of their electrostatic potentials. The work was extended through the development of a fully automated pipeline that identifies E2 sequences in the sequence databases, aligns and models them, and calculates the associated electrostatic potential.

U2 - 10.1002/prot.20318

DO - 10.1002/prot.20318

M3 - Article

C2 - 15558745

SN - 1097-0134

VL - 58

SP - 367

EP - 375

JO - Proteins: Structure, Function, and Bioinformatics

JF - Proteins: Structure, Function, and Bioinformatics

IS - 2

ER -
