SHIFTY
:
Automated 1H and 13C chemical shift prediction using
the BioMagResBank
|
David
S. Wishart a, M. Scott Watsona, Robert F. Boykob
and Brian D. Sykes b
aFaculty of Pharmacy and Pharmaceutical
Sciences,bDepartment of Biochemistry
University of Alberta
Edmonton, AB, Canada
|
|
Abstract:
A computer program has been developed to accurately and automatically
predict the 1H and 13C chemical shifts of unassigned
proteins on the basis of sequence homology. The program (called SHIFTY)
uses standard sequence alignment techniques to compare the sequence of
an unassigned
protein against the BioMagResBank
a public database containing sequences and NMR chemical shifts of nearly
200 assigned proteins [Seavey et al. (1991) J. Biomol. NMR, 1, 217-236
PubMed PMID: 1841696].
From this initial sequence alignment,
the program uses a simple set of rules to directly assign or transfer
a complete set of 1H or 13C
chemical shifts (from the previously assigned homologues) to the unassigned
protein. This " homologous assignment"
protocol takes advantage of the simple
fact that homologous proteins tend to share both structural similarity
and chemical shift similarity. SHIFTY has been
extensively tested on more than 25 medium-sized
proteins. Under favorable circumstances, this program can predict the 1H
or 13C chemical shifts of
proteins with an accuracy far exceeding
any other method published to date. With the exponential growth in
the number of assigned proteins appearing in
the literature (now at a rate of more
than 150 per year), we believe that SHIFTY may have widespread utility
in assigning individual members in
families of related proteins, an endeavor
that accounts for a growing portion of the protein NMR work being done
today.
J Biomol NMR 1997 Dec;10(4):329-336
[PubMed PMID: 9460240]
Listed here are the proteins
used for this article. Information in the table includes chemical
shifts, X-ray structure, the accession codes for data deposited in
the BMRB and PDB databases, the resolution at which the crystal structure
was solved, the sequence homologues that were used, the correlation coefficients
for the different NMR specs, and the assignment prediction accuracy.
Protein
Chemical shifts refence
(BMRB code) |
No.
of residues |
X-ray
structure ref.
(PDB code) |
Resolution
(Angstroms) |
Matching
protein
(BMRB code) |
Identity
(%)
|
aH
correlation |
NH
correlation |
bH
correlation |
Assignment
prediction accuracy
(%)
|
Lysozyme (hen)
(1093
,
2786
,
4652
) |
129 |
Vaney
et al., 1996
(193L) |
1.33 |
Lysozyme
(hen)
Lysozyme (turkey) |
100
93.7 |
1.00
0.99 |
1.00
0.99 |
1.00
0.99 |
100
92.2 |
Calbindin (Bovine)
(247,
327,
753) |
76 |
Szebenyi
et al., 1986
(3ICB) |
2.30 |
Calbindin (porcine) |
86.8 |
.99 |
.99 |
.99 |
93.4 |
PTI(type E) |
. |
. |
. |
PTI (type K) |
61.4 |
.96 |
.89 |
.98 |
67.8 |
HPr (B. subtilis)
(932) |
87 |
Liao
et al., 1994
(1SPH) |
1.70 |
HPr (S. aureus) |
60.9 |
.93 |
.87 |
.96 |
66.7 |
EGF (mouse) |
53 |
Kohda
et al., 1992
(1EPH) |
. |
EGF (human) |
60.4 |
.92 |
.94 |
.95 |
75.5 |
Anthropluerin A |
. |
. |
. |
ATX I toxin |
59.2 |
.86 |
.86 |
.97 |
75.5 |
Plastocyanin (spinach)
(120,
121,79) |
99 |
Xue
et al., 1997
(1AG6) |
1.7 |
Plastocyanin (algae) |
50.5 |
.91 |
.80 |
.94 |
64.6 |
Troponin C (turkey) |
162 |
Herzberg
et al., 1988
(5TNC) |
2.00 |
Calmodulin (fruit fly) |
46.9 |
.92 |
.89 |
N/A |
. |
Bungarotoxin |
148 |
Love
et al., 1986
(2ABX) |
2.5 |
Alpha neurotoxin |
35.1 |
.84 |
.72 |
.93 |
70.3 |
HPr (E. coli) |
85 |
Jia
et al., 1993
(1POH) |
2.00 |
HPr (B. subtilis) |
32.9 |
.64 |
.51 |
.91 |
77.4 |
Cardiotoxin III |
60 |
Bhaskaran
et al., 1994
(2CRS) |
. |
Alpha neurotoxin |
30.0 |
.20 |
.12 |
.89 |
. |
Lac repressor
(1066,
127,
1494,
1552,
2956,
32,
4813,
661,
736,
848,
849,
96)
|
60 |
Spronk
et al., 1999
(1CJG) |
. |
HPr (B. subtilis) |
10.4 |
.30 |
.15 |
N/A |
39.2 |
|