Verotoxin (VT) and Heat-labile Enterotoxin (LT) cause hemolytic uremia and traveller's diarrhea, respectively. From the 3D structures of both toxins it is clear that their B-subunits adopt the same fold, now known as the OB-fold. The RMS deviation of 47 CA is 1.0 Å while only three residues are identical in a structure-based sequence alignment (VT and LT B-subunit sequences are 69 and 101 residues long, respectively). Given the 3D structure of the VT B-subunit, can threading methods identify the fold of LT B?
Sequence alignment based on 3D structures
b b b b b b b b b b b
VT - - - - - - - - - - - - - - - T P D C V T G K V E Y T K Y N D - - -
| | | | | | | |
LT A P Q T I T E L C S E Y R N T Q I Y T I N D K I L S Y T E S M A G K
a a a a a b b b b b b b b b b b b b
b b b b b b b b b b a
VT D D T F T V K V - G D K E L F T N R - - - - - - - - - - - - - W N L
| | | | | | | | | | | | | | | |
LT R E M V I I T F K S G E T F Q V E V P G S Q H I D S Q K K A I E R M
b b b b b b b b b a a a a a a a a a a
a a a a a a a a a a b b b b b b b b b
VT Q S L L L S A Q I T G M T V - T I K T N A C H N G G G F S E V I F R
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
LT K D T L R I T Y L T E T K I D K L C V W N N K T P N S I A A I S M K N
a a a a a a a a a b b b b b b b b b b b b b b b b
The OB-fold is:

10 best structures:
| PDB structure | raw score | Z-score | a.a.aligned | % identites | Fold (class) |
| 1bovAx 69 a.a. | 3062 | 10.40 | 69 | 100 | OB (all beta) |
| 1ltsDx 103 a.a. | 569 | 1.48 | 89 | 16 | OB (all beta) |
| 1mhcBx 99 a.a. | 543 | 1.38 | 81 | 20 | MHC recogn. dom. (alpha + beta) |
| 1ayaAx 101 a.a. | 470 | 1.12 | 79 | 21 | SH2-like(alpha + beta) |
| 1fkj_x 107 a.a. | 434 | 0.99 | 78 | 21 | FKBP-like (alpha + beta) |
| 2mcm_x 112 a.a. | 401 | 0.87 | 82 | 20 | Immunoglob.-like (all beta) |
| 1aac_x 105 a.a. | 386 | 0.82 | 84 | 26 | Cupredoxin-like(all beta) |
| 2fxb_x 81 a.a. | 381 | 0.80 | 75 | 13 | Ferredoxin-like (alpha + beta) |
| 1bdo_x 80 a.a. | 369 | 0.76 | 74 | 16 | ND |
| 1prtDx 110 a.a. | 366 | 0.75 | 84 | 17 | OB (all beta) |
Alignments:
1bovAx 3062 finds itself
bb bbbbb bbbbb bb bbaaaaa
MKKTLLIAASLSFFSASALATPDCVTGKVEYTKYNDDDTFTVKVGDKELFTNRWNLQSLL
****************************************
--------------------TPDCVTGKVEYTKYNDDDTFTVKVGDKELFTNRWNLQSLL
bbbbb bbbbbbbbbbbbbbb aaaaa
aaa bbbbbb bbbbbbb
LSAQITGMTVTIKTNACHNGGGFSEVIFR
*****************************
LSAQITGMTVTIKTNACHNGGGFSEVIFR
aaaaaaabbbbbbb bbbbb
1ltsDx 569 finds Heat-labile enterotoxin
bb bb bbb bbbbb bb
MKKTLLIAAS----LSFFS--ASALATPDCVTGKVE--YTKYNDDDTFTVKVGDKELF-T
.*. * ..|. . * .:. .** * ...|.:.|** *.*.:.: .
APQTITELCSEYRNTQIYTINDKILSYTESMAGKREMVIITFKSGETFQVEVPGSQHIDS
aaaaaaaa bbbbbbb bbbbbbb bbbbbbbb bbbbb
bbaaaaaaaa bbbbbb bbbbbbb
NRWNLQSLLLSAQITGMTVTIKTNAC----HNGGGFSEVIFR-
.. .::.: . .** .* * . . * ..:.:.... .|
QKKAIERMKDTLRITYLTETKIDKLCVWNNKTPNSIAAISMKN
aaaaaaaaaaaaaaaaaaabbbbb bbbbbbb
--- ------------------------------------------------------------
--- TOPITS prediction-based threading
--- ------------------------------------------------------------
--- TOPITS ALIGNMENTS HEADER: ABBREVIATIONS
--- RANK : rank in alignment list, sorted according to z-score
--- EALI : alignment score
--- LALI : length of alignment
--- IDEL : number of residues inserted
--- NDEL : number of insertions
--- ZALI : alignment zcore; note: hits with z>3 more reliable
--- PIDE : percentage of pairwise sequence identity
--- LEN2 : length of aligned protein structure
--- ID2 : PDB identifier of aligned structure
--- NAME2 : name of aligned protein structure
---
--- TOPITS ALIGNMENTS HEADER: ACCURACY
--- : Tested on 80 proteins, TOPITS found the
--- : correct remote homologue in about 30% of
--- : the cases, detection accuracy was higher
--- : for higher z-scores (ZALI):
--- ZALI>0 : 1st hit correct in 33% of cases
--- ZALI>3 : 1st hit correct in 50% of cases
--- ZALI>3.5 : 1st hit correct in 60% of cases
---
--- TOPITS ALIGNMENTS HEADER: SUMMARY
RANK EALI LALI IDEL NDEL ZALI PIDE LEN2 ID2 NAME2
1 89.43 69 0 0 4.25 100 349 1bov_A VEROTOXIN-1 OB fold (all beta>
2 88.48 69 0 0 4.18 100 349 1bov_A VEROTOXIN-1
3 88.43 69 0 0 4.17 100 349 1bov_A VEROTOXIN-1
4 88.28 69 0 0 4.16 100 349 1bov_A VEROTOXIN-1
5 87.87 69 0 0 4.13 100 349 1bov_A VEROTOXIN-1
6 57.28 88 14 5 1.82 19 476 2aaa D alpha-amylase beta dom (all beta)
7 56.65 88 9 2 1.78 16 244 1rva_A restriction endonuclease (alpha/beta)
8 55.65 86 4 2 1.70 16 1275 1tah_A alpha/beta hydrolase (alpha/beta)
9 55.65 86 4 2 1.70 16 1275 1tah_A alpha/beta hydrolase (alpha/beta)
10 55.48 87 15 3 1.69 15 405 1eft EF-Tu C-term Dom. (all beta)
11 55.25 88 18 3 1.67 13 496 1smd L_ ND
12 55.15 80 3 2 1.66 17 1275 1tah_A alpha/beta hydrolase (alpha/beta)
13 54.92 83 14 4 1.65 17 131 1ifc Lipocalins (all beta)
14 54.43 87 5 3 1.61 13 686 1cdg Immunoglobulin-like (all beta)
15 54.10 87 6 4 1.58 22 648 2bbv_A Viral capsid proteins (all beta)
16 53.93 86 4 2 1.57 16 1275 1tah_A alpha/beta hydrolase (alpha/beta)
17 53.77 87 14 2 1.56 17 452 1gcb L_ID Immunoglobulin-like (all beta)
18 53.37 81 13 4 1.53 15 377 1hsb_B Immunoglobulin-like (all beta)
19 53.37 81 13 4 1.53 15 377 1hsb_A Immunoglobulin-like (all beta)
20 53.28 86 6 4 1.52 22 648 2bbv_A Viral capsid proteins (all beta)
The method used for this prediction was: gonnet .
NOTE: This method is merely a sequence-sequence comparison
of your sequence to the sequences of the PDB folds.
Most similar fold: 1bova
VEROTOXIN-1 (B-OLIGOMER, ALSO CALLED SHIGA-LIKE T
RANK Z-SCORE FOLD LENGTHALI %ID
1 49.98 1bova 69 100 2-21 beta ; OB-fold] 2-21-1-1-3-1-1-1
2 4.23 1tiid 77 25 1997 new ; ESCHERICHIA COLI HEAT LABILE ENTEROTOXIN TYPE I]
3 3.72 1dlha_3-81 72 21 4-12 (a+b) ; MHC antigen-recognition domain] 4-12-1-1-2-1-1-1 RC:,
4 3.16 1ifl 52 15 1-71 alpha ; Oligomers of long helices] 1-71-3-1-1-4-1
5 2.96 1drs 51 12 7-5 small ; Snake toxin-like] 7-5-1-2-1-1-1
6 2.89 1erg 64 16 7-5 small ; Snake toxin-like] 7-5-1-3-1-1-1
7 2.84 3hhrb_131-234 86 19 2-1 beta ; Immunoglobulin-like beta-sandwich] 2-1-2-1-5-1-1-2
8 2.80 1mjc 63 24 2-21 beta ; OB-fold] 2-21-5-1-1-1-1
---> Major cold shock protein from Escherichia coli (n=5, S=8)
9 2.55 2cyr 72 26 7-27 small ; Cytochrome c3] 7-27-1-1-1-1-2
10 2.42 1frta_179-269 64 23 2-1 beta ; Immunoglobulin-like beta-sandwich] 2-1-1-2-3-1-2-1
LEGEND:
COL. 1: RANK. The ranks are obtained by sorting the fold library,
by Z-SCORES, in decreasing order. Only the 10
structures that are most compatible to your sequence
are shown.
COL. 2: Z-SCORE. The z-scores are computed using
the distribution of raw scores (not shown) of all folds.
COL. 3: FOLD. Protein Data Bank codes for the coordinates of the 3D
structures.
COL. 4: LENGTHALI. The number of residues from your sequence that were
aligned to the fold.
COL. 5: % ID. Percentage of identical residues in the alignment.
RELIABILITY OF THIS PREDICTION:
With this method the confidence threshold
is a z-score of 4.4 +- 1.0.
5. Comparative Modeling: From Sequence to 3D model