functions each of which represents the observed orientation of a contacting residue, and is evaluated as a series expansion of spherical harmonics functions. The sample size limits the frequencies of modes whose expansion coefficients can be reliably estimated. High frequency modes are statistically less reliable than low frequency modes. Each expansion coefficient is separately corrected for the sample size according to suggestions from a Bayesian statistical analysis. As a result, many expansion terms can be utilized to evaluate orientational distributions. Also, unlike other orientational potentials, the uniform distribution is used for a reference distribution in evaluating a potential of mean force for each type of contacting residue pair from its orientational distribution, so that residue-residue orientations can be fully evaluated. It is shown by using decoy sets that the discrimination power of the orientational potential in fold recognition increases by taking account of the Euler angle dependencies and becomes comparable to that of a simple contact potential, and that the total energy potential taken as a simple sum of contact, orientation, and (
,
) potentials performs well to identify the native folds.© 2005 American Institute of Physics.
functions each of which represents the observed location in angular space, and then is estimated in the form of a series expansion with spherical harmonics functions, ignoring high frequency modes that occur, because of the sample size. High frequency modes are statistically less reliable than low frequency modes. Here, unlike other works33,34,35 each expansion term is separately corrected for the sample size according to suggestions from an analysis of Bayesian statistics. As a result, many expansion terms can be utilized to evaluate orientational distributions. A local coordinate system for each residue is defined for fold recognition, based only on main chain atoms to represent directional and rotational relationships between the main chains of contacting residues rather than between the side chains.33,34,35 Results show that a large contribution to the orientational entropy of residue pairs comes from the Euler angle dependencies of the frequency distribution and also from the polar and Euler angle correlations. Then, an energy potential for relative orientations of contacting residues is evaluated for each type of amino acid pair as a potential of mean force from the estimated distributions.
The short-range potential is a secondary structure potential based on peptide dihedral angles. All of these potentials are estimated as potentials of mean force from the observed distributions of residue-residue contacts and of peptide dihedral angles at the residue level in crystal structures of proteins. In the following, energy is represented in kBT units, where kB is the Boltzmann constant and T is temperature.
where ec(ri,rj) is the contact energy between the ith and jth residues, and ri represents all the atomic positions of the ith residue. The pairwise energy potential is represented as the sum of two terms, one of which is the usual contact potential2,3,4 and the other is a potential of mean force for relative orientations between contacting residues that is evaluated here from the statistical distribution of relative orientations,
where
c(ri,rj) represents the degree of contact between the ith and jth residues, e
is the contact energy for residues of types ai and aj in contact, and eoaiaj(ri,rj) is the orientational energy for the relative direction and rotation between amino acids of type ai and aj contact; ai means the amino acid type of the ith residue. Here, it should be noted that the radial distance between residues is described by specifying whether or not these residues are in contact with each other, and that orientational interactions are assumed only for residues that are in contact with each other.
c(ri,rj) takes a value one for residues that are completely in contact, the value zero for residues that are too far from each other, and values between one and zero for residues whose distance is intermediate between those two extremes, about 6.5 Å between geometric centers of their side chain heavy atoms. Previously, this function was defined as a step function for simplicity. Here, it is defined as a switching function as follows; in the equation below to define residue contacts, ri means the position vector of a geometric center of side chain heavy atoms or the C
atom for GLY,
where Sw is a switching function, and r
is the van der Waals radius of a residue of type a which is estimated from the average volume Va occupied by a residue of type a in protein structures with the packing density of hard sphere
; Va are those calculated in Refs. 46 and 47 and listed in Ref. 2. A critical distance to define a residue-residue contact is about 6.5 Å, but it is taken to be larger for bulky residues.
and a residue-type dependent term
e
; r means an average residue here.
The energies
e
for all pairs of the 20 types of residues were recalculated44 from 2129 protein species representatives of the SCOP48 Release 1.53 with the sampling method3 and with the parameters evaluated in Miyazawa and Jernigan4 to correct these values estimated by the Bethe approximation; actually, the estimates of contact energies corrected for the Bethe approximation are divided by 
![[prime]](024901_1_files/prime-script.gif)
0.263 defined in Eq. (34) of that paper4 and used as the values of
e
. In other words, the intrinsic pairwise interaction energies
eij are corrected relative to the hydrophobic energies
eir, and the hydrophobic energies are not corrected at all; see that paper4 for the exact definitions of
eij and
eir. This scheme is employed, so that all the energy potentials in Eq. (1) have magnitudes estimated as the potential of mean force from observed distributions by assuming a Boltzmann distribution.
is essential for a protein to fold by canceling out the large conformational entropy of extended conformations but it is difficult to estimate.2,3 The value –2.55 originally estimated2,3 for e
is used here; as a result, the contact energy e
takes a negative value for all amino acid pairs except for LYS-LYS pair.
,
) and Euler angles (
,
,
) to describe the direction and rotation of one residue relative to another, respectively. A local coordinate system fixed on each residue will be defined later. The potential of mean force for residue orientations is defined as
where faa
(
,
,
,
,
) is a probability density function for a residue of type a
at the orientation (
,
,
,
,
) relative to the residue of type a; it satisfies
and fa
a:
The relationship in respect to the polar angles (
,
) is not simple, but (
,
) can be uniquely calculated from (
,
,
,
,
). Thus, in principle, faa
and fa
a must be equal to each other:
However, in the present statistical estimation of the probability density, the relationship above would be approximately satisfied. Therefore, the potential is evaluated in the form of Eq. (11).
Here it is important to note that this term represents a reference state such that the expected value of the orientational energy for each type of contacting residue pair in the native structures is equal to zero. Thus, this orientational potential represents simply the suitability of a relative orientation between contacting residues, but does not represent at all whether a contact between residues is favorable or not. The latter is supposed to be represented in the present scheme by the usual contact energy e
. The reference distribution of residue-residue orientations for these orientational potentials is the uniform distribution, and not the overall distribution for all types of amino acid pairs employed by others.33,34,35 Therefore, for residue pairs whose distributions coincide with the overall distribution, the latter potentials give always no preference but the present potentials give a preference. This is a desirable behavior for orientational potentials, because such an overall distribution of residue-residue orientations would not be an intrinsic characteristic of non-native conformations but rather of native structures of proteins.
,
,
,
,
) variables.
g is represented as
where Y
is the normalized spherical harmonics function, P
is the associated Legendre function; P
with mp = 0 is the Legendre polynomial. Then, the coefficients in the expansion of Eq. (18) can be calculated from the observed density distribution by
Thus, the coefficient of the first constant term in Eq. (18) that corresponds to the uniform distribution is obvious;
function can be used,33 that is,
and then, the expansion coefficients are calculated as
where (
µ,
µ,
µ,
µ,
µ) is a set of angles observed for the contact µ between residue types a and a
, and wµ is a weight for this contact. The summations in the equations above are over all contacts of amino acid types a versus a
. A contact between amino acid types a and a
is counted as one half of a contact for a versus a
and another half for a
versus a; Naa
+ Na
a is equal to the actual number of contacts between amino acid types a and a
. Thus, a weight wµ is equal to 0.5wc, where wc is a sampling weight for each protein that is described in the section "Datasets of protein structures used." In Eq. (24), residues are regarded to be in contact if the geometric centers of side chains or C
atoms for GLY are within 6.5 Å.
where 
is taken to be
in order to reduce statistical errors resulting from the small size of samples;
in Eq. (31) is a parameter to be optimized. Equation (31) means that more samples are required to determine higher frequency modes. In Eq. (27), the first term becomes more effective than the second term in the limit of small numbers of Naa
, and inversely the second term becomes more effective than the first term in the limit of large numbers of Naa
.
and
where Ocutoff is a cutoff value for expansion terms.
where H is the Heaviside step function which takes a value of one for zero and positive values of the argument and is otherwise zero. Finally the estimate of the probability density faa
(
,
,
,
,
) is cut off at sufficiently low and high values in such a way that its logarithm takes a value within an appropriate range; for example, –7
–ln faa
(
,
,
,
,
) + ln(c
g00000)
1.
, and a repulsive packing potential e
,
where Sw is defined by Eq. (7). The repulsive packing potentials e
for the 20 types of residues are estimated from the observed distributions of the numbers of contacting residues in dense regions of protein structures by assuming a Boltzmann distribution.3 N(ai,n
) is the observed number of residues of type ai that are surrounded by n
residues in the database of protein structures. q
is a coordination number, which is defined as the maximum feasible number of contacting residues around a residue, for the amino acid of type ai.
in Eq. (40) is a small value (
= 10–6) that is added to avoid the divergence of the logarithm function. The observed distribution N(ai,n
) used here is one44 compiled from 2129 protein species representatives of the SCOP48 Release 1.53 with our sampling method.3
(
i,
i) over all residues:
For this secondary structure potential, a 10° mesh over (
,
) space is used to count frequencies of amino acids observed in protein native structures, and this intraresidue potential e
for each amino acid type a is evaluated as
where Na(
,
) is the number of amino acids of type a at (
,
) observed in protein native structures, and Na is their sum over the entire (
,
) space, that is, the number of amino acids of type a. The second term is a constant term that corresponds to a reference energy, so that the (
,
) energy expected for each type of residue in the native structures is equal to zero.
,
) used here is one44 compiled from 2129 protein species representatives of the SCOP48 Release 1.53 with the sampling method3 used to reduce the weights of contributions of structures having high sequence identity.
, all
,
/
,
+
, and multidomain proteins. Classes of membrane and cell surface proteins, small proteins, peptides, and designed proteins are not used. Proteins whose structures50 were determined by NMR or having stated resolutions worse than 2.5 Å are removed to assure that the quality of proteins used is high. Also, proteins whose coordinate sets consist either of only C
atoms, or include many unknown residues, or lack many atoms or residues, are removed. In addition, proteins shorter than 50 residues are also removed. As a result, the set of species representatives includes 4435 protein domains; this dataset is named here as dataset A.
, and C
to represent the orientational relationship between the main chains of contacting residues rather than representing33,34,35 those relationships between the side chains. The origin O of the local coordinate system is located at the C
position of each residue. The Y and Z axes are ones formed by the vector product and the sum of the unit vectors from N to C
and from C
to C
, respectively. That is, the Y and Z axes are taken to be perpendicular to and in the plane of the three atoms N, C
, and C
, respectively. These form a right-handed coordinate system. There are two degrees of directional freedom and three degrees of rotational freedom in the relative orientation of one residue to another in contacting residue pairs. The relative direction and rotation of one residue to another in contacting residues are represented by polar angles (
,
) and Euler angles (
,
,
), respectively.
Figure 1.
, l
, and k
that are the maximum values of lp, le, and ke which are the highest frequency modes to be estimated. However, even though each of (lp, mp, le, me, ke) is sufficiently small, their combinations may correspond to high frequency modes. The number of modes lower than or equal to (lp, mp, le, me, ke), Olpmplemeke defined by Eq. (32), is used as a one-dimensional projection of (lp, mp, le, me, ke) on a frequency axis. To remove high frequency modes, only frequency modes less than and equal to Ocutoff are utilized. In addition, only significant terms in the expansion of Eq. (35) whose coefficients take larger absolute values than the value of a cutoff, ccutoffc
, are used to estimate the distributions of relative residue-residue orientations.
g00000) = 6.900 in kB units; kB is the Boltzmann constant. The estimate of orientational entropy for each type of residue pair and the number of significant terms required for the estimation depends on the resolution of the potentials, that is, the values of l
, l
, and k
, and also the cutoff parameters of Ocutoff and ccutoff, and
for the correction for a small sample size. Orientational entropies estimated with various values of the parameters are shown in Fig. 2, and the numbers of significant terms required are plotted in Fig. 3. Orientational entropies and the numbers of significant terms averaged with a weight of the number of contacts over all residue pairs are plotted against the cutoff value of the coefficient for expansion terms, ccutoff. Triples of digits near curves in the figure indicate the values of (l
, l
, and k
). The entropy reduction is large when the resolution of the potential increases. The estimate of orientational entropy with l
= l
= k
= 4,5,6 almost converges at the cutoff value, ccutoff = 0.025. The number of significant terms decreases almost exponentially with the cutoff value, ccutoff; see Fig. 3. The number of significant terms required for each type of residue pair is related to the orientational entropy for the residue pair. Figure 4 shows the correlation between the orientational entropies and the number of significant terms. As expected, many significant terms tend to be required for residue pairs whose orientational entropies are large. The frequency distribution of the number of significant terms for the 210 types of residue pairs is shown in Fig. 5, indicating that the orientational distribution strongly depends on the type of residue pair.
Figure 2.
Figure 3.
Figure 4.
Figure 5.
–ln faa![[prime]](024901_1_files/prime-script.gif)
for each type of residue pair are listed in Table I. Residue type "r" in Table I means any type of residue. As already noted in the Methods section, in principle this matrix is symmetrical. The table shows that the matrix is almost symmetrical, indicating the good quality of their statistical estimates. These values in this table are calculated with l
= l
= k
= 6, Ocutoff = O33333 = 1792,
= 0.2, and ccutoff = 0.025.
g00000) = 6.900 for the uniform distribution. The abscissa indicates the amino acid pair identification number; amino acid types are numbered in the order of amino acids written along the abscissa. Thus, the amino acid pair identification number one means a CYS-CYS pair and 400 means a PRO-PRO pair. The lowest solid line is for a distribution estimated with l
= l
= k
= 6. The highest solid line shows the orientational entropies estimated with l
= 6, l
= k
= 0, and therefore the contribution to the total entropies from polar angle dependences. The middle line shows the orientational entropies estimated by subtracting the entropy, 6.900, for the uniform distribution from the sum of entropies estimated with l
= 6, l
= k
= 0, and with l
= 0, l
= k
= 6. In other words, the difference between the highest solid line and the middle line shows contributions to the total entropies from Euler angle dependences. The difference between the middle and lowest solid lines corresponds to contributions from the cross correlation between polar angle and Euler angle dependences. Cutoff values for significant terms in the expansion are Ocutoff = 1792 and ccutoff = 0.025. The parameter for the correction for a small sample size is
= 0.2.
Figure 6.
positions for these decoys were generated by exhaustively enumerating ten selectively chosen residues in each protein using a four-state off-lattice model.36
helical proteins. The main chains for these decoys were generated using a fragment insertion simulated annealing procedure to assemble nativelike structures from fragments of unrelated protein structures with similar local sequences using Bayesian scoring functions.37
atoms in overlaps between the native structure and decoys. The rank probabilities, Pe in the energy scale and Pr in the RMSD scale, are defined as
where
and
E are the mean and the standard deviation of energies of decoys, and
and
rmsd are the mean and the standard deviation of RMSD of decoys. RMSDlowest is the RMSD of the lowest energy fold.
= k
= 0, Euler angle dependencies are completely ignored. Thus, the comparisons of the performances of discrimination between the cases of l
= k
= 0 and l
,k
0 indicate how important the Euler angle dependencies of relative residue orientations are in fold recognition. In Tables II and III, the performances of discrimination are compared among some combinations of parameters l
and l
for both the decoy set groups of monomeric proteins and immunoglobulin domains; k
was taken to be equal to l
. The full lists of these tables are provided in the auxiliary material.52 Here, the potentials consist of the orientational potential eo only. In these tables, the performances of discrimination are evaluated by the number of decoy sets (no. of tops) in which the native structure is the lowest energy fold, and also the averages over the decoy sets of the logarithms of rank probabilities Pe in the energy scale and Pr in the RMSD scale, and the mean Z scores Ze of the native folds in the energy scale.
= k
= 0. Both the monomeric protein decoy set group and immunoglobulin decoy set group show similar characteristics; when the resolution, that is, the value of l
increases up to 7, the number of top ranks tends to increase and the means of the log rank probabilities,
in the energy scale and
in the RMSD scale, tend to be improved with more negative values. The potentials with 7<l
<14 appear to yield worse results than that of l
= 7. At l
= 14, the orientational potential shows a similar performance to that for l
= 7. These results indicate that the improvement in the performance of fold recognition is not monotonic with the number of expansion terms, and also that there may be an intrinsic periodicity in the polar-angle distribution of residue-residue orientations.
= 0 is used, so that polar-angle dependencies are completely ignored. The best result in the cases of 4
l
= k
7 is obtained in the case of the highest resolution, l
= 0,l
= k
= 7. In comparison with the results of l
= 7,l
= k
= 0, some improvement is clearly observed for the immunoglobulin decoy set group, although the performance of z score Ze is slightly worse for the monomeric protein decoy set group. The native structures of immunoglobulin domains consist mainly of
sheets. Hydrogen bonds between
strands are essential to maintain
sheets. In addition to hydrogen bonds, residue-residue packing between a
sheet and other parts may require relatively stringent orientations between residues, especially for Euler angles.
= l
= k
~6,Ocutoff~1792, probably owing to the sample size. However, the comparison of the results for l
= 7,l
= k
= 0, l
= l
= k
= 7, Ocutoff = O77000 = 64, and l
= l
= k
= 7, Ocutoff = O00777 = 960 indicates that including small numbers of lower orders of cross terms between polar and Euler angles does not lead to an improvement in performance and sufficient numbers of cross terms are required to improve the performance. This may be one of reasons why Onizuka et al.33 observed worse rather than better performances by taking account of Euler angle dependencies in orientational distributions.
for a small sample correction are shown in Table III(c). The potential shows a better performance around
= 0.2; Naa
/
18 000(= 1 467 302/400/0.2). This means that the first digit will be significant in the estimated values of the expansion coefficients for the terms of Olpmplemeke = 1792, because 
in Eq. (31) becomes about 0.1 for Olpmplemeke = 1792. Thus, the values of
= 0.2 and Ocutoff = 1792 would be consistent with one another.
= l
= k
= 6 with Ocutoff = 1792, ccutoff = 0.025, and
= 0.2 are employed here, although Ocutoff = 960 is also good, and could be chosen if one wants to reduce the number of expansion terms. The discrimination of the native structures is successful for 37 of the 79 monomeric decoy sets and for 59 of the 81 immunoglobulin decoy sets using the orientational energy.
= 7,l
= k
= 0; both are the best case for each decoy set group if only polar-angle dependencies are taken into account. Open circles and broken lines are for the case of l
= l
= k
= 6. For most decoy sets, the performance in the discrimination of the native structures is improved.
Figure 7.
over all decoy sets of the correlation coefficients of rank order between the energies and RMSDs of the decoys are listed for reference.
ec and e
+
ec, indicating that residues in the non-native structures are not well positioned with respect to the relative orientation between them.
ec without the orientational energy is slightly better than that of the orientational energy eo only, but it is significantly worse for the immunoglobulin decoy set group. Including the collapse energy e
causes the performance to become even worse, indicating that the contact potential without the orientational potential does not work at all for these decoy sets. In the case of multimeric proteins, the evaluation of contact energies for residues on the surface of the domain requires other domains and chains to be present. When other domains and chains are not available for a given domain, residue-residue contacts between domains and chains cannot be evaluated. Thus, as already mentioned, unlike short-range potentials, the true ground state of those multimeric proteins in the contact potential requires all of the chains to be present. Especially in the case of immunoglobulin molecules, the interface among constant and variable domains occupies a large portion of the surface of the domains. Thus, the potential consisting of the simple contact energy shows an extremely poor performance for the immunoglobulin decoy sets. On the other hand, the orientational potential only measures how good or bad the relative orientations between contacting residues are, and thus its evaluation does not necessarily require the presence of all domains and chains in multimeric proteins, although it would be more precisely measured if all contacting residues were known; as seen from Eq. (11), the expected value of the orientational energy for contacting residues in native protein structures is adjusted to be equal to zero.
to the residue-type dependent contact potential
ec except for the case of the energy
ec + eo. This fact indicates that optimizing potentials is not simple.
,
) potential, strongly indicating that the short-range interactions should not be ignored in fold recognition.
Figure 8.
functions each of which represents the observed location in angular space. Then, the distribution of residue-residue orientations is estimated in the expansion with spherical harmonics functions and the coefficients of the expansion terms are estimated by inversely transforming the observed distribution represented as the sum of
functions.
and which is an extremely important energy for a protein to fold that compensates for the large conformational entropy loss of compact conformations.
ski, and J. Skolnick, Protein Sci. 4, 2107 (1995). [ISI] [MEDLINE] [ChemPort]
first citation in article
dziej, M. R. Pincus, R. J. Wawak, S. Rackovsky, and H. A. Scheraga, J. Comput. Chem. 18, 849 (1997). [ISI] [ChemPort]
first citation in article
dziej, and H. A. Scheraga, J. Comput. Chem. 18, 874 (1997). [Inspec] [ISI] [ChemPort]
first citation in article
mierkiewicz, C. Czaplewski et al., J. Comput. Chem. 19, 259 (1998). [ISI] [ChemPort]
first citation in article
ski, A. Godzik, and J. Skolnick, J. Chem. Phys. 98, 7420 (1993). [ISI] [ChemPort]
first citation in article
ski, A. Godzik, and J. Skolnick, Proteins 26, 271 (1996). [ISI] [MEDLINE] [ChemPort]
first citation in article
ski, and J. Skolnick, Biophys. J. 85, 1145 (2003). [Inspec] [MEDLINE] [ChemPort]
first citation in article
Full figure (5 kB)Fig. 1. The definitions of a local coordinate system affixed to each residue. The origin O of the local coordinate system is located at the C
position of each residue. The Y and Z axes are ones formed by the vector product and the sum of the unit vectors from N to C
and from C
to C
, respectively. The X axis is taken to form a right-handed coordinate system. The relative direction and rotation of one residue to the other in contacting residues are represented by polar angles (
,
) and Euler angles (
,
,
), respectively. First citation in article
Full figure (9 kB)Fig. 2. Dependencies of orientational entropies on parameters in the estimation of the orientational potentials. The orientational entropies averaged over all types of residue pairs with the weight of the number of contacts Naa
for each type of residue pair are plotted against the cutoff values for the expansion coefficients. Triplets of digits near solid lines indicate the values of (l
,l
,k
); for the non-solid lines, l
= l
= k
= 6 is used. The other parameters are
= 0.2 for all lines, and Ocutoff = O33333 = 1792 for solid lines. The dotted line shows the case of Ocutoff = O00777 = 960, the dotted broken line is for Ocutoff = O11555 = 1584, and the broken line is for Ocutoff = O22444 = 2025. First citation in article
Full figure (9 kB)Fig. 3. Dependencies of the number of significant expansion terms on estimation parameters for the orientational potentials. The numbers of significant terms averaged over all types of residue pairs with the weight of the number of contacts Naa
for each type of residue pair are plotted against the cutoff values for expansion coefficients. Triplets of digits near curves indicate the values of (l
,l
,k
); for the non-solid lines, l
= l
= k
= 6 is used. The other parameters are
= 0.2 for all lines, and Ocutoff = O33333 = 1792 for solid lines. The dotted line shows the case of Ocutoff = O00777 = 960, the dotted broken line is for Ocutoff = O11555 = 1584, and the broken line is for Ocutoff = O22444 = 2025. First citation in article
Full figure (9 kB)Fig. 4. Correlation between the number of significant expansion terms and orientational entropy. Those values for 210 different types of residue pairs, which are averaged over residue pairs (a,a
) and (a
,a), are plotted here. The orientational potentials are evaluated with l
= l
= k
= 6, Ocutoff = 1792,
= 0.2, and ccutoff = 0.025. First citation in article
Full figure (8 kB)Fig. 5. Histograms of the numbers of significant expansion terms for the 210 types of residue pairs; the numbers of significant expansion terms are averaged over residue pairs (a,a
) and (a
,a). The size of a bin is 200. These data are those for l
= l
= k
= 6, Ocutoff = 1792,
= 0.2, and ccutoff = 0.025. First citation in article
Full figure (22 kB)Fig. 6. Orientational entropies,
–ln faa![[prime]](024901_1_files/prime-script.gif)
, for three types of distributions are plotted against the identification number of amino acid pair (a,a
). Amino acid types are numbered in the order of amino acids written along the abscissa; see text for details. The broken line shows the entropy, 6.900, for a uniform distribution. The lowest solid line shows the distribution with polar and Euler angle dependencies, l
= l
= k
= 6. The highest solid line shows the distribution with l
= 6,l
= k
= 0 that depends on polar angles only. The middle solid line shows the distribution that depends on polar angles with l
= 6, and on Euler angles with l
= k
= 6, but ignores any correlation between polar and Euler angles. The values of other parameters are Ocutoff = 1792,
= 0.2, and ccutoff = 0.025. First citation in article
Full figure (27 kB)Fig. 7. The effects of Euler angle dependencies in the orientational potentials on the performance for fold recognition. The value of logarithm of rank probability Pe in the energy scale for each decoy set is plotted against the identification number of the decoy set that is listed in Table V and tables in the auxiliary material (Ref. 52). The left figure (a) corresponds to the decoy set group of monomeric proteins in "Decoys'R'Us" (Ref. 39), and the right figure (b) to the immunoglobulin decoy set group. The potential function used here consists of orientational potentials eo only. Cross marks and solid lines show the case for the orientational potential with l
= 7, l
= k
= 0, Ocutoff =
, and ccutoff = 0.025. Open circles and broken lines show the case for the orientational potential with l
= l
= k
= 6, Ocutoff = 1792, and ccutoff = 0.025. First citation in article
Full figure (28 kB)Fig. 8. The effects of the orientational potentials on performance for fold recognition. The value of logarithm of rank probability Pe in the energy scale for each decoy set is compared between two types of potential functions, one of which includes the orientational potential. The abscissa shows the identification number of each decoy set that is listed in Table V and tables in the auxiliary material (Ref. 52). (a) The potentials for monomeric protein decoy sets consist of e
+
ec for cross marks and solid lines, and e
+
ec + eo for open circles and broken lines. (b) The potentials for immunoglobulin decoy sets consist of
ec + er for cross marks and solid lines, and eo + er for open circles and broken lines. The orientational energies are evaluated with l
= l
= k
= 6, Ocutoff = 1792,
= 0.2, ccutoff = 0.025. First citation in article
Table I. Orientational entropy, –ln faa![]() , in kB units for each residue pair (a,a ); a (a ) is shown in each row (column), r is for all types of residues, and the parameters used are l = l = k = 6, Ocutoff = 1792, = 0.2, and ccutoff = 0.025. | |||||||||||||||||||||
| C | M | F | I | L | V | W | Y | A | G | T | S | Q | N | E | D | H | R | K | P | r | |
| C | 3.97 | 4.06 | 4.52 | 4.31 | 4.54 | 4.33 | 3.62 | 4.33 | 4.38 | 4.74 | 4.40 | 4.43 | 4.02 | 4.25 | 3.96 | 4.00 | 3.96 | 4.26 | 4.01 | 4.50 | 5.12 |
| M | 4.07 | 4.47 | 4.69 | 4.44 | 4.58 | 4.45 | 4.23 | 4.64 | 4.50 | 4.88 | 4.48 | 4.57 | 4.24 | 4.42 | 4.15 | 4.16 | 4.21 | 4.35 | 4.04 | 4.78 | 4.97 |
| F | 4.51 | 4.71 | 4.92 | 4.73 | 4.88 | 4.68 | 4.55 | 4.86 | 4.84 | 5.09 | 4.82 | 4.83 | 4.51 | 4.82 | 4.60 | 4.60 | 4.60 | 4.67 | 4.50 | 4.90 | 5.16 |
| I | 4.31 | 4.45 | 4.72 | 4.38 | 4.52 | 4.34 | 4.42 | 4.66 | 4.36 | 4.91 | 4.47 | 4.57 | 4.27 | 4.47 | 4.13 | 4.27 | 4.34 | 4.44 | 4.10 | 4.82 | 4.77 |
| L | 4.53 | 4.57 | 4.88 | 4.52 | 4.68 | 4.55 | 4.60 | 4.78 | 4.43 | 5.01 | 4.62 | 4.64 | 4.35 | 4.65 | 4.20 | 4.41 | 4.68 | 4.56 | 4.28 | 5.06 | 4.86 |
| V | 4.31 | 4.46 | 4.69 | 4.33 | 4.55 | 4.21 | 4.53 | 4.65 | 4.33 | 4.90 | 4.44 | 4.55 | 4.43 | 4.60 | 4.22 | 4.28 | 4.43 | 4.48 | 4.16 | 4.80 | 4.78 |
| W | 3.59 | 4.23 | 4.53 | 4.43 | 4.59 | 4.53 | 3.87 | 4.46 | 4.78 | 4.79 | 4.46 | 4.51 | 4.06 | 4.27 | 4.29 | 4.40 | 4.09 | 4.28 | 4.01 | 4.56 | 5.21 |
| Y | 4.34 | 4.61 | 4.85 | 4.63 | 4.74 | 4.62 | 4.44 | 4.87 | 4.85 | 5.11 | 4.78 | 4.80 | 4.46 | 4.86 | 4.76 | 4.91 | 4.71 | 4.66 | 4.38 | 4.88 | 5.23 |
| A | 4.34 | 4.50 | 4.85 | 4.33 | 4.42 | 4.29 | 4.76 | 4.85 | 3.76 | 4.88 | 4.46 | 4.45 | 4.37 | 4.52 | 4.10 | 4.05 | 4.60 | 4.53 | 4.20 | 4.96 | 4.78 |
| G | 4.70 | 4.88 | 5.12 | 4.89 | 4.98 | 4.88 | 4.84 | 5.13 | 4.88 | 5.47 | 5.12 | 5.31 | 5.00 | 5.30 | 4.90 | 4.95 | 5.06 | 5.22 | 4.97 | 5.35 | 5.61 |
| T | 4.37 | 4.46 | 4.82 | 4.44 | 4.62 | 4.44 | 4.44 | 4.80 | 4.46 | 5.13 | 4.23 | 4.54 | 4.19 | 4.63 | 3.95 | 4.16 | 4.52 | 4.62 | 4.16 | 4.91 | 4.95 |
| S | 4.42 | 4.56 | 4.87 | 4.56 | 4.62 | 4.54 | 4.50 | 4.82 | 4.41 | 5.30 | 4.54 | 4.67 | 4.42 | 4.78 | 4.24 | 4.33 | 4.59 | 4.76 | 4.48 | 4.98 | 5.09 |
| Q | 4.02 | 4.20 | 4.51 | 4.21 | 4.31 | 4.38 | 4.07 | 4.47 | 4.36 | 5.02 | 4.19 | 4.39 | 4.15 | 4.39 | 3.84 | 4.03 | 4.32 | 4.27 | 3.91 | 4.72 | 4.86 |
| N | 4.23 | 4.41 | 4.84 | 4.48 | 4.61 | 4.58 | 4.30 | 4.85 | 4.52 | 5.28 | 4.65 | 4.77 | 4.39 | 4.84 | 4.28 | 4.45 | 4.59 | 4.71 | 4.36 | 4.97 | 5.22 |
| E | 3.96 | 4.12 | 4.59 | 4.12 | 4.19 | 4.18 | 4.29 | 4.81 | 4.10 | 4.93 | 3.95 | 4.22 | 3.81 | 4.29 | 3.72 | 3.83 | 4.58 | 4.39 | 4.06 | 4.54 | 4.71 |
| D | 3.96 | 4.14 | 4.61 | 4.24 | 4.38 | 4.28 | 4.42 | 4.95 | 4.06 | 4.95 | 4.14 | 4.32 | 4.03 | 4.44 | 3.83 | 4.13 | 4.71 | 4.85 | 4.46 | 4.67 | 4.95 |
| H | 3.98 | 4.20 | 4.58 | 4.33 | 4.66 | 4.43 | 4.09 | 4.73 | 4.60 | 5.07 | 4.51 | 4.53 | 4.30 | 4.60 | 4.58 | 4.71 | 4.40 | 4.44 | 4.18 | 4.63 | 5.27 |
| R | 4.26 | 4.36 | 4.68 | 4.42 | 4.55 | 4.46 | 4.31 | 4.72 | 4.54 | 5.25 | 4.63 | 4.75 | 4.27 | 4.73 | 4.37 | 4.87 | 4.47 | 4.66 | 4.05 | 4.88 | 5.08 |
| K | 3.97 | 4.06 | 4.51 | 4.09 | 4.26 | 4.15 | 3.99 | 4.42 | 4.22 | 5.00 | 4.19 | 4.49 | 3.94 | 4.38 | 4.06 | 4.48 | 4.18 | 4.07 | 3.85 | 4.53 | 4.81 |
| P | 4.47 | 4.76 | 4.94 | 4.80 | 5.06 | 4.79 | 4.59 | 4.91 | 4.97 | 5.35 | 4.89 | 4.96 | 4.76 | 5.00 | 4.58 | 4.66 | 4.68 | 4.89 | 4.54 | 5.19 | 5.48 |
| r | 5.11 | 4.97 | 5.15 | 4.77 | 4.86 | 4.77 | 5.21 | 5.24 | 4.78 | 5.61 | 4.96 | 5.09 | 4.88 | 5.23 | 4.72 | 4.96 | 5.26 | 5.08 | 4.81 | 5.48 | 5.18 |
| Table II. Dependencies of the performance of fold recognition on the resolution of the orientational potential; dependencies on polar or Euler angles. | |||||||||
| (a) Dependencies on polar angles | |||||||||
l | ccutoff | l = k = 0, = 0.2, Ocutoff = | |||||||
| 79 monomeric decoy sets | 81 Ig decoy sets | ||||||||
| No. of tops | | | | No. of tops | | | | ||
| 4 | 0.0 | 23 | –2.79 | –2.09 | –1.41 | 29 | –2.66 | –1.88 | –1.45 |
| 0.025 | 22 | –2.77 | –2.02 | –1.41 | 28 | –2.67 | –1.82 | –1.45 | |
| 5 | 0.0 | 31 | –3.35 | –2.57 | –1.84 | 31 | –2.68 | –1.96 | –1.46 |
| 0.025 | 31 | –3.37 | –2.57 | –1.84 | 30 | –2.66 | –1.93 | –1.45 | |
| 6 | 0.0 | 27 | –3.23 | –2.55 | –1.77 | 34 | –2.69 | –2.19 | –1.45 |
| 0.025 | 28 | –3.24 | –2.58 | –1.76 | 34 | –2.68 | –2.16 | –1.44 | |
| 7 | 0.0 | 30 | –3.45 | –2.60 | –1.98 | 45 | –2.93 | –2.52 | –1.57 |
| 0.025 | 31 | –3.46 | –2.60 | –1.98 | 45 | –2.94 | –2.53 | –1.58 | |
| 8 | 0.0 | 28 | –3.37 | –2.59 | –1.91 | 38 | –2.73 | –2.24 | –1.48 |
| 0.025 | 27 | –3.36 | –2.55 | –1.89 | 39 | –2.74 | –2.27 | –1.49 | |
| 9 | 0.0 | 25 | –3.38 | –2.43 | –1.92 | 32 | –2.66 | –2.06 | –1.54 |
| 0.025 | 24 | –3.36 | –2.44 | –1.90 | 33 | –2.68 | –2.08 | –1.56 | |
| 10 | 0.0 | 27 | –3.32 | –2.55 | –1.83 | 37 | –2.55 | –2.13 | –1.52 |
| 0.025 | 26 | –3.31 | –2.49 | –1.82 | 36 | –2.52 | –2.14 | –1.55 | |
| 11 | 0.0 | 28 | –3.44 | –2.67 | –1.94 | 39 | –2.68 | –2.16 | –1.71 |
| 0.025 | 30 | –3.48 | –2.82 | –1.92 | 39 | –2.67 | –2.18 | –1.72 | |
| 12 | 0.0 | 25 | –3.29 | –2.45 | –1.78 | 41 | –2.70 | –2.29 | –1.76 |
| 0.025 | 24 | –3.30 | –2.50 | –1.77 | 40 | –2.70 | –2.29 | –1.77 | |
| 13 | 0.0 | 30 | –3.39 | –2.73 | –1.80 | 39 | –2.80 | –2.19 | –1.83 |
| 0.025 | 29 | –3.38 | –2.73 | –1.80 | 40 | –2.80 | –2.20 | –1.83 | |
| 14 | 0.0 | 31 | –3.42 | –2.89 | –1.84 | 46 | –2.87 | –2.48 | –1.91 |
| 0.025 | 30 | –3.44 | –2.82 | –1.82 | 47 | –2.89 | –2.53 | –1.89 | |
| (b) Dependencies on Euler angles | |||||||||
l![]() k | ccutoff | l = 0, = 0.2, Ocutoff = | |||||||
| 79 monomeric decoy sets | 81 Ig decoy sets | ||||||||
| No. of tops | | | | No. of tops | | | | ||
| 4 | 0.0 | 25 | –3.18 | –2.68 | –1.78 | 33 | –2.63 | –2.26 | –1.31 |
| 0.025 | 25 | –3.14 | –2.71 | –1.75 | 33 | –2.61 | –2.31 | –1.29 | |
| 5 | 0.0 | 25 | –3.26 | –2.79 | –1.77 | 44 | –2.85 | –2.55 | –1.65 |
| 0.025 | 26 | –3.23 | –2.80 | –1.74 | 44 | –2.84 | –2.58 | –1.61 | |
| 6 | 0.0 | 26 | –3.25 | –2.79 | –1.83 | 47 | –3.04 | –2.78 | –1.84 |
| 0.025 | 24 | –3.20 | –2.57 | –1.81 | 45 | –3.00 | –2.79 | –1.77 | |
| 7 | 0.0 | 30 | –3.31 | –2.84 | –1.88 | 52 | –3.03 | –2.94 | –1.82 |
| 0.025 | 28 | –3.24 | –2.70 | –1.83 | 52 | –3.02 | –2.92 | –1.73 | |
| Table III. Dependencies of the performance of fold recognition on the resolution of the orientational potential; interdependencies between polar and Euler angles. | |||||||||
| (a) Dependencies on lmax and cutoff Ocutoff | |||||||||
l | Ocutoff | l = k = l , = 0.2, ccutoff = 0.025 | |||||||
| 79 monomeric decoy sets | 81 Ig decoy sets | ||||||||
| No. of tops | | | | No. of tops | | | | ||
| 4 | 960 | 34 | –3.72 | –3.24 | –2.18 | 47 | –2.97 | –2.81 | –1.59 |
| 1792 | 36 | –3.77 | –3.27 | –2.21 | 47 | –3.01 | –2.79 | –1.67 | |
| 5 | 960 | 36 | –3.82 | –3.38 | –2.27 | 56 | –3.18 | –3.02 | –1.81 |
| 1792 | 38 | –3.87 | –3.22 | –2.33 | 55 | –3.23 | –2.92 | –1.96 | |
| 6 | 960 | 37 | –3.83 | –3.33 | –2.32 | 60 | –3.24 | –3.23 | –1.92 |
| 1792 | 37 | –3.88 | –3.22 | –2.38 | 59 | –3.27 | –3.11 | –2.00 | |
| 2025 | 38 | –3.85 | –3.25 | –2.36 | 56 | –3.21 | –3.05 | –1.99 | |
| 7 | 64 | 27 | –3.53 | –2.95 | –1.93 | 30 | –2.63 | –2.04 | –1.46 |
| 960 | 36 | –3.85 | –3.22 | –2.34 | 57 | –3.22 | –3.11 | –1.93 | |
| 1792 | 38 | –3.91 | –3.31 | –2.42 | 53 | –3.20 | –2.94 | –2.02 | |
| 2025 | 37 | –3.87 | –3.29 | –2.40 | 54 | –3.20 | –3.02 | –2.04 | |
| (b) Dependencies on cutoff ccutoff | |||||||||
l = k = l , = 0.2, Ocutoff = 960 | |||||||||
l | ccutoff | 79 monomeric decoy sets | 81 Ig decoy sets | ||||||
| No. of tops | | | | No. of tops | | | | ||
| 5 | 0.0 | 35 | –3.81 | –3.33 | –2.27 | 55 | –3.17 | –2.96 | –1.83 |
| 0.025 | 36 | –3.82 | –3.38 | –2.27 | 56 | –3.18 | –3.02 | –1.81 | |
| 6 | 0.0 | 34 | –3.80 | –3.24 | –2.32 | 60 | –3.26 | –3.25 | –1.95 |
| 0.025 | 37 | –3.83 | –3.33 | –2.32 | 60 | –3.24 | –3.23 | –1.92 | |
| 7 | 0.0 | 34 | –3.82 | –3.11 | –2.33 | 59 | –3.25 | –3.17 | –1.96 |
| 0.025 | 36 | –3.85 | –3.22 | –2.34 | 57 | –3.22 | –3.11 | –1.93 | |
l | ccutoff | l = k = l , = 0.2, Ocutoff = 1792 | |||||||
| 5 | 0.0 | 38 | –3.88 | –3.30 | –2.34 | 56 | –3.23 | –2.93 | –1.96 |
| 0.025 | 38 | –3.87 | –3.22 | –2.33 | 55 | –3.23 | –2.92 | –1.96 | |
| 6 | 0.0 | 37 | –3.87 | –3.35 | –2.40 | 60 | –3.28 | –3.14 | –2.01 |
| 0.025 | 37 | –3.88 | –3.22 | –2.38 | 59 | –3.27 | –3.11 | –2.00 | |
| 7 | 0.0 | 39 | –3.92 | –3.27 | –2.43 | 55 | –3.20 | –3.05 | –2.05 |
| 0.025 | 38 | –3.91 | –3.31 | –2.42 | 53 | –3.20 | –2.94 | –2.02 | |
(c) Dependencies on a parameter for small sample correction, | |||||||||
l = l = k = 6, ccutoff = 0.025 | |||||||||
| Ocutoff | | 79 monomeric decoy sets | 81 Ig decoy sets | ||||||
| No. of tops | | | | No. of tops | | | | ||
| 960 | 0.1 | 35 | –3.82 | –3.26 | –2.32 | 60 | –3.25 | –3.23 | –1.93 |
| 0.2 | 37 | –3.83 | –3.33 | –2.32 | 60 | –3.24 | –3.23 | –1.92 | |
| 1 | 34 | –3.78 | –3.23 | –2.28 | 58 | –3.22 | –3.19 | –1.89 | |
| 1792 | 0.1 | 36 | –3.86 | –3.15 | –2.39 | 59 | –3.27 | –3.11 | –2.00 |
| 0.2 | 37 | –3.88 | –3.22 | –2.38 | 59 | –3.27 | –3.11 | –2.00 | |
| 1 | 36 | –3.85 | –3.18 | –2.34 | 57 | –3.24 | –3.05 | –1.97 | |
| Table IV. Performance of each potential component in fold recognition. | ||||||||||||||||
| (a) For the 79 monomeric decoy sets | ||||||||||||||||
| Potentialsa | No. of top ranks | Mean | Mean | Mean | Mean | Median | Median | Mean | ||||||||
e | e | eo | er | es | Total No. = 79 | | | | | Ze | Zrmsd | b | ||||
| eo | 37 | –3.88 | –3.22 | –2.38 | –2.49 | –2.09 | –1.65 | 0.33 | ||||||||
| eo | + | er | 35 | –3.79 | –3.08 | –2.32 | –2.33 | –2.01 | –1.49 | 0.33 | ||||||
| eo | + | es | 53 | –4.00 | –3.99 | –2.96 | –3.13 | –3.22 | –2.59 | 0.35 | ||||||
| eo | + | er | + | es | 53 | –3.98 | –3.99 | –2.93 | –3.13 | –3.16 | –2.59 | 0.34 | ||||
ec | 36 | –4.12 | –3.20 | –2.56 | –2.12 | –2.37 | –1.63 | 0.33 | ||||||||
ec | + | er | 41 | –3.90 | –3.12 | –2.23 | –2.03 | –2.04 | –1.74 | 0.32 | ||||||
ec | + | eo | 52 | –4.53 | –4.24 | –3.18 | –3.19 | –2.79 | –2.60 | 0.37 | ||||||
ec | + | eo | + | er | 52 | –4.38 | –4.04 | –2.95 | –3.01 | –2.54 | –2.50 | 0.37 | ||||
ec | + | eo | + | es | 58 | –4.25 | –4.30 | –3.51 | –3.38 | –3.48 | –3.04 | 0.37 | ||||
ec | + | eo | + | er | + | es | 57 | –4.15 | –4.24 | –3.35 | –3.35 | –3.17 | –2.80 | 0.37 | ||
e | + | ec | 36 | –4.05 | –3.29 | –2.68 | –2.32 | –2.61 | –1.86 | 0.32 | ||||||
e | + | ec | + | er | 38 | –4.18 | –3.50 | –2.53 | –2.50 | –2.49 | –2.14 | 0.32 | ||||
e | + | ec | + | eo | 58 | –4.79 | –4.88 | –4.38 | –3.92 | –4.08 | –3.55 | 0.40 | ||||
e | + | ec | + | eo | + | er | 57 | –4.73 | –4.69 | –4.13 | –3.74 | –3.76 | –3.41 | 0.40 | ||
e | + | ec | + | eo | + | es | 61 | –4.63 | –4.63 | –4.45 | –3.68 | –4.11 | –3.41 | 0.39 | ||
e | + | ec | + | eo | + | er | + | es | 59 | –4.49 | –4.49 | –4.21 | –3.56 | –3.86 | –3.10 | 0.39 |
| (b) For the 81 immunogloblin decoy sets | ||||||||||||||||
| Potentialsa | No. of top ranks | Mean | Mean | Mean | Mean | Median | Median | Mean | ||||||||
e | e | eo | er | es | Total No. = 81 | | | | | Ze | Zrmsd | b | ||||
| eo | 59 | –3.27 | –3.11 | –2.00 | –2.74 | –2.03 | –2.55 | 0.38 | ||||||||
| eo | + | er | 62 | –3.35 | –3.23 | –2.15 | –2.85 | –2.27 | –2.61 | 0.36 | ||||||
| eo | + | es | 67 | –3.36 | –3.42 | –3.14 | –3.00 | –3.27 | –2.69 | 0.39 | ||||||
| eo | + | er | + | es | 68 | –3.38 | –3.46 | –3.29 | –3.03 | –3.44 | –2.71 | 0.37 | ||||
ec | 6 | –1.55 | –1.38 | –0.52 | –0.65 | –0.51 | –0.47 | 0.38 | ||||||||
ec | + | er | 36 | –2.78 | –2.29 | –1.02 | –1.70 | –0.95 | –1.15 | 0.29 | ||||||
ec | + | eo | 57 | –3.20 | –3.09 | –1.57 | –2.70 | –1.55 | –2.53 | 0.44 | ||||||
ec | + | eo | + | er | 63 | –3.39 | –3.35 | –1.82 | –2.95 | –1.79 | –2.67 | 0.40 | ||||
ec | + | eo | + | es | 68 | –3.36 | –3.50 | –2.53 | –3.09 | –2.44 | –2.69 | 0.43 | ||||
ec | + | eo | + | er | + | es | 69 | –3.39 | –3.52 | –2.81 | –3.09 | –2.81 | –2.71 | 0.40 | ||
e | + | ec | 0 | –0.40 | –1.33 | 0.54 | –0.46 | 0.44 | –0.49 | 0.35 | ||||||
e | + | ec | + | er | 0 | –0.44 | –1.29 | 0.35 | –0.50 | 0.24 | –0.49 | 0.32 | ||||
e | + | ec | + | eo | 19 | –2.11 | –2.08 | –0.86 | –1.26 | –0.89 | –0.79 | 0.50 | ||||
e | + | ec | + | eo | + | er | 44 | –2.82 | –2.81 | –1.20 | –2.22 | –1.25 | –2.13 | 0.48 | ||
e | + | ec | + | eo | + | es | 55 | –3.00 | –3.10 | –1.83 | –2.63 | –1.94 | –2.53 | 0.49 | ||
e | + | ec | + | eo | + | er | + | es | 61 | –3.24 | –3.31 | –2.25 | –2.82 | –2.34 | –2.61 | 0.46 |
a The orientational energies used above are calculated with l = l = k = 6, Ocutoff = 1792, = 0.2, and ccutoff = 0.025. | ||||||||||||||||
| b R is the correlation coefficient of rank order between the energies and RMSDs of decoys in a decoy set. | ||||||||||||||||
| Table V. The performance of scoring functions for each family of protein decoy sets. | ||||
| Decoy ID range, decoy family potentials | No. of tops /Total No. | Mean | Mean | Mean a |
| 1-7 4state_reduced: seven decoy sets | ||||
(e + ec + eo + es)b | 7/7 | –6.50 | –4.44 | 0.66 |
| Fain et al. (2002)c | 1/7 | –4.45 | –2.3 | 0.52 |
| Toby and Elber (2000)d | 3/6 | –5.42 | –3.14 | |
| Samudrala and Moult (1998)e | 6/7 | –6.06 | –2.67 | 0.67 |
| Onizuka et al. (2002)f | 7/7 | –6.50 | –3.41 | |
| Dominy and Brooks (2002)g | ~7/7 | ~–6.5 | –3.4 | 0.55 |
| 8–11 fisa: four decoy sets | ||||
(e + ec + eo + es)b | 2/4 | –4.04 | –2.55 | 0.26 |
| Toby and Elbner (2000)d | 2/3 | –3.34 | ||
| Onizuka et al. (2002)f | 1/3 | –1.38 | ||
| 12–16 fisa_casp3: five decoy sets | ||||
(e + ec + eo + es)b | 2/5 | –5.38 | –3.61 | 0.16 |
| Toby and Elber (2000)d | 1/3 | –3.94 | ||
| Onizuka et al. (2002)f | 1/3 | –2.01 | ||
| 17–45 hg_structal: 29 decoy sets | ||||
(e + ec + eo + es)b | 22/29 | –2.76 | –2.62 | 0.72 |
| Dominy and Brooks (2002)g | 19/29 | –2.0 | 0.69 | |
| 46–53 lattice_ssfit: eight decoy sets | ||||
(e + ec + eo + es)b | 8/8 | –7.60 | –11.12 | –0.01 |
| Fain et al. (2002)c | 8/8 | –7.60 | –6.84 | |
| Toby and Elber (2000)d | 4/6 | –6.89 | –4.10 | |
| Samudrala and Moult (1998)e | 8/8 | –7.60 | –6.46 | |
| Onizuka et al. (2002)f | 6/6 | –7.60 | –6.22 | |
| 54–63 lmds: ten decoy sets | ||||
(e + ec + eo + es)b | 8/10 | –4.89 | –5.34 | 0.14 |
| Fain et al. (2002)c | 3/9 | –4.55 | –2.83 | |
| Toby and Elber (2000)d | 4/7 | –5.32 | –3.27 | |
| Samudrala and Moult (1998)e | 3/9 | –3.04 | –0.58 | |
| Onizuka et al. (2002)f | 5/7 | –5.00 | –3.67 | |
| 64–73 lmds_v2: ten decoy sets | ||||
(e + ec + eo + es)b | 8/10 | –3.85 | –5.03 | 0.18 |
| Fain et al. (2002)c | 1/2 | –4.81 | –3.15 | |
| Samudrala and Moult (1998)e | 1/2 | –4.47 | –3.05 | |
| 74–79 semfold: six decoy sets | ||||
(e + ec + eo + es)b | 4/6 | –8.13 | –3.86 | 0.08 |
| 1–61 ig_structal: 61 dcoy sets | ||||
| (eo + er + es)b | 49/61 | –3.55 | –2.96 | 0.36 |
| 62–81 ig_structal_hires: 20 decoy sets | ||||
| (eo + er + es)b | 19/20 | –2.86 | –4.31 | 0.43 |
| a R is the correlation coefficient of rank order between the energies and RMSDs of decoys in a decoy set. | ||||
b The present model; the orientational energies were calculated with l = l = k = 6, Ocutoff = 1792, = 0.2, and ccutoff = 0.025. | ||||
| c Reference 25. | ||||
| d Reference 24. | ||||
| e Reference 13; taken from Ref. 25. | ||||
| f Reference 33; the distance-dependent angular potential named 3C326. | ||||
| g Reference 18; generalized Born, Coulomb, nonpolar solvation, and van der Waals energy terms are included. | ||||
bElectronic mail: jernigan@iastate.edu; URL: http://ribosome.bb.iastate.edu/
Up: Issue Table of Contents
Go to: Previous Article | Next Article
Other formats: HTML (smaller files) | PDF (245 kB)