pDOCK: a new technique for rapid and accurate docking of peptide ligands to Major Histocompatibility Complexes
- Proceedings
- Open Access
pDOCK: a new technique for rapid and accurate docking of peptide ligands to Major Histocompatibility Complexes
- Javed Mohammed Khan1 and
- Shoba Ranganathan1, 2Email author
https://doi.org/10.1186/1745-7580-6-S1-S2
© Ranganathan and Khan; licensee BioMed Central Ltd. 2010
- Published: 27 September 2010
Abstract
Background
Identification of antigenic peptide epitopes is an essential prerequisite in T cell-based molecular vaccine design. Computational (sequence-based and structure-based) methods are inexpensive and efficient compared to experimental approaches in screening numerous peptides against their cognate MHC alleles. In structure-based protocols, suited to alleles with limited epitope data, the first step is to identify high-binding peptides using docking techniques, which need improvement in speed and efficiency to be useful in large-scale screening studies. We present pDOCK: a new computational technique for rapid and accurate docking of flexible peptides to MHC receptors and primarily apply it on a non-redundant dataset of 186 pMHC (MHC-I and MHC-II) complexes with X-ray crystal structures.
Results
We have compared our docked structures with experimental crystallographic structures for the immunologically relevant nonameric core of the bound peptide for MHC-I and MHC-II complexes. Primary testing for re-docking of peptides into their respective MHC grooves generated 159 out of 186 peptides with Cα RMSD of less than 1.00 Å, with a mean of 0.56 Å. Amongst the 25 peptides used for single and variant template docking, the Cα RMSD values were below 1.00 Å for 23 peptides. Compared to our earlier docking methodology, pDOCK shows upto 2.5 fold improvement in the accuracy and is ~60% faster. Results of validation against previously published studies represent a seven-fold increase in pDOCK accuracy.
Conclusions
The limitations of our previous methodology have been addressed in the new docking protocol making it a rapid and accurate method to evaluate pMHC binding. pDOCK is a generic method and although benchmarks against experimental structures, it can be applied to alleles with no structural data using sequence information. Our outcomes establish the efficacy of our procedure to predict highly accurate peptide structures permitting conformational sampling of the peptide in MHC binding groove. Our results also support the applicability of pDOCK for in silicoidentification of promiscuous peptide epitopes that are relevant to higher proportions of human population with greater propensity to activate T cells making them key targets for the design of vaccines and immunotherapies.
Keywords
- Major Histocompatibility Complex
- Root Mean Square Deviation
- Major Histocompatibility Complex Allele
- Bind Site Residue
- Flexible Docking
Background
The molecular machinery by which an antigen presenting cell (APC) presents T cell epitopes for recognition by T cell receptors (TR) and subsequent activation of T cells followed by the immune response cascade is fascinating. T cell epitopes are short antigenic peptide sequences (p) that are bound to and presented by the major histocompatibility complexes (MHC) for recognition by the TR [1]. These epitopes are essential subunit peptides that are required in order to stimulate cellular immune responses, especially the adaptive immune responses. Peptide epitopes can be of endogenous (processed within the cell) or exogenous (processed outside the cell) origins, which are presented for surveillance and recognition by the TR in an MHC allele and supertype dependant manner. Broadly classified into two types, MHC class I (MHC-I) complexes bind and present endogenous peptides whereas MHC class II (MHC-II) complexes prefer exogenous peptides. Typically, MHC-I proteins are heterodimers, consisting of a heavy α chain (I-ALPHA) of about 45 kDa, and a light chain, β2-microglobulin (Β2M) of about 12 kDa [2, 3]. The α chain consists of α1 (G-ALPHA1), α2 (G-ALPHA2) and α3 (C-LIKE) domains where G-ALPHA1 and G-ALPHA2 domains form the peptide binding groove or ‘cleft’ [4]. MHC-II proteins are also heterodimeric proteins consisting of an α chain (II-APLHA; 34 kDa) and a β chain (II-BETA; 29 kDa) with very similar overall quaternary structure to that of MHC-I proteins [5, 6, 7, 8, 9, 10]. However, their peptide binding groove is formed by the α1 and β1 domains of the two chains.
Peptides presented by MHC-I are generally between 8-11 amino acids in length. These peptides are ‘chopped’ within the cytosol of the cell by cytosolic proteases and are transported to the MHC binding groove within the endoplasmic reticulum by the transporters associated with antigen processing (TAP) proteins in an ATP dependant manner. Following which, the peptides bind to the MHC to form the peptide-MHC (pMHC) complex which is then transported to the APC cell surface and presented for recognition by the TR of CD8+ cytotoxic T cells (CTLs). Similarly, the peptides presented by MHC-II are usually 12-25 amino acids in length and are endocytosed into the cell by the lysosomes where they bind the MHC-II proteins by displacing the original MHC-II ligand known as the ‘CLIP’ peptide to form the pMHC complex. And again, they are transported to the APC cell surface for recognition by the TR of the CD4+ T helper cells. Identification of true T cell epitopes from the repertoires of immunologically significant antigenic peptide sequences is a vital prerequisite in the process of conventional molecular vaccine design for prevention and treatment of infectious, autoimmune, allergic and graft vs. host diseases. The key step in TR-mediated immune response is thus the binding and presentation of antigenic endogenous or exogenous peptide epitopes, which can be reasonably well predicted using sequence-based methods for alleles with large datasets of known binding peptides, as reviewed earlier [11, 12].
Experimental identification of T cell epitopes is a tedious, time consuming and expensive process owing to the large number and diversity of both MHC alleles and the antigenic peptides. Not to mention, is the extremely low chance of immunogenicity (1 in 2000 peptides) even amongst the peptides that bind strongly to the MHC (50%) [13]. Recently developed computational methods have proven to be vastly time and cost efficient in screening the vast oceans of peptides and MHC repertoires [14]. Current computational methods can be broadly classified into: 1. Sequence-based approaches which use sequence motifs [15], matrix models [16, 17], Artificial Neural Network [18, 19, 20], Hidden Markov Model [21] and Support Vector Machine [22, 23, 24] for large-scale screening of potential T cell epitopes from protein sequence databanks and 2. Structure-based approaches such as protein threading [25, 26], homology modeling [27, 28], rigid docking [29] and flexible docking [2, 3] which utilize three-dimensional data for detailed structural analysis of interactions between the MHC and bound segmental antigenic peptides. The former are more suitable for large-scale screening of potential T cell epitopes, while the latter work better for detailed analysis of short immunogenic regions of antigens [2]. Although sequence-based methods are well established, a major limitation of such techniques is the heavy reliance on the availability of large comprehensive training sets of peptides. Thus, these approaches are not appropriate for accurate prediction of peptides in circumstances where the data available is insufficient. Therefore, the coverage of sequence-based techniques is limited to subsets of binding peptides that belong to the most numerous groups and cannot generate reliable data for peptides that are least represented in the dataset [2], leaving structural immunoinformatics as the only option for such peptides [3, 5, 6, 7].
Antigenic peptides that bind strongly to MHC alleles are known to elicit T cell responses [1, 2, 3, 5, 6, 7, 11]. Hence, their identification is a vital first step in the process of structure-based immune epitope prediction. The usual approach adopted to address this important issue is to utilize a powerful concept, based on the principle of structure-based drug design called “docking”, where peptides are computationally placed in MHC grooves in the best orientation, reflecting steric and electrostatic complimentarity, using structure-based docking techniques. The accuracy with which the peptides are docked is measured in terms of Root Mean Square Deviation (RMSD) values obtained by comparing the docked conformations of the peptides to their original bound conformations in the respective X-ray crystal structures. With the development of new structural modeling and docking techniques and an increase in the number of protein structures deposited in the Protein Data Bank (PDB) [30] and the IMGT/3Dstructure-DB [31, 32], structure-based approaches are being more commonly used to predict potential T cell epitopes [33], often producing modeled structures accurate to within 2.00Å RMSD from the experimental crystal structure, providing a wealth of information for structural analysis and the development of prediction methods.
The development of an accurate protocol for flexible docking has helped us to successfully carry out quantitative predictions for both MHC-I and MHC-II alleles even with limited binding peptide data [3, 5, 6, 7]. Our earlier docking protocol consisted of three steps (extended to four for pMHC-II complexes for incorporating the flanking residues on either side of the nonameric core, which is the 9-mer anchored to the MHC molecule): (1) rigid docking of the peptide nonamer termini into the MHC binding groove; (2) loop closure of central residues by satisfaction of spatial constraints; (3) followed by iterative ab initio refinements of ligand backbone and; (4) extension of flanking peptide residues by satisfaction of spatial constraints [2, 3] (only for MHC-II related peptides). While accurate, this approach has multiple steps, resulting in suboptimal computational speeds. Therefore, the efficiency of this protocol for peptide docking to MHC needs to be improved for large-scale screening of T cell epitopes. A grid-based docking methodology has earlier been reported [34] to be highly accurate in pMHC docking over a limited MHC-I data. Hence, we have developed a grid-based peptide docking method (pDOCK) and have extensively tested it on both MHC-I and MHC-II peptides. The motivation behind the development of a faster and more accurate peptide docking methodology was to eventually improve the qualitative and quantitative efficacy of structure-based T cell epitope prediction.
The first experiment that we conducted was to ensure that an extended peptide bound to its cognate MHC receptor preferentially selecting the same nonameric core peptides as in the crystal structure and then to evaluate the accuracy of the docked peptide. Hence, we performed re-docking of 186 peptides back to their cognate MHC receptors to check for conformational accuracy of the predicted binding registers and their Cα RMSD against their respective crystal structures. We have then benchmarked pDOCK with our earlier docking protocol [2, 3] for a dataset of 50 selected (35 MHC-I and 15 MHC-II) pMHC complexes to verify the speed and accuracy of pDOCK against our earlier method. This was followed by validation and accuracy checks for pDOCK against available flexible peptide docking results obtained from the literature for a dataset of 15 peptides.
In the process of selecting immunogenic peptides for vaccine design, the two main aspects are to determine: (1) multiple peptides that bind to the same allele or MHC molecule and; (2) promiscuous or same peptides that bind multiple alleles. Therefore, as a secondary experiment, we have pursued to test the efficacy and robustness of our docking protocol in modeling the bound conformations of novel peptides to specific MHC alleles by carrying out docking of multiple peptides to a single MHC template structure (same MHC allele), suitable for immune epitope prediction from an antigenic protein, using a moving window of 9-mers along the entire sequence [3, 5, 6, 7]. Our third experiment was to dock a single peptide from particular PDB structures onto multiple MHC templates (multiple alleles) from other crystal structures, suitable for determining promiscuous peptides capable of binding to a set of related alleles and therefore, important for vaccine design.
The Cα RMSD values have been calculated only for the nonameric core of the peptide (for both MHC-I and MHC-II related peptides) which is a contiguous immunogenic segment that forms the “binding register” within the MHC peptide binding cleft, as reported earlier by our group [3]. For the peptides with nine and less number of amino acid residues the entire peptide was used for Cα RMSD calculation. pDOCK accurately detected all 186 binding registers, i.e., the nonameric cores of the peptides are identical to their respective crystal structures. pDOCK generated 85.5% of all the peptides with Cα RMSD of less than 1.00 Å compared to their respective X-ray crystal structures. Our benchmarking results imply up to 2.5 fold improvement in the accuracy of the new peptide docking methodology. The validation results represent a sevenfold improvement in the accuracy of our technique compared to that of the existing methodologies in flexible docking and modeling of peptides into MHC grooves. Amongst the 21 peptides docked in the second experiment, the Cα RMSD values for docked peptides compared to their respective crystal structures were below 1.00 Å for 20 peptides (details in Results and discussion section). The third experiment accounted for all 4 peptides docked with less than 1.00 Å Cα RMSD compared to the same peptides from the corresponding template crystal structures (details in Results and discussion section). Overall, pDOCK is up to 60% faster than our earlier protocol and hence provides a rapid and accurate docking method to evaluate pMHC binding for large scale immune-epitope prediction.
Results and discussion
The fact that our earlier method was comparatively slower and that it involved rigid-docking of the peptide termini, acted as the platform for us to ‘revisit’ our pMHC docking methodology. Based on these requirements, we have developed a single step pMHC docking protocol (details in Methods section) as shown in Figure 1, which allows flexibility over the entire length of the peptide antigen and can be used as a generic method to obtain the conformations of bound peptide ligands to MHC binding grooves of both class I and class II MHC proteins. A systematic evaluation of pDOCK is performed as three separate tests: (1) exhaustive re-docking of all non-redundant peptides to their respective MHC grooves as a test case, benchmarking and validation; we then address two very significant practical problems faced by immunologists during the process of allele-specific peptide vaccine design: (2) the docking of multiple peptides that bind to same MHC allele, for immunogenic epitope scanning of antigenic sequences and; (3) docking of promiscuous peptides or same peptides binding to multiple MHC alleles for vaccine design, based on groups of disease-implicated alleles. A correctly docked structure is defined as the peptide with at most 2.50 Å Cα RMSD from the respective experimental X-ray crystal structure [2]. pDOCK has also been benchmarked against our previous docking protocol and validated on published peptide modeling and docking results from the literature. Bordner and Abagyan [34] suggested that while grid-based docking could be applied for pMHC-II, it was a more difficult problem. pDOCK has been successfully applied for MHC-II peptide docking as well with excellent results.
Experiment 1
Re-docking bound peptides to their cognate MHC grooves
Although water molecules and other common biological ions such as phosphate and chloride may mediate pMHC interactions in some cases, they were omitted from our experiments because the significance and contributions of these molecules towards pMHC binding vary immensely between different peptides and specific alleles over a large dataset like the one used in this study (186 complexes). Our previous protocol achieves a Cα RMSD of 1.53 Å for the bound structure of the peptide from pMHC complex 1jf1, due to the presence of a water molecule positioned around the peptide residues 5 to 7 in the crystal structure leading to erroneous prediction of the loop formed, which resulted in incorrect positioning of interacting residues [2]. However, pDOCK successfully overcomes this restriction to accurately predict the least energy bound conformation of this peptide with a Cα RMSD value of 0.30 Å. The enhancement in accuracy of docking is a direct consequence of the improved sampling of available conformational space in pDOCK. This preliminary experiment is a critical first step as it establishes the validity of our approach and helps us test the ability of our technique to accurately dock cognate peptides into their respective MHC receptors, using the proposed single-step docking procedure.
Benchmarking with our previous methodology
Benchmarking pDOCK with our earlier methodology.
S. No. |
Allele |
PDB |
Peptide Length |
Peptide Sequence |
Cα RMSD (Å) |
|
---|---|---|---|---|---|---|
Previous method |
pDOCK |
|||||
MHC-I |
||||||
1 |
HLA-A*1101 |
1qvo |
10 |
QVPLRPMTYK |
0.53 |
0.24 |
2 |
HLA-A*0201 |
1qr1 |
9 |
IISAVVGIL |
0.46 |
0.29 |
3 |
HLA-A*0201 |
1akj |
9 |
ILKEPVHGV |
0.87 |
0.39 |
4 |
HLA-A*0201 |
1i1y |
9 |
YLKEPVHGV |
0.70 |
0.66 |
5 |
HLA-A*0201 |
1i7r |
9 |
FAPGFFPYL |
0.59 |
0.47 |
6 |
HLA-A*0201 |
1i7u |
9 |
ALWGFVPVL |
0.32 |
0.29 |
7 |
HLA-A*0201 |
1oga |
9 |
GILGFVFTL |
0.32 |
0.16 |
8 |
HLA-A*0201 |
1qsf |
9 |
LLFGYPVAV |
0.54 |
0.34 |
9 |
HLA-A*0201 |
1lp9 |
9 |
ALWGFFPVL |
0.58 |
0.26 |
10 |
HLA-A*0201 |
1s9y |
9 |
SLLMWITQS |
1.09 |
0.39 |
11 |
HLA-A*0201 |
1hhh |
10 |
FLPSDFFPSV |
1.10 |
0.49 |
12 |
HLA-A*0201 |
1jf1 |
10 |
ELAGIGILTV |
1.53 |
0.30 |
13 |
HLA-B*0801 |
1agc |
8 |
GGKKKYQL |
0.28 |
0.23 |
14 |
HLA-B*0801 |
1mi5 |
9 |
FLRGRAYGL |
0.42 |
0.37 |
15 |
HLA-B*2705 |
1ogt |
9 |
RRKWRRWHL |
0.51 |
0.18 |
16 |
HLA-B*2705 |
2a83 |
9 |
RRRWHRWRL |
0.55 |
0.18 |
17 |
HLA-B*3501 |
2cik |
9 |
KPIVVLHGY |
0.74 |
0.26 |
18 |
HLA-B*3508 |
3bwa |
8 |
FPTKDVAL |
0.56 |
0.26 |
19 |
HLA-B*5101 |
1e27 |
9 |
LPPVVAKEI |
1.27 |
0.18 |
20 |
HLA-B*5301 |
1a1m |
9 |
TPYDINQML |
0.59 |
0.28 |
21 |
HLA-Cw*0401 |
1im9 |
9 |
QYDDAVYKL |
0.49 |
0.34 |
22 |
HLA-G*0101 |
2dyp |
9 |
RIIPRHLQL |
0.43 |
0.16 |
23 |
H2-Db |
1fg2 |
9 |
KAVYNFATC |
0.25 |
0.19 |
24 |
H2-Db |
3buy |
9 |
LSLRNPILV |
0.63 |
0.23 |
25 |
H2-Db |
1yn7 |
10 |
SSLENFAAYV |
0.62 |
0.14 |
26 |
H2-Db |
1jpf |
11 |
SGVENPGGYCL |
1.14 |
0.36 |
27 |
H2-Dd |
1qo3 |
10 |
RGPGRAFVTI |
1.49 |
0.17 |
28 |
H2-Kb |
1t0m |
8 |
SSIEFARL |
0.66 |
0.21 |
29 |
H2-Kb |
1vac |
8 |
SIINFEKL |
0.32 |
0.22 |
30 |
H2-Kb |
1wbz |
9 |
SSYRRPVGI |
0.89 |
0.19 |
31 |
H2-Kb |
1s7q |
9 |
KAVYNFATM |
0.20 |
0.09 |
32 |
H2-Kb |
1g7p |
9 |
SRDHSRTPM |
0.97 |
0.17 |
33 |
H2-Kd |
1vgk |
9 |
SYVNTNMGL |
0.86 |
0.25 |
34 |
H2-Kk |
1zt1 |
8 |
FEANGNLI |
0.57 |
0.45 |
35 |
H2-Ld |
2e7l |
9 |
QLSPFPFDL |
0.37 |
0.35 |
MHC-II |
||||||
36 |
HLA-DQB1*0602 |
1uvq |
20 |
MNLPSTKVSWAAVGGGGSLV |
1.09 |
0.23 |
37 |
HLA-DRB1*0301 |
1a6a |
15 |
PVSKMRMATPLLMQA |
0.38 |
0.30 |
38 |
HLA-DRB1*0101 |
1aqd |
14 |
GSDWRFLRGYHQYA |
1.08 |
0.28 |
39 |
HLA-DRB1*0101 |
1fyt |
13 |
PKYVKQNTLKLAT |
0.68 |
0.23 |
40 |
HLA-DRB1*0101 |
2iam |
15 |
GELIGILNAAKVPAD |
0.56 |
0.24 |
41 |
HLA-DRB1*0401 |
1d5x |
6 |
XXRXXX |
0.23 |
0.11 |
42 |
HLA-DRB1*0401 |
1d5z |
7 |
XXRAXSX |
0.33 |
0.22 |
43 |
HLA-DRB1*0401 |
1d6e |
8 |
XXRXMASX |
0.32 |
0.14 |
44 |
HLA-DRB1*0401 |
1j8h |
13 |
PKYVKQNTLKLAT |
0.59 |
0.20 |
45 |
HLA-DRB3*0101 |
2q6w |
11 |
AWRSDEALPLG |
0.54 |
0.30 |
46 |
HLA-DRB5*0101 |
1fv1 |
20 |
NPVVHFFKNIVTPRTPPPSQ |
0.88 |
0.59 |
47 |
I-Ad |
1iao |
14 |
RGISQAVHAAHAEI |
0.81 |
0.27 |
48 |
I-Ak |
1iak |
13 |
STDYGILQINSRW |
0.42 |
0.23 |
49 |
I-Au |
2pxy |
11 |
RGGASQYRPSQ |
0.78 |
0.28 |
50 |
I-Ek |
1r5v |
13 |
ADLIAYPKAATKF |
0.82 |
0.28 |
These results suggest that some of the conformational limitations of our previous methodology, such as the presence of water molecules in and around the peptide and within the peptide binding groove in the original PDB structure, have been addressed in our new docking protocol making it highly accurate. Besides an improvement in the accuracy, pDOCK is also able to accurately model docked conformations for some peptides especially for MHC-II related peptides with more than 9 amino acid residues, thereby improving the coverage over the entire length of the peptides. Peptides from the pMHC complexes 1uvq and 2iam were among the highest coverage (20 and 15 residues respectively) obtained in this experiment with Cα RMSD values 0.42 Å and 0.46 Å respectively over the length of the entire peptide (results not shown). The reliability for the accurate prediction of flanking residues (especially for MHC-II peptides) depends upon their interactions with the MHC residues outside the peptide binding groove and therefore, have not been included in the calculation of Cα RMSD values reported.
Comparison of computational time of pDOCK with our earlier docking method
Previous method |
pDOCK |
---|---|
Step 1: ~ 5 min |
|
Step 2: ~ 30 s |
Preparatory Step 1: ~ 3*s |
Step 3: ~ 18 min |
Preparatory Step 2: ~ 42.6 s |
Step 4#: ~ 1 min |
Single docking and refinement step: ~ 9.24 min |
Total: ~ 23.50 – 24.50 min |
Total: ~ 10 min |
Validation against previously published studies
Comparison of pDOCK with published MHC–peptide modeling and flexible docking methods.
S.No |
Technique |
Peptide Sequence |
MHC class |
PDB |
RMSD (Å) |
||
---|---|---|---|---|---|---|---|
Published |
Previousmethod |
pDOCK |
|||||
1 |
Grid-based Flexible docking [34] |
RGYVYQGL |
I |
1kpu# |
0.76 |
0.59 |
0.31 |
2 |
Grid-based Flexible docking [34] |
ALWGFVPVL |
I |
1i7u |
0.23 |
0.32 |
0.29 |
3 |
Grid-based Flexible docking [34] |
ELAGIGILTV |
I |
1jf1 |
0.22 |
1.53 |
0.30 |
4 |
Monte Carlo annealing [37] |
LLFGYPVYV |
I |
1duz# |
3.01 |
0.33 |
0.33 |
5 |
Simulated annealing [38] |
FLPSDFFPSV |
I |
1hhh |
1.59 |
1.10 |
0.48 |
6 |
Simulated annealing [38] |
GILGFVFTL |
I |
1hhi# |
0.46 |
0.32 |
0.16 |
7 |
Simulated annealing [38] |
ILKEPVHGV |
I |
1hhj# |
0.87 |
0.87 |
0.55 |
8 |
Simulated annealing [38] |
LLFGYPVYV |
I |
1hhk# |
0.78 |
0.33 |
0.33 |
9 |
Combinatorial buildup algorithm [39] |
RGYVYQGL |
I |
2vaa# |
0.56 |
0.32 |
0.22 |
10 |
Combinatorial buildup algorithm [40] |
LLFGYPVYV |
I |
1hhk# |
1.40 |
0.33 |
0.33 |
11 |
Combinatorial buildup algorithm [40] |
ILKEPVHGV |
I |
1hhj# |
1.30 |
0.87 |
0.55 |
12 |
Combinatorial buildup algorithm [40] |
GILGFVFTL |
I |
1hhi# |
1.60 |
0.32 |
0.16 |
13 |
Multiple copy algorithm [41] |
FAPGNYPAL |
I |
2vab# |
2.70 |
0.40 |
0.25 |
14 |
Multiple copy algorithm [42] |
GILGFVFTL |
I |
1hhi# |
1.40 |
0.32 |
0.16 |
15 |
GOLD/GLIDE Flexible docking [36] |
XXRXMASX |
II |
1d6e |
1.24/3.06 |
0.32 |
0.14 |
Experiment 2
Docking of multiple peptides onto a single template
Docking novel peptides onto a single template: pDOCK compared to our previous method.
MHC class |
Peptide PDB Allele |
Peptide PDB |
MHC Template Structure |
Template Allele |
Peptide Length |
Peptide Sequence |
Cα RMSD (Å) |
|
---|---|---|---|---|---|---|---|---|
Previous method |
pDOCK |
|||||||
I |
HLA-A*0201 |
2v2w |
1qrn |
HLA-A*0201 |
9 |
SLYNTVATL |
0.63 |
0.38 |
I |
HLA-A*0201 |
1hhh |
1qrn |
HLA-A*0201 |
10 |
FLPSDFFPSV |
0.58 |
0.25 |
I |
HLA-A*0201 |
1qse |
1qrn |
HLA-A*0201 |
9 |
LLFGYPRYV |
0.62 |
0.30 |
I |
HLA-A*0201 |
2bnq |
1qrn |
HLA-A*0201 |
9 |
SLLMWITQV |
0.97 |
0.77 |
I |
HLA-A*0201 |
2gj6 |
1qrn |
HLA-A*0201 |
9 |
LLFGKPVYV |
0.56 |
0.24 |
I |
HLA-A*0201 |
1qr1 |
1qrn |
HLA-A*0201 |
9 |
IISAVVGIL |
0.87 |
0.36 |
I |
HLA-A*0201 |
1qsf |
1qrn |
HLA-A*0201 |
9 |
LLFGYPVAV |
0.94 |
0.41 |
I |
HLA-A*0201 |
1bd2 |
1qrn |
HLA-A*0201 |
9 |
LLFGYPVYV |
0.68 |
0.46 |
I |
HLA-A*0201 |
1hhg |
1i4f |
HLA-A*0201 |
9 |
TLTSCNTSV |
0.58 |
0.56 |
I |
HLA-A*0201 |
1hhh |
1i4f |
HLA-A*0201 |
10 |
FLPSDFFPSV |
1.48 |
0.57 |
I |
H2-Kb |
1osz |
1nam |
H2-Kb |
8 |
RGYLYQGL |
0.85 |
0.47 |
I |
H2-Kb |
1fo0 |
1nam |
H2-Kb |
8 |
INFDFNTI |
0.62 |
0.35 |
I |
H2-Kb |
1g6r |
1nam |
H2-Kb |
8 |
SIYRYYGL |
0.66 |
0.11 |
I |
H2-Kb |
1kbg |
1nam |
H2-Kb |
8 |
RGYVYXGL |
0.40 |
0.06 |
I |
H2-Kb |
1g7p |
1nam |
H2-Kb |
9 |
SRDHSRTPM |
1.41 |
0.82 |
II |
HLA-DRB1*0101 |
1fyt |
2iam |
HLA-DRB1*0101 |
13 |
PKYVKQNTLKLAT |
0.69 |
0.35 |
II |
HLA-DRB1*0101 |
1klu |
2iam |
HLA-DRB1*0101 |
15 |
GELIGTLNAAKVPAD |
0.85 |
0.59 |
II |
HLA-DRB1*0101 |
1t5w |
2iam |
HLA-DRB1*0101 |
13 |
AAYSDQATPLLLS |
0.99 |
0.65 |
II |
HLA-DRB1*0101 |
1pyw |
2iam |
HLA-DRB1*0101 |
9 |
FVKQNAXAL |
0.40 |
0.32 |
II |
HLA-DRB1*0101 |
1sje |
2iam |
HLA-DRB1*0101 |
15 |
PEVIPMFSALSEGAT |
0.70 |
0.37 |
II |
HLA-DRB1*0101 |
1aqd |
2iam |
HLA-DRB1*0101 |
14 |
GSDWRFLRGYHQYA |
1.68 |
1.01 |
Experiment 3
Docking of same peptides onto variant templates
Docking promiscuous peptides onto variant templates: comparison of pDOCK with our previous method.
MHC class |
Peptide PDB Allele |
Peptide PDB |
MHC Template Structure |
Template Allele |
Peptide Length |
Peptide Sequence |
Cα RMSD compared to template peptides (Å) |
|
---|---|---|---|---|---|---|---|---|
Previous method |
pDOCK |
|||||||
I |
HLA-B*3501 |
1zhk# |
1zhl# |
HLA-B*3508 |
13 |
LPEPLPQGQLTAY |
0.62 |
0.44 |
I |
HLA-B*3501 |
1zsd# |
2ak4 |
HLA-B*3508 |
11 |
EPLPQGQLTAY |
1.15 |
0.79 |
I |
H2-Kb |
2vaa# |
1ce6 |
H2-Db |
9 |
FAPGNYPAL |
0.73 |
0.21 |
II |
HLA-DRB1*1501 |
1bx2# |
1fv1 |
HLA-DRB5*0101 |
14 |
ENPVVHFFKNIVTP |
1.01 |
0.22 |
In all, only one peptide generated using pDOCK from the single template docking experiments has the Cα RMSD value above 1.00 Å (Table 4) compared to 5 peptides (three from single template docking and two from variant template docking) with Cα RMSD values above 1.00 Å using our previous methodology (Table 4 and Table 5). It is thus clear that pDOCK accurately predicts the structure of cognate peptides in both single and variant template docking cases. These evaluation steps are also vital to establish the efficiency with which our new method can dock and subsequently predict novel peptides onto given MHC proteins.
Conclusions
We have developed pDOCK as a fast, accurate and robust method for flexible docking of peptides to MHC-I and MHC-II proteins. Our results provide evidence of overcoming limitations pertaining to the application of our previous methodology, such as the presence of water molecules in and around the peptide and within the peptide binding groove in the template and relatively longer computational time required. Benchmarking with our previous method for a dataset of 50 non-redundant pMHC complexes consistently produced least energy docked conformations of peptides below 1.00 Å Cα RMSD from their respective native orientations for all 50 peptides. The Cα RMSD range for the same dataset was 0.09 Å (1s7q) to 0.66 Å (1i1y) using pDOCK compared to a Cα RMSD range from 0.20 Å (1s7q) to 1.53 Å (1jf1) applying our previous protocol. These observations imply an improvement in the accuracy by upto two and a half folds compared to our previous protocol. The outcomes of our validation experiments suggest a seven-fold improvement in the accuracy of the pDOCK docking protocol. pDOCK can therefore be successfully applied as a generalized, efficient protocol for docking of peptides to MHC-I and MHC-II receptors with improved accuracy, greater coverage of peptide residues and vastly reduced computational time (up to 60% compared to our earlier method).
The average time taken to perform each step using pDOCK has also improved drastically compared to our old technique on a 2 CPU 3.20 GHz 3 GB RAM workstation. This is mainly due to the consolidation of the docking and refinement protocols into a single step docking and refinement procedure. Our results establish the efficacy of pDOCK to model highly accurate pMHC complex structures permitting conformational sampling of the peptide in MHC binding groove. The current study thus presents one of the most accurate pMHC docking protocols developed to date. pDOCK targets a more generic approach to generation of docked conformations of peptides using a single template for each allele. For some pMHC complexes however, appropriate addition of mediating molecules or considerations of solvent effects may lead to a possible improvement in docking accuracy. Rapid and large scale docking and scanning for identification of potential candidates for immunogenicity from repertoires of immunologically significant antigenic peptide sequences is possible by automating all the steps. No requirement for experimental data to be trained and the need of only a suitable template for a particular allele give pDOCK a prominent edge over other sequence-based techniques such as Artificial Neural Networks, Support Vector Machines, and Hidden Markov Models.
pDOCK is also highly efficient in accurately predicting the docked conformations of amino acid analogues or chemical components within the peptide ligand suggesting its possible use as a docking and evaluation tool in structure-based drug design protocols and chemoinformatics. The single and variant template docking experiments along with the validation experiments also serve as strong benchmarks for pDOCK against our old method. pDOCK can correctly predict the conformation of residues that extend into the MHC binding cleft and therefore could help identify essential contacts with the MHC receptor, responsible for reducing the half life of the pMHC complex such that the peptide is held long enough within the MHC groove for presentation at the APC cell surface leading to surveillance and recognition by the TR molecules which in turn results in the activation of T cells and triggers the adaptive immune response cascade. Another significant improvement in this study is that the peptide ligand is allowed full flexibility within the peptide biding groove of the MHC proteins, unlike our previous method where the peptide termini were docked rigidly to the MHC groove. This aspect of pDOCK has helped us carry out fully flexible peptide docking to the MHC proteins. Our results also indicate the successful application of this protocol for easy in silico identification of promiscuous peptide epitopes that are applicable to higher proportions of human population with greater propensity to bind to MHC proteins and consequently activate T cells making them key targets for the design of vaccines and immunotherapies.
Methods
Data
pDOCK was tested on a non-redundant dataset of 186 (149 MHC-I and 37 MHC-II) pMHC complexes from the MPID-T2 (http://biolinfo.org/mpid-t2) database for which X-ray crystal structures are available in the PDB and the IMGT/3Dstructure-DB. When there is more than one complex with the same bound peptide and the same allele, the structure with the highest resolution is selected to avoid redundancy. When more than one bound peptide is available in the selected crystal structure, all bound peptides in that crystal structure are analyzed. TR/pMHC structures in MPID-T2 database are treated as non-redundant entries unless they have the same peptide, allele and TR type. In which case, the structure with the best resolution is considered non-redundant. Similarly, a dataset of 25 (18 MHC-I and 7 MHC-II) pMHC complexes was selected from the pDOCK test set for single and variant template docking. When more than one allele is available as template for docking of peptides into a single or variant template, the allele with the highest resolution was selected. Redundancy in MPID-T2 data is primarily decided from the similarities in peptides, MHC alleles and TR types (in case of TR/pMHC structures). Since one publication can refer to crystal structures of many complexes, redundancy in the literature is not considered as a criterion for redundancy. Some redundant structures were used for variant template docking (Table 5) due to limited number of crystal structures with promiscuous peptides bound to different alleles in the PDB. Although the MPID-T2 database contains 294 pMHC complexes (273 classical and 21 non classical), the 21 non-classical and 87 redundant structures were discarded from this study in order to avoid any biasness in our results.
pMHC complexes for benchmarking and validation
A non-redundant dataset of 50 high quality (35 MHC-I and 15 MHC-II) pMHC complexes, with maximum 3.00 Å resolutions, was selected from the 186 pMHC complexes in the pDOCK test set for benchmarking with the previous methodology. 15 pMHC complexes were chosen for validation experiments depending on the ones used in the corresponding reference studies [34, 36, 37, 38, 39, 40, 41, 42].
The pDOCK protocol
Unlike our earlier method [2, 3], the new technique incorporates flexibility into the entire length of the peptide ligand. We have now incorporated a receptor modeling sub-step at the beginning of our novel schema (Figure 1), which involves rigorous homology modeling of MHC proteins from available MHC sequences by satisfaction of spatial restraints using MODELLER [43] followed by structure optimization and stringent structural quality assessment protocols to affirm the generation of high quality homology models of MHC proteins to be subsequently used in the pMHC docking strategy. Thereby, accounting for the validity of our methodology even in the absence of experimental structures for the MHC proteins and when only MHC sequences are available. However, this sub-step was not used in the current study as testing, benchmarking, validation, single template and variant template docking experiments are performed only on X-ray crystal structures of pMHC complexes.
The current pMHC docking technique is applied on MHC-I and MHC-II related peptides in two preparatory steps and a single consolidated docking and receptor step as follows: Preparatory step 1: receptor positioning using the Internal Coordinate Mechanics (ICM) global optimization algorithm [35]; Preparatory step 2: determining the docking grid using standardized values for MHC supertypes (MHC-I and MHC-II) from our preliminary studies and; A single docking and refinement step involving: ligand positioning, grid ligand docking followed by iterative ab initio refinements of backbone and ligand interacting side-chain dihedral angles of the MHC binding site residues to eliminate or minimize atomic clash regions at the pMHC interface using a Biased Monte Carlo procedure. The preparatory steps were together used to generate the receptor maps and the final single docking and refinement step was used to carry out ligand docking, generate the final least energy conformation and further refine the product.
Preparatory steps
Receptor modeling and positioning
Positioning of the MHC receptor is a major requirement in the pMHC docking simulation to ensure a best fit of the flexible peptide in the MHC groove. This first preparatory step (receptor modeling and positioning) is the least time consuming (only applicable to sub-step ‘b’) in the pDOCK docking protocol and involves two vital sub-steps: (a) homology model building by satisfaction of spatial restraints for MHC sequences where no structural data is available or inserting the MHC crystal structure into the docking simulation and; (b) positioning of the receptor within the docking simulation. Although not used in this study, high quality homology models can be generated, using our previously described three-step homology modeling procedure [44], for alleles with no structural data. Receptor positioning using the ICM global optimization algorithm assures the addition of any important missing residues in the template besides optimizing the zero occupancy side chains and any polar hydrogen atoms.
Determining the docking grid
Single step docking and refinement
Ligand positioning, grid ligand docking and refinement
As with receptor positioning, ligand positioning is also equally important in achieving the best docked conformations, with the lowest energy values for flexible peptides using pDOCK. The final, most exhaustive (in terms of computational time required compared to the other two steps) single step docking and refinement part deals with ligand positioning, grid ligand docking and refinement, comprising three very important sub-steps: (a) positioning of the peptide ligand either by using the original crystal structure or by inserting a peptide model into the docking simulation using the peptide sequence; (b) placing and positioning of the probe into the peptide binding groove using the Internal Coordinate Mechanics global optimization algorithm and; (c) flexible docking of the peptide into the MHC groove and refinement of all ligand and binding site residues using a Biased Monte Carlo procedure. Ligand positioning was carried out either by using ICM algorithm for existing peptides within the X-ray crystal structures of pMHC complexes or by manually inserting a peptide model into the docking simulation for each of the available peptide sequences (docking of novel peptides to a single template and docking of promiscuous peptides to variant templates). This was followed by placing a probe (red in Figure 6) in the MHC groove which provides an initial position for conformational sampling and docking simulations using the ICM algorithm.
ICM docking algorithm [35] runs flexible docking of peptide ligands to MHC peptide binding clefts. During the docking simulation, the ligand side-chain torsions that have been previously stored within the grid receptor maps (preparatory step 2) are changed in each random step using a Biased Monte Carlo procedure, which begins by pseudo-randomly selecting a set of torsion angles in the ligand and consequently finding the local energy minima about those angles. Upon satisfaction of the Metropolis criteria, novel conformations are adopted with a probability min (1, exp[−ΔG/RT]), where R is the universal gas constant and T is the absolute temperature of the simulation. The temperature was set to 300 K for the current study. To keep the ligand molecule close to the starting conformation, loose restraints are imposed on its positional variables. The internal energy function adopted for our simulations integrates internal van der Waals interactions energy (calculated using an extension of ECEPP/3 with force field parameters) [45], hydrogen bonding energy, torsion energy, electrostatic energy with a distance-dependent dielectric constant (ε = 4r; where ε is the distance-dependent dielectric constant and r is the distance) [46] and hydrophobic potential between the atoms of peptide residues and atoms of the binding site residues. The final energy incorporates configurational entropy of side chains and the surface-based solvation energy to select the best-iterated orientations. In brief, the complete optimal energy function, E,is made up of the internal energy of the ligand and the intermolecular energy of the optimized receptor potential maps and can be summarized as:
E = Evw + Een + 2.16 + 2.53 Ehb + 4.35 Ehp + 0.20 Esolv
where Evw is the internal van der Waals interaction energy, Een is the configurational/conformational entropy, is the electrostatic energy of solvation, Ehb is the hydrogen bonding energy, Ehp is the hydrophobic potential and Esolv is the surface-based solvation energy.
Finally, to improve the accuracy of the initial predicted conformation, refinement of the ligand as well as binding site residues backbone and side chains was performed as described in our previous methodology [2, 3] to overcome any atomic clashes detected at the pMHC binding interface, using ICM Biased Monte Carlo procedure. Again, restraints are imposed upon the positional variables of the Cα atoms of the peptide residues. The early stages of the refinement efforts try to trounce the consequences of docking fully flexible ligands to rigid receptors by introducing partial flexibility to the backbone of MHC peptide binding residues. Refinements of ligand and receptor side-chain torsions in the vicinity of 4.00 Å from the receptor were executed upon the final backbone structure of the peptides to keep the docked peptides closest to their starting conformations. The energy function, E, utilized for this refinement sub-step, is the sum of energy terms arising from the optimal energy function described above:
E = Evw + Ehbonds + Etors + Eelec + Esolv + Een
where Etors is the torsion energy, Eelec is the electrostatic energy and Een is the entropic term.
Declarations
Acknowledgements
JMK gratefully acknowledges the award of a Macquarie University Research Excellence Scholarship and a Macquarie University Postgraduate Research Fund. We also thank Dr. J.C. Tong, Institute for Infocomm Research, Singapore for useful discussions on grid docking. Open access publication charges were borne by Macquarie University.
This article has been published as part of Immunome Research Volume 6 Supplement 1, 2010: Ninth International Conference on Bioinformatics (InCoB2010): Immunome Research. The full contents of the supplement are available online at http://www.immunome-research.com/supplements/6/S1.
Authors’ Affiliations
References
- Sidney J, Assarsson E, Moore C, Ngo S, Pinilla C, Sette A, Peters B: Quantitative peptide binding motifs for 19 human and mouse MHC class I molecules derived using positional scanning combinatorial peptide libraries. Immunome Res. 2008, 4: 2-10.1186/1745-7580-4-2.PubMed CentralView ArticlePubMedGoogle Scholar
- Tong JC, Tan TW, Ranganathan S: Modeling the structure of bound peptide ligands to major histocompatibility complex. Protein Sci. 2004, 13 (9): 2523-2532. 10.1110/ps.04631204.PubMed CentralView ArticlePubMedGoogle Scholar
- Tong JC, Zhang GL, Tan TW, August JT, Brusic V, Ranganathan S: Prediction of HLA-DQ3.2beta ligands: evidence of multiple registers in class II binding peptides. Bioinformatics. 2006, 22 (10): 1232-1238. 10.1093/bioinformatics/btl071.View ArticlePubMedGoogle Scholar
- Lefranc MP, Duprat E, Kaas Q, Tranne M, Thiriot A, Lefranc G: IMGT unique numbering for MHC groove G-DOMAIN and MHC superfamily (MhcSF) G-LIKE-DOMAIN. Dev Comp Immunol. 2005, 29 (11): 917-938. 10.1016/j.dci.2005.03.003.View ArticlePubMedGoogle Scholar
- Tong JC, Bramson J, Kanduc D, Chow S, Sinha AA, Ranganathan S: Modeling the bound conformation of Pemphigus vulgaris-associated peptides to MHC class II DR and DQ alleles. Immunome Res. 2006, 2: 1-10.1186/1745-7580-2-1.PubMed CentralView ArticlePubMedGoogle Scholar
- Tong JC, Tan TW, Sinha AA, Ranganathan S: Prediction of desmoglein-3 peptides reveals multiple shared T-cell epitopes in HLA DR4- and DR6-associated Pemphigus vulgaris. BMC Bioinformatics. 2006, 7 (Suppl 5): S7-10.1186/1471-2105-7-S5-S7.PubMed CentralView ArticlePubMedGoogle Scholar
- Tong JC, Zhang ZH, August JT, Brusic V, Tan TW, Ranganathan S: In silico characterization of immunogenic epitopes presented by HLA-Cw*0401. Immunome Res. 2007, 3: 7-10.1186/1745-7580-3-7.PubMed CentralView ArticlePubMedGoogle Scholar
- Brown JH, Jardetzky TS, Gorga JC, Stern LJ, Urban RG, Strominger JL, Wiley DC: Three-dimensional structure of the human class II histocompatibility antigen HLA-DR1. Nature. 1993, 364 (6432): 33-39. 10.1038/364033a0.View ArticlePubMedGoogle Scholar
- Stern LJ, Brown JH, Jardetzky TS, Gorga JC, Urban RG, Strominger JL, Wiley DC: Crystal structure of the human class II MHC protein HLA-DR1 complexed with an influenza virus peptide. Nature. 1994, 368 (6468): 215-221. 10.1038/368215a0.View ArticlePubMedGoogle Scholar
- Stern LJ, Wiley DC: Antigenic peptide binding by class I and class II histocompatibility proteins. Structure. 1994, 2 (4): 245-251. 10.1016/S0969-2126(00)00026-5.View ArticlePubMedGoogle Scholar
- Khan JM, Tong JC, Ranganathan S: Structural Immunoinformatics: Understanding MHC-peptide-TR binding. Bioinformatics for Immunomics. Edited by: Davies MN, Ranganathan S, Flower DR. 2009, Springer, Immunomics Reviews Series, 3: 77-94. full_text.View ArticleGoogle Scholar
- Tong JC, Tan TW, Ranganathan S: Methods and protocols for prediction of immunogenic epitopes. Brief Bioinform. 2007, 8 (2): 96-108. 10.1093/bib/bbl038.View ArticlePubMedGoogle Scholar
- Yewdell JW, Bennink JR: Immunodominance in major histocompatibility complex class I-restricted T lymphocyte responses. Annu Rev Immunol. 1999, 17: 51-88. 10.1146/annurev.immunol.17.1.51.View ArticlePubMedGoogle Scholar
- Yu K, Petrovsky N, Schonbach C, Koh JY, Brusic V: Methods for prediction of peptide binding to MHC molecules: a comparative study. Mol Med. 2002, 8 (3): 137-148.PubMed CentralPubMedGoogle Scholar
- Nielsen M, Lundegaard C, Worning P, Hvid CS, Lamberth K, Buus S, Brunak S, Lund O: Improved prediction of MHC class I and class II epitopes using a novel Gibbs sampling approach. Bioinformatics. 2004, 20 (9): 1388-1397. 10.1093/bioinformatics/bth100.View ArticlePubMedGoogle Scholar
- Vordermeier M, Whelan AO, Hewinson RG: Recognition of mycobacterial epitopes by T cells across mammalian species and use of a program that predicts human HLA-DR binding peptides to predict bovine epitopes. Infect Immun. 2003, 71 (4): 1860-1867. 10.1128/IAI.71.4.1980-1987.2003.View ArticleGoogle Scholar
- Doytchinova IA, Flower DR: class I T-cell epitope prediction: improvements using a combination of proteasome cleavage, TAP affinity, and MHC binding. Mol Immunol. 2006, 43 (13): 2037-2044. 10.1016/j.molimm.2005.12.013.View ArticlePubMedGoogle Scholar
- Schonbach C, Kun Y, Brusic V: Large-scale computational identification of HIV T-cell epitopes. Immunol Cell Biol. 2002, 80 (3): 300-306. 10.1046/j.1440-1711.2002.01089.x.View ArticlePubMedGoogle Scholar
- Bhasin M, Raghava GP: A hybrid approach for predicting promiscuous MHC class I restricted T cell epitopes. J Biosci. 2007, 32 (1): 31-42. 10.1007/s12038-007-0004-5.View ArticlePubMedGoogle Scholar
- Zhang GL, Khan AM, Srinivasan KN, Heiny A, Lee K, Kwoh CK, August JT, Brusic V: Hotspot Hunter: a computational system for large-scale screening and selection of candidate immunological hotspots in pathogen proteomes. BMC Bioinformatics. 2008, 9 (Suppl 1): S19-10.1186/1471-2105-9-S1-S19.PubMed CentralView ArticlePubMedGoogle Scholar
- Srinivasan KN, Zhang GL, Khan AM, August JT, Brusic V: Prediction of class I T-cell epitopes: evidence of presence of immunological hot spots inside antigens. Bioinformatics. 2004, 20 (Suppl 1): i297-302. 10.1093/bioinformatics/bth943.View ArticlePubMedGoogle Scholar
- Donnes P, Elofsson A: Prediction of MHC class I binding peptides, using SVMHC. BMC Bioinformatics. 2002, 3: 25-10.1186/1471-2105-3-25.PubMed CentralView ArticlePubMedGoogle Scholar
- Li S, Yao X, Liu H, Li J, Fan B: Prediction of T-cell epitopes based on least squares support vector machines and amino acid properties. Anal Chim Acta. 2007, 584 (1): 37-42. 10.1016/j.aca.2006.11.037.View ArticlePubMedGoogle Scholar
- Liu W, Wan J, Meng X, Flower DR, Li T: In silico prediction of peptide-MHC binding affinity using SVRMHC. Methods Mol Biol. 2007, 409: 283-291. full_text.View ArticlePubMedGoogle Scholar
- Singh SP, Mishra BN: Ranking of binding and nonbinding peptides to MHC class I molecules using inverse folding approach: Implications for vaccine design. Bioinformation. 2008, 3 (2): 72-82.PubMed CentralView ArticlePubMedGoogle Scholar
- Knapp B, Omasits U, Frantal S, Schreiner W: A critical cross-validation of high throughput structural binding prediction methods for pMHC. J Comput Aided Mol Des. 2009, 23 (5): 301-307. 10.1007/s10822-009-9259-2.View ArticlePubMedGoogle Scholar
- Logean A, Rognan D: Recovery of known T-cell epitopes by computational scanning of a viral genome. J Comput Aided Mol Des. 2002, 16 (4): 229-243. 10.1023/A:1020244329512.View ArticlePubMedGoogle Scholar
- Kosmopoulou A, Vlassi M, Stavrakoudis A, Sakarellos C, Sakarellos-Daitsiotis M: T-cell epitopes of the La/SSB autoantigen: prediction based on the homology modeling of HLA-DQ2/DQ7 with the insulin-B peptide/HLA-DQ8 complex. J Comput Chem. 2006, 27 (9): 1033-1044. 10.1002/jcc.20422.View ArticlePubMedGoogle Scholar
- Sauton N, Lagorce D, Villoutreix BO, Miteva MA: MS-DOCK: accurate multiple conformation generator and rigid docking protocol for multi-step virtual ligand screening. BMC Bioinformatics. 2008, 9: 184-10.1186/1471-2105-9-184.PubMed CentralView ArticlePubMedGoogle Scholar
- Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res. 2000, 28 (1): 235-242. 10.1093/nar/28.1.235.PubMed CentralView ArticlePubMedGoogle Scholar
- Kaas Q, Ruiz M, Lefranc MP: IMGT/3Dstructure-DB and IMGT/StructuralQuery, a database and a tool for immunoglobulin, T cell receptor and MHC structural data. Nucleic Acids Res. 2004, 32 (Database issue): D208-210. 10.1093/nar/gkh042.PubMed CentralView ArticlePubMedGoogle Scholar
- Ehrenmann F, Kaas Q, Lefranc MP: IMGT/3Dstructure-DB and IMGT/DomainGapAlign: a database and a tool for immunoglobulins or antibodies, T cell receptors, MHC, IgSF and MhcSF. Nucleic Acids Res. 2010, 38 (Database issue): D301-307. 10.1093/nar/gkp946.PubMed CentralView ArticlePubMedGoogle Scholar
- Ranganathan S, Tong JC, Tan TW: Structural immunoinformatics. Immunoinformatics. Edited by: Schonbach C, Ranganathan S, Brusic V. 2008, Springer, Immunomics Reviews Series, 51-61. full_text.View ArticleGoogle Scholar
- Bordner AJ, Abagyan R: Ab initio prediction of peptide-MHC binding geometry for diverse class I MHC allotypes. Proteins. 2006, 63 (3): 512-526. 10.1002/prot.20831.View ArticlePubMedGoogle Scholar
- Abagyan RA, Totrov MM: Ab initio folding of peptides by the Optimal-Bias Monte Carlo Minimization Procedure. J Comput Phys. 1999, 151: 402-421. 10.1006/jcph.1999.6233.View ArticleGoogle Scholar
- Wei HY, Tsai KC, Lin TH: Modeling ligand-receptor interaction for some MHC class II HLA-DR4 peptide mimetic inhibitors using several molecular docking and 3D QSAR techniques. J Chem Inf Model. 2005, 45 (5): 1343-1351. 10.1021/ci050140y.View ArticlePubMedGoogle Scholar
- Liu Z, Dominy BN, Shakhnovich EI: Structural mining: self-consistent design on flexible protein-peptide docking and transferable binding affinity potential. J Am Chem Soc. 2004, 126 (27): 8515-8528. 10.1021/ja032018q.View ArticlePubMedGoogle Scholar
- Rognan D, Lauemoller SL, Holm A, Buus S, Tschinke V: Predicting binding affinities of protein ligands from three-dimensional models: application to peptide binding to class I major histocompatibility proteins. J Med Chem. 1999, 42 (22): 4650-4658. 10.1021/jm9910775.View ArticlePubMedGoogle Scholar
- Desmet J, De Maeyer M, Spriet J, Lasters I: Flexible docking of peptide ligands to proteins. Methods Mol Biol. 2000, 143: 359-376.PubMedGoogle Scholar
- Sezerman U, Vajda S, DeLisi C: Free energy mapping of class I MHC molecules and structural determination of bound peptides. Protein Sci. 1996, 5 (7): 1272-1281. 10.1002/pro.5560050706.PubMed CentralView ArticlePubMedGoogle Scholar
- Rosenfeld R, Zheng Q, Vajda S, DeLisi C: Computing the structure of bound peptides. Application to antigen recognition by class I major histocompatibility complex receptors. J Mol Biol. 1993, 234 (3): 515-521. 10.1006/jmbi.1993.1607.View ArticlePubMedGoogle Scholar
- Rosenfeld R, Zheng Q, Vajda S, DeLisi C: Flexible docking of peptides to class I major-histocompatibility-complex receptors. Genet Anal. 1995, 12 (1): 1-21.View ArticlePubMedGoogle Scholar
- Sali A, Blundell TL: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993, 234 (3): 779-815. 10.1006/jmbi.1993.1626.View ArticlePubMedGoogle Scholar
- Khan JM, Ranganathan S: A multi-species comparative structural bioinformatics analysis of inherited mutations in alpha-D-mannosidase reveals strong genotype-phenotype correlation. BMC Genomics. 2009, 10 (Suppl 3): S33-10.1186/1471-2164-10-S3-S33.PubMed CentralView ArticlePubMedGoogle Scholar
- Nemethy G, Gibson KD, Palmer KA, Yoon CN, Paterlini G, Zagari A, Rumsey S, Scheraga HA: Energy parameters in polypeptides, 10: Improved geometric parameters and nonbonded interactions for use in the ECEPP/3 algorithm, with application to praline-containing peptides. J Phys Chem. 1992, 96 (15): 6472-6484. 10.1021/j100194a068.View ArticleGoogle Scholar
- Fernandez-Recio J, Totrov M, Abagyan R: Soft protein-protein docking in internal coordinates. Protein Sci. 2002, 11 (2): 280-291. 10.1110/ps.19202.PubMed CentralView ArticlePubMedGoogle Scholar
Copyright
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.