Future Fusion Technology Division, Computational Science Center, Korea Institute of Science and Technology, P.O. Box 131, Cheongryang, Seoul 130-650, South Korea
Research by other investigators has established that insulin-like growth factor‐1 receptor (IGF-1R) is a key oncological target, and that derivatives of 1, 3-disubstituted-imidazo[1,5-] pyrazine are potent IGF-1R inhibitors. In this paper, we report on our three-dimensional quantitative structure activity relationship (3D-QSAR) studies for this series of compounds. We validated the 3D-QSAR models by the comparison of two major alignment schemes, namely, ligand-based (LB) and receptor-guided (RG) alignment schemes. The latter scheme yielded better 3D-QSAR models for both comparative molecular field analysis (CoMFA) (, ) and comparative molecular similarity indices analysis (CoMSIA) (, ). We submit that this might arise from the more accurate inhibitor alignment that results from using the structural information of the active site. We conclude that the receptor-guided 3D-QSAR may be helpful to design more potent IGF-1R inhibitors, as well as to understand their binding affinity with the receptor.
1. Introduction
The insulin-like growth factor-1 receptor is a membrane-associated receptor that belongs to subclass I of the
receptor tyrosine kinase (RTK) superfamily [1]. IGF-1R has been shown to
have significant roles in the regulation of normal cell growth. It has
mitogenic and survival effects on human cancer cells [2]. The Binding of IGF-1 to
IGF-1R activates the RTK, and later, in turn, activates a cascade of downstream
signals, which are postulated to stimulate cell proliferation and enhance
resistance to apoptosis [3]. Understandably, the abnormal
expression of the IGF-1R has been implicated to cancer. Epidemiological studies
have also shown a link between serum concentrations of IGF-1 and IGFBP-3 with
increased risks of breast cancer [4]. A number of anticancer
agents which inhibit the IGF-1R activity and proliferation [5] have been extracted from
plants [6] as well as synthesized, such
as BMS-554417 (2-(4-substituted-2-oxo-1,2-dihydropyridin-3-yl)-benzimidazole) [7] and NVP-AEW541 (pyrrolo[2,3-d]
pyrimidine derivative) molecules. Both of these compounds are orally
administered and have proved antitumor activity. Various QSAR techniques are being
used to explore more potent ligands [8–11]; but in this study, we
performed comparative three-dimensional quantitative structure activity relationship
(3D-QSAR) [12–14] analyses on IFG-1R inhibitors
[15] of imidazo [1, 5-]
pyrazine derivatives. In 3D-QSAR [14], determination of the
bioactive conformer [16] and molecular alignment of
the compounds is key factor to get meaningful results. The biologically active
conformations of the structures should be aligned in a way that represents a
similar binding mode [17]. Here we first applied the ligand-based
(LB) strategy using the systematic search-based minimum energy conformer approach [18]. Second, receptor-based 3D-QSAR [19] using molecular docking of inhibitors
in the available X-ray crystal structure [20] of the receptor protein. The qualities of these
3D-QSAR models were compared and discussed with respect to the IGF-1R target.
2. Material and Methods
A
series of 54 potent 1, 3-disubstituted imidazole [1, 5-] pyrazine
derivatives with their inhibitory
activities to IGF-1R were taken from the literature [15]. The dataset was randomly divided
into 43 and 11 molecules, the training and test datasets, respectively. The
observed IC50 values were converted into pIC50 values and are reported in Table 1.
Table 1: The structures and observed IGF inhibitory activities [
15].
2.1. Computational Details
The molecular
modeling studies were carried out using SYBYL 7.3. The initial structures were minimized
at Tripos force field [21] with MMFF94 charge by using
conjugate gradient method, and convergence criterion was 0.005 kcal/mol.The comparative molecular field
analysis (CoMFA) and comparative molecular similarity indices analysis (CoMSIA)
studies require aligned structures [16].
The ligand-based (LB) and receptor-guided (RG) alignment techniques were used
in two geometrical schemes respectively.
2.2. CoMFA and CoMSIA
Lennard-Jones and Coulomb potentials-based CoMFA analysis has
been performed and the steric as well as electrostatic energies were calculated
by using sp3 carbon probe atom with Van der Waal radius of 1.52 Å
and charge. The energies were truncated to kcal/mol and the electrostatic
contributions were ignored at lattice interactions with maximum steric
interactions. The CoMFA were generated by standard method in SYBYL. The CoMSIA
models were also derived with the same lattice box used as in CoMFA
calculations. All five CoMSIA similarity indices (steric, electrostatic, hydrophobic, H-bond
donor, and H-bond acceptor) were evaluated using the probe atom. The CoMSIA
models from hydrophobic and H-bonds were calculated between the grid point and
each atom of the molecule by a Gaussian function [14]. An attenuation factor’s default value of
0.30 was used, which is the standard distance dependence of molecular
similarity.
2.3. PLS Analysis and Validation of QSAR Models
In order to derive 3D-QSAR models, the CoMFA and CoMSIA
descriptors were used as independent variables and the pIC50 values as the dependent variable. Partial
least-square (PLS) method [22] was used to linearly
correlate these CoMFA and CoMSIA descriptors to the inhibitory activity values.
The CoMFA cutoff values were set to 30 kcal/mol for both steric and
electrostatic fields, and also all fields were scaled by the default options in
SYBYL. The cross-validation analysis was performed using the leave one out
(LOO) method in which one compound is removed from the dataset and its activity
is predicted using the model derived from the rest of the dataset. The
cross-validated correlation coefficient (q2) that resulted in
optimum number of components and lowest standard error of prediction were
calculated using the
following formulae,where
, , and are predicted,
actual, and mean values of the target property (pIC50), respectively.
The non-cross-validated PLS analyses were performed with column filtering value
of 2.0, to reduce analysis time with small effect on the q2 values. To further assess the robustness and statistical confidence of the
derived models, bootstrapping analysis for 100 runs were performed.
The predictive power of 3D-QSAR models, derived by using the training
set were examined by an external test set of eleven molecules. The predictive
ability of the models is expressed by the predictive r2 value, which is analogous to cross-validated r2 (q2)
and is calculated using the following formula:
where
SD is the sum of the squared deviations between the biological activities of
the test set and mean activities of the training molecules and PRESS is the sum
of squared deviation between predicted and actual activities of the test set
molecules.
3. Results and Discussion
3.1. Ligand-Based Alignment
In this scheme, the most active molecule was used as a template. Systematic
search routine was used in the conformational analysis and all rotatable bonds
were searched in increments from to . Conformational energies were
computed with electrostatic term, and the lowest energy conformer was selected. The template was modified for other ligands
of the series. All ligands were minimized by Tripos force field but the common
moiety was constrained during minimization. The molecules were aligned by
superimposing common substructures using SYBYL database alignment option. These
aligned structures were subsequently used for ligand-based CoMFA/CoMSIA probe
interaction energy calculations.
3.2. Receptor-Guided Alignment
This geometrical
scheme is based on docked geometry. The best docked mode of the smallest compound
was taken as template and modified for the other compounds. The compounds were minimized
by Tripos force field (Powell method, 2000 iterations, and 0.05 kcalmol-1Å-1 energy gradient convergence criteria). All minimized structures at this binding
mode were superimposed to get the molecular alignment for CoMFA and CoMSIA. The
superimposed structures inside the receptor site were further used for CoMFA
and CoMSIA analysis.
3.3. Molecular Docking
The structure
coordinates of IGF-1R were obtained from protein databank (1JQH) [20]. Recently, Mulvihill et al. [15] presented a possible binding
mode of compound-2 by using FlexX-based docking. Here we have also performed
molecular docking of same compound. The PDB file obtained from protein data
bank was used as receptor site. All water molecules were removed and the protein
was modified to dock inhibitor. The active site was defined with a distance of
6.5 Å of ATP binding site. The ligand-2 was docked into the monomer unit (A) of
IGF-1R and out of 100 conformers the best mode was selected as template. This
binding mode seems prominent as the hydrophobic zone of inhibitor corresponds
to hydrophobic pocket of IGFR. The residue E-1080, M-1082, K-1033, D-1086, G-1006,
and L-1005 makes hinge contact and might have significant role in the
inhibition of IGF-1R. It is also clear from all the figures that the depicted
mode holds 3 H-bonds in this region. The –OH group of benzene ring makes H-bond
with –NH of K-1033,
nitrogen of pyrimidine ring makes contact with –NH of M-1082 and both act as H-bond
acceptor. The NH2 group of pyrimidine ring acts as H-bond donor and
makes contact with oxygen of E1080.
3.4. CoMFA and CoMSIA Results
The CoMFA and
CoMSIA studies were carried out by using both geometrical schemes with different
descriptors fields independently and in combination. The ligand-based alignment
gave better results for CoMFA model using both field descriptors with cross-validated
and non-cross-validated ,
while for CoMSIA model, combination of steric, electrostatic, and H-bond
acceptor yielded the best statistical values with and . The internal predictivity of these CoMFA and CoMSIA models was also
good with boot-strapped correlation coefficient
and 0.85, respectively. These models were also validated on a test set of 11
molecules with predictive for CoMFA model and 0.57 for
CoMSIA model. In comparison to LB, receptor-guided alignment yielded more significant models with better
understanding of these inhibitors and receptor interactions. Best CoMFA models
were obtained by combination of steric and electrostatic field descriptors with
and . Whereas steric, electrostatic, and H-bond
acceptor filed descriptors gave the best CoMSIA model with and . To further asses the robustness and statistical confidence,
the boot strapping analysis were performed for 100 runs. The for CoMFA and CoMSIA models suggest that a good internal consistency
exists within the underlying dataset. The high predictive values
for CoMFA and CoMSIA (0.67 and 0.64, resp.) also prove models validity. In
our efforts to obtain the more pronounced model, region focusing was performed.
It only yielded high value which is not sufficient condition for
the model to have high predictive power [23]. The regression summary of
different 3D-QSAR models obtained at default parameters and after region
focusing are presented in Tables 2 and 3, respectively. The predicted pIC50 values for training and
test set from CoMFA and CoMSIA models are given in Tables 4 and 5,
respectively.
Table 2: Statistical summary of different PLS analysis. (GS: geometrical scheme; SE: standard error of estimate; n.:
number of components; : Fischer’s value for test of significance; : coefficient of determination after 100 bootstrapping runs; SD: standard deviation; Field contribution: (S) steric field, (E) electrostatic field, (H) hydrophobic field, (D) H-bond donor field, and (A) H-bond acceptor field.).
Table 3: Statistics of different PLS analysis after region focusing. (GS: geometrical scheme; SE: standard error of estimate; n.: number of components; : Fischer’s value for test of significance; : coefficient of determination after 100 bootstrapping runs; SD: standard deviation; Field contribution: (S) steric field, (E) electrostatic field, (H) hydrophobic field, (D) H-bond donor field, and (A) H-bond acceptor field.).
Table 4: Experimental and predicted activities with their residuals by CoMFA and CoMSIA analyses of the training set.
Table 5: Experimental and predicted activities with their residuals by CoMFA and CoMSIA analyses of the test set.
In 3D-QSAR, the determination of the
bioactive conformer and molecular alignment of the compounds is an important
step. In ligand-based techniques, the minimum energy conformers are often used
as bioactive conformer. In contrast, the binding poses obtained from cocrystal
structure are used in receptor-guided techniques. Here, both techniques were
used. The statistical results indicate that conformation obtained from
molecular docking is more reliable. In Figure 1, the yellow conformer displays systematic search-based
minimum energy conformer while the red structure shows docked conformer. The
findings are reasonable as the oxygen attached with benzyl group of docked
conformer is more closed to amino acid (Asp1086) that facilitates an H-bonding
between –NH of Asp-1086
and this oxygen atom of the inhibitor; but in case of minimum energy conformer
(yellow), the benzyl moiety is quite far and disfavors such interactions.
Figure 1: Comparison of minimum energy (yellow) and docking based (red) conformers.
3.5. The CoMFA Contour Maps
Figures 2 and 3 show the electrostatic and steric contour
maps of the best models based on receptor-guided alignment scheme. The electrostatic
interactions are represented by red- and blue-colored contours while steric
interactions are represented by green and yellow colored contours. In
electrostatic field, blue color contour
represents region where electropositive group enhances the activity, whereas red-color region likes electron-rich groups to
increase the biological activity. In case of steric interactions, the green
region demands bulky substituents to enhance the activity, while in yellow
contours, bulky substituents decrease the activity.
Figure 2: CoMFA electrostatic maps with the most (red) and least (orange) active compound within the active site.
Figure 3: CoMFA steric maps with the most (red) and least (orange) active compound within the active site.
The most potent compound-47 (red color) and least-active
compound-9 (orange color) of the series with CoMFA contour maps have been
superimposed in the active site of the receptor protein. Figure 2 shows that
red polyhedrons locate the region where electron-rich group will enhance the
inhibitory activity, and vice versa for blue polyhedron. Therefore, the phenyl
ring in compound-47 might be responsible for its higher activity than methoxy
group of compound-9 because
it might have the - interactions with the phenyl ring of phenyl alanine (Phe1010)
amino acid. The red contour around 1–3 carbon of cyclobutane also demands the electron-rich group for higher potency. Compound-47 has amino group at C-3 position which
might be responsible for its higher activity than least-active compound-9. It
is also clear in most of compounds from the dataset that electron-rich group at
this position have higher activity than compound-9. In Figure 3, green polyhedron
locates the region
where bulky substitutent would increase the inhibitory activity and yellow
polyhedron where the steric bulk is not required for high potency of the
compounds. The small green contour near the phenyl ring of compound-47 explains
its higher activity than compound-9. Similarly, the green contour around 2 and 3
carbon of cyclobutane requires the bulky substitutent to be highly active. Thus
the bulky substitutent at this position in dataset favors the higher inhibitory
activity of the compounds than compound-9. Yellow polyhedron below the plane of
phenyl ring and cyclopropane requires the small group to be more active.
3.6. CoMSIA Contour Maps
The CoMSIA contour
maps were also developed on the models based on the geometrical scheme 2. Figures
4, 5, and 6 show the steric electrostatic and H-bond acceptor contour maps
superimposed in the active site of the IGF-1R. In CoMSIA method, steric and
electrostatic contours maps have the same meaning as that of CoMFA contour maps
whereas H-bond acceptor contours are represented by magenta and red colors.
Magenta favors H-bond acceptor group while red disfavors. The steric and
electrostatic maps are more or less similar to the corresponding CoMFA models (Figures
2 and 3, resp.) except that there is a small green contour near phenyl
ring of compound-47 in CoMFA model. In Figure 6, the magenta contour around C-2
and C-3 position of cyclobutane favors the H-bond accepting group to enhance the
inhibitory activity of the molecules. Thus the H-bond accepting substituent at
C-4 position might enhance inhibitory activity of the compounds through H-bonding
with Glycine (Gly1008) or Valine (Val1013).
Figure 4: CoMSIA electrostatic maps with the most (red) and least (orange) active compound within the active site.
Figure 5: CoMSIA steric maps with the most (red) and least (orange) active compound within the active site.
Figure 6: CoMSIA H-bond acceptor map with the most (red) and least (orange) active compound within the active site.
4. Conclusion
A
comparative CoMFA and CoMSIA models were developed for the series of potent
IGF-1R inhibitors. Ligand-based and receptor-guided protocols were applied to
develop the models. Receptor-guided alignment gave models with better
statistics than the ones from the ligand-based approach, presumably because the
alignment using receptor information is more realistic. Moreover, the
interpretation of receptor-guided models are directly associated with the
receptor information. That is, in general, the superposition of a CoMFA or
CoMSIA contour map inside the receptor shows reasonable correspondence between the
contour map property and the physical property of surrounding active site
region. This provides more detailed understanding about the interaction between
the series of inhibitors and IGF-1R. The information drawn here can be used to
design new inhibitors of IGF-1R.
Acknowledgment
The authors would like to thank Jung Soo Oh for the valuable support.