Journal of Chemistry

Journal of Chemistry / 2013 / Article

Research Article | Open Access

Volume 2013 |Article ID 374804 |

Shunlai Li, Rutao Zhang, Hongguang Du, "3D-QSAR Study on a Series of VEGFR-2 Kinase Inhibitors: 3-Pyrrole Substituted Indolin-2-Ones Compounds", Journal of Chemistry, vol. 2013, Article ID 374804, 8 pages, 2013.

3D-QSAR Study on a Series of VEGFR-2 Kinase Inhibitors: 3-Pyrrole Substituted Indolin-2-Ones Compounds

Academic Editor: Mire Zloh
Received04 May 2013
Revised02 Sep 2013
Accepted29 Sep 2013
Published27 Nov 2013


The vascular endothelial growth factor receptor-2 kinases (VEGFR-2) are attractive targets for the development of anticancer agents. Self-organizing molecular field analysis (SOMFA) (a simple three-dimensional quantitative structure-activity relationship (3D-QSAR) method) is used to study the structure-activity correlation of 3-pyrrole substituted indolin-2-ones VEGFR-2 inhibitors. The statistical results, cross-validated (0.5267) and non-cross-validated (0.5623), show a reliable predictive ability. The contributions of shape and electrostatic fields are 42.7% and 57.3%, respectively. Analysis of SOMFA models through shape and electrostatic grids provide useful information for the design and optimization of new 3-pyrrole substituted indolin-2-one based VEGFR-2 inhibitors.

1. Introduction

Cancer is the second leading cause of death in the world. Researchers have found that receptor tyrosine kinases (RTKs) play an important role in oncogenic transformation of cells [1]. VEGFR-2 (vascular endothelial growth factor receptor-2), as a member of the family of RTKs, is widely investigated in the pathogenesis of several disorders [27]. It is not only widely distributed in the organization of vascular endothelial cells but also distributed in some tumor cells; it plays an important role in the cell signalling of VEGF and tumor proliferation [8]. Recent research has shown that the blockade of VEGFR-2 signalling by small molecular inhibitors to the kinases domain can inhibit the growth of solid tumors [912]. Therefore, inhibition of the VEGFR-2 has become an important research direction in the treatment of cancers [13].

In recent years, a number of VEGFR-2 kinase inhibitors have been developed as anticancer agents, including analogues of quinazoline, indolin ketones, pyridazine, and quinoline structures [1417]. Recent research found that a series of 3-pyrrole substituted indolin-2-one compounds show good inhibitory activity against VEGFR-2, shown in Table 1, including sunitinib (compound 4) which has been recently approved by US FDA for the treatment of gastrointerstinal stromal tumor (GIST) and renal cell carcinoma (RCC). However, there are fewer quantitative structure-activity relationship (2D-QSAR and 3D-QSAR) studies and other molecular modeling works on VEGFR-2 targets [1823].



Compd.R1R2R3IC50 (μM)pIC50




Compd.R1R2IC50 (µM)pIC50




Compd.R1R2IC50 (µM)pIC50


Test set.

The self-organizing molecular field analysis (SOMFA) [24] is a simple 3D-QSAR technique, which has been developed by Robinson et al. The method has similarities to both comparative molecular field analysis (CoMFA) [25] and molecular similarity studies. Like CoMFA, a grid-based approach is used; however, SOMFA only uses steric and electrostatic maps; which are related to interaction energy maps; no probe interaction energies need to be evaluated. The weighting procedure of the grid points by mean-centered activity is an important ingredient of the SOMFA procedure. Like the similarity methods, it is the intrinsic molecular properties, such as the molecular shape and electrostatic potential, which are used to develop the QSAR models.

A SOMFA model could suggest a method of tackling the all-important alignment, which all 3D-QSAR methods have faced. The inherent simplicity of this method allows the possibility of aligning the training compounds as an integral part of the model derivation process and of aligning prediction compounds to optimize their predicted activities.

The purpose of this paper is to describe the application of self-organizing molecular field analysis (SOMFA) on a series of 3-pyrrole substituted indolin-2-one compounds, a novel class of VEGFR-2 kinases inhibitors. Therefore, our main objective is to provide some useful structure-activity information by SOMFA analysis and design novel inhibitors of VEGFR-2 kinases in the hope that these molecules will be developed into powerful anticancer agents.

2. Materials and Method

2.1. Data Set

The structures and bioactivities of 3-pyrrole substituted indolin-2-one compounds 1–34 were chosen from the papers by Sun et al. [26, 27]. They were classified into three groups according to the common parent structure, shown in Table 1.

Thirty-four compounds were divided into two sets. Compounds 1–25 were used as a training set while the remaining 9 compounds were used as a test set. The training set was used to build SOMFA models and the test set was used to evaluate the models. Their activities shown in Table 1 were described by IC50 (µM), which were converted into pIC50. Higher pIC50 value indicated greater inhibitory activity.

2.2. Molecular Modeling and Alignment

The 3D structures of all analogues were constructed with ArgusLab 4.01 software [28], running on an AMD Athlon 64 X2 Dual Core Processor 5000+ CPU/Microsoft Win XP platform.

Unless otherwise indicated, parameters are default. Full geometry optimizations are performed by MM2 method in the ArgusLab software. The conformations are then performed by RMS overlapping and fitted with compound 4 as a reference. Two different alignments are selected to define overlap. The first superposition of molecules, which are optimized by energy minimization and overlaid based on common structure using alignment A, has been shown in Figure 1. The second superposition of molecules, which are optimized by energy minimization and overlaid based on common structure using alignment B, has been shown in Figure 2.

For all analogues, the final active conformations search has also been performed by dock into active site method in eHiTS software [29]. The crystal structure of VEGFR-2 in the complex with its inhibitor 00J (PDB entry code 2XIR, resolution: ) is downloaded from RCSB Protein Data Bank ( After removing inhibitor 00J and water molecules and adding all the hydrogen atoms, the site where 00J binds with protein was defined as the active binding site.

The third superposition of molecules, which are obtained based on the binding conformation and their alignment in active site of VEGFR-2, has been shown in Figure 3.

According to the three alignments of these analogues, they are then performed using SOMFA analysis. Using VEGA software [30, 31], the electrostatic parameters of overlaid analogues are assigned charges by three different Hamiltonian semiempirical methods (AM1, PM3, and MNDO). After calculation, they are converted into CSSR file format, the only file format which the SOMFA2 program can accept to process a SOMFA analysis.

2.3. SOMFA 3D-QSAR Models

In the SOMFA study, similar to our previous works [32, 33], a grid originating at with a resolution of 0.5, 1.0, and , respectively, is generated around the aligned compounds. Twenty-seven models using different alignments, charges, and resolutions of grid are shown in Table 2.

Model no.AlignmentChargeResolution of grid ( )


: Non-cross-validated correlation coefficient; : cross validated correlation coefficient; : -test value; : standard error of estimate; : mixing coefficient of SOMFA model. : predictive .

For all of the studied compounds, shape and electrostatic potential are generated. To sum up the predictive power of these two properties into one final model, we combine their individual predictions using a weighted average of the shape and electrostatic potential based QSAR, using a mixing coefficient () as illustrated in the following [24]:

Clearly, multiproperty predictions could have been obtained through multiple linear regression.

Using (1) instead gives greater insight into the resultant model by allowing the study of the variation in predictive power with different values of .

With the highest value of , the SOMFA models then are derived by the partial least squares (PLS), implemented in SPSS software [34] with cross-validation.

The predictive ability of the model is quantitated in terms of which is defined in the following: where and .

SD is the sum of squares of the difference between the observed values and their meaning and PRESS is the prediction error sum of squares. The final models are constructed by a conventional regression analysis with the optimum value of mixing coefficient being equal to that yielding the highest and value according to (2).

3. Results and Discussion

SOMFA, a novel 3D-QSAR methodology, is employed for the analysis with the training set composed of 25 various compounds, whose biological activities have been known. Statistical results of 27 SOMFA models are listed in Table 2.

Among the 27 models, the 20th model is the best predictive model, which showed cross-validated value of 0.5267, non-cross-validated value of 0.5623, standard error of 0.5165, and value of 41.11, proved a good statistical correlation and predictive ability. It was also indicated that the SOMFA model was reliable and able to predict activities of new analogues as well. According to the value of , the contributions of steric and electrostatic field are 42.7% and 57.3%, respectively. Consequently, electrostatic field had a slightly bigger effect than that of steric field on the bioactivities of the analogues.

Taking alignments into account, we found that the alignment had a profound influence on the result. The model which was built by docking-based alignment showed the highest value of and of the three alignments, because it reflected a true pharmacophoric conformation docked in the cavity of receptor, indicating that it was more reliable and accurate.

Taking charges into account, according to alignment B and C, we found that the results were less sensitive to the quantum chemistry charge. That is to say, it made a slight difference that we use various semiempirical methods, for example, AM1, PM3, and MNDO.

Taking resolution of grid into account, we found that the values of and presented in model 10–18 indicated the correlation and reliability of models, , while in model 1–10, 18–27, , because it is known that a finer grid resolution produced a better correlation and reliability. However, if the grid is far too fine, the amount of noise in the data is increased and thus the model is inaccurate. In model grid resolution produced the best result.

Taking the values of into account, we found that the values of models which were built by alignment A showed higher values. That is to say, the contribution of steric field is four times more than that of electrostatic field. Due to the structure-based alignment, they were given such a high contribution of steric field. While according to alignment B and alignment C, we found that steric field had almost the same influence to the electrostatic field, indicating that the steric and electrostatic interactions of molecules could be two equally important factors for the bioactivities of the analogues.

The experimental and predicted activities of training set and test set are reported in Table 3. Figure 4 shows the correlation between experimental and predicted activities of SOMFA model for the training set and test set.

Compd.Experimental pIC50Predicted pIC50ResidualaCompd.Experimental pIC50Predicted pIC50Residuala


Test set; Residuala = Experimental – Predicted.

It is known that the best way to validate the 3D-QSAR model is to predict bioactivities of compounds in test set. The SOMFA analysis of the test set composed of 9 compounds is reported in Figure 4, indicating a satisfying linear correlation and moderate difference between experimental and predicted activities. From Figure 4, we found that compound 32 had a large residual and thus was classified as outlier. There may be a complicated relationship between structure and activity in this compound. On the whole, the model performed well in the activity prediction of most of the test compounds.

SOMFA calculation for both shape and electrostatic potentials is performed, then combined to get an optimal coefficient according to (1), indicating that the electrostatic field contribution is of a little high importance. The master grid maps derived from the best model are used to display the contribution of shape and electrostatic potentials. The master grid maps give a direct visualization of which parts of the compounds differentiate the activities of compounds in the training set under study. The master grid also offers an interpretation as to how to design and then synthesize some novel compounds with much higher activities. The visualization of the shape master grid and electrostatic master grid of the best SOMFA model is shown in Figures 5 and 6, respectively, with compound 4 (sunitinib) as the reference.

Each master grid map is colored in two different colors for favorable and unfavorable effects. In other words, the electrostatic features are red (more positive charge increases activity or more negative charge decreases activity) and blue (more negative charge increases activity or more positive charge decreases activity), and the shape features are red (more steric bulk increases activity) and blue (more steric bulk decreases activity), respectively.

As shown in Figure 5, the shape master grid shows red-colored regions where steric bulk enhances activity and blue-colored regions where steric bulk detracts from activity. We found that there was a high density of blue lattice points surrounding 6-position of indolinone ring and 4-position of pyrrole ring, which suggested that more bulky substituents in these areas would decrease the bioactivities. We also found a big region of red lattice points appearing near 2-position and diethylamine group, which suggested that more bulky substituents in these areas would remarkably increase the bioactivities.

As shown in Figure 6, the electrostatic potential master grid shows red-colored regions where increased positive charge is favorable for bioactivities and blue-colored regions where increased negative charge is favorable for bioactivity. We found that there was a high density of red lattice points surrounding indolinone ring and pyrrole ring, suggesting that reducing electronic density would increase bioactivities. While diethylamine group surrounded by blue lattice points suggested that negatively charged substituents would increase the bioactivities.

4. Conclusion

In this study, according to different alignments, 3D-QSAR analysis was carried out to construct a highly accurate and predictive SOMFA model (, ). The contributions of steric and electrostatic fields are 42.7% and 57.3%, respectively, indicating that the electrostatic interaction of molecules could be a little more important factor for the bioactivities of the analogues. The final grid maps help to better interpret the structure-activity relationship of these analogues. Generally, the small-sized electropositive potential substituents (e.g., methyl, ethyl, and aliphatic amines) at the indolinone ring and pyrrole ring increase the activity, and the big-sized electronegative potential substituents (e.g., benzene ring with electron-withdrawing groups and pyridine ring) at the diethylamine group increase the activity. All analyses of SOMFA models may provide some useful information in the design of new analogues of sunitinib.

Conflict of Interests

The authors declare that they have no conflict of interests with each other, and they also have no conflict of interests with SOMFA, ArgusLab, eHiTS, VEGA, or SPSS software.


This work was supported by the National Natural Science of Foundation of China (no. 21272022). The authors gratefully acknowledge Daniel Robinson and the Computational Chemistry Research Group (Oxford University, UK) for the use of the SOMFA software.


  1. D. R. Robinson, Y.-M. Wu, and S.-F. Lin, “The protein tyrosine kinase family of the human genome,” Oncogene, vol. 19, no. 49, pp. 5548–5557, 2000. View at: Google Scholar
  2. G. A. Fava, “Affective disorders and endocrine disease: new insights from psychosomatic studies,” Psychosomatics, vol. 35, no. 4, pp. 341–353, 1994. View at: Google Scholar
  3. D. A. Walsh and L. Haywood, “Angiogenesis: a therapeutic target in arthritis,” Current Opinion in Investigational Drugs, vol. 2, no. 8, pp. 1054–1063, 2001. View at: Google Scholar
  4. L. P. Aiello, R. L. Avery, P. G. Arrigg et al., “Vascular endothelial growth factor in ocular fluid of patients with diabetic retinopathy and other retinal disorders,” New England Journal of Medicine, vol. 331, no. 22, pp. 1480–1487, 1994. View at: Publisher Site | Google Scholar
  5. M. Detmar, “The role of VEGF and thrombospondins in skin angiogenesis,” Journal of Dermatological Science, vol. 24, no. 1, pp. S78–S84, 2000. View at: Publisher Site | Google Scholar
  6. J. Folkman, “Anti-angiogenesis: new concept for therapy of solid tumors,” Annals of Surgery, vol. 175, no. 3, pp. 409–416, 1972. View at: Google Scholar
  7. L. A. Liotta, P. S. Steeg, and W. G. Stetler-Stevenson, “Cancer metastasis and angiogenesis: an imbalance of positive and negative regulation,” Cell, vol. 64, no. 2, pp. 327–336, 1991. View at: Google Scholar
  8. S. Shinkaruk, M. Bayle, G. Laïn, and G. Déléris, “Vascular Endothelial Cell Growth Factor (VEGF), an emerging target for cancer chemotherapy,” Current Medicinal Chemistry, vol. 3, no. 2, pp. 95–117, 2003. View at: Publisher Site | Google Scholar
  9. S. Baka, A. R. Clamp, and G. C. Jayson, “A review of the latest clinical compounds to inhibit VEGF in pathological angiogenesis,” Expert Opinion on Therapeutic Targets, vol. 10, no. 6, pp. 867–876, 2006. View at: Publisher Site | Google Scholar
  10. B. M. Klebl and G. Müller, “Second-generation kinase inhibitors,” Expert Opinion on Therapeutic Targets, vol. 9, no. 5, pp. 975–993, 2005. View at: Publisher Site | Google Scholar
  11. K. Holmes, O. L. Roberts, A. M. Thomas, and M. J. Cross, “Vascular endothelial growth factor receptor-2: structure, function, intracellular signalling and therapeutic inhibition,” Cellular Signalling, vol. 19, no. 10, pp. 2003–2012, 2007. View at: Publisher Site | Google Scholar
  12. H. Zhong and J. P. Bowen, “Molecular design and clinical development of VEGFR kinase inhibitors,” Current Topics in Medicinal Chemistry, vol. 7, no. 14, pp. 1379–1393, 2007. View at: Publisher Site | Google Scholar
  13. N. Ferrara, K. J. Hillan, H.-P. Gerber, and W. Novotny, “Discovery and development of bevacizumab, an anti-VEGF antibody for treating cancer,” Nature Reviews Drug Discovery, vol. 3, no. 5, pp. 391–400, 2004. View at: Google Scholar
  14. T. Inai, M. Mancuso, H. Hashizume et al., “Inhibition of vascular endothelial growth factor (VEGF) signaling in cancer causes loss of endothelial fenestrations, regression of tumor vessels, and appearance of basement membrane ghosts,” American Journal of Pathology, vol. 165, no. 1, pp. 35–52, 2004. View at: Google Scholar
  15. M. A. J. Duncton, E. L. Piatnitski Chekler, R. Katoch-Rouse et al., “Arylphthalazines as potent, and orally bioavailable inhibitors of VEGFR-2,” Bioorganic and Medicinal Chemistry, vol. 17, no. 2, pp. 731–740, 2009. View at: Publisher Site | Google Scholar
  16. A. L. Thomas, B. Morgan, M. A. Horsfield et al., “Phase I study of the safety, tolerability, pharmacokinetics, and pharmacodynamics of PTK787/ZK 222584 administered twice daily in patients with advanced cancer,” Journal of Clinical Oncology, vol. 23, no. 18, pp. 4162–4171, 2005. View at: Publisher Site | Google Scholar
  17. J. S. Beebe, J. P. Jani, E. Knauth et al., “Pharmacological characterization of CP-547,632, a novel vascular endothelial growth factor receptor-2 tyrosine kinase inhibitor for cancer therapy,” Cancer Research, vol. 63, no. 21, pp. 7301–7309, 2003. View at: Google Scholar
  18. Z. Ye, X. Mengzhu, Z. Xue et al., “Substituted indolin-2-ones as p90 ribosomal S6 protein kinase 2 (RSK2) inhibitors: molecular docking simulation and structure-activity relationship analysis,” Bioorganic & Medicinal Chemistry, vol. 21, pp. 1724–1734, 2013. View at: Google Scholar
  19. J. K. Sanmati, J. Rahul, S. Lokesh et al., “QSAR analysis on 3,4-disubstituted-1H-pyrazol-5(4H)-ones as KDR/VEGFR-2 kinase inhibitors,” Pharmacia Lettre, vol. 4, pp. 1060–1070, 2012. View at: Google Scholar
  20. X. Jiang, G. Ou, D. Yan, M. Zhang, and X. Yuan, “3D-QSAR studies of VEGFR-2 kinase inhibitors based on docking,” Letters in Drug Design and Discovery, vol. 8, no. 10, pp. 926–942, 2011. View at: Publisher Site | Google Scholar
  21. H. Zeng and H. Zhang, “Combined 3D-QSAR modeling and molecular docking study on 1,4-dihydroindeno[1,2-c]pyrazoles as VEGFR-2 kinase inhibitors,” Journal of Molecular Graphics and Modelling, vol. 29, no. 1, pp. 54–71, 2010. View at: Publisher Site | Google Scholar
  22. X. Lu, Y. Chen, and Q. You, “Pharmacophore guided 3D-QSAR CoMFA analysis of amino substituted nitrogen heterocycle ureas as KDR inhibitors,” QSAR and Combinatorial Science, vol. 28, no. 11-12, pp. 1524–1536, 2009. View at: Publisher Site | Google Scholar
  23. B. K. Sharma, S. K. Sharma, P. Singh, and S. Sharma, “A quantitative structure-activity relationship study of novel, potent, orally active, selective VEGFR-2 and PDGFRα tyrosine kinase inhibitors: derivatives of N-Phenyl-N′-{4-(4-quinolyloxy phenyl}urea as antitumor agents,” Journal of Enzyme Inhibition and Medicinal Chemistry, vol. 23, no. 2, pp. 168–173, 2008. View at: Publisher Site | Google Scholar
  24. D. D. Robinson, P. J. Winn, P. D. Lyne, and W. G. Richards, “Self-organizing molecular field analysis: a tool for structure-activity studies,” Journal of Medicinal Chemistry, vol. 42, no. 4, pp. 573–583, 1999. View at: Publisher Site | Google Scholar
  25. R. D. Cramer III, D. E. Patterson, and J. D. Bunce, “Comparative molecular field analysis (CoMFA)—1. Effect of shape on binding of steroids to carrier proteins,” Journal of the American Chemical Society, vol. 110, no. 18, pp. 5959–5967, 1988. View at: Google Scholar
  26. L. Sun, C. Liang, S. Shirazian et al., “Discovery of 5-[5-fluoro-2-oxo-1,2-dihydroindol-(3Z)-ylidenemethyl]-2, 4-dimethyl-1H-pyrrole-3-carboxylic acid (2-diethylaminoethyl)amide, a novel tyrosine kinase inhibitor targeting vascular endothelial and platelet-derived growth factor receptor tyrosine kinase,” Journal of Medicinal Chemistry, vol. 46, no. 7, pp. 1116–1119, 2003. View at: Publisher Site | Google Scholar
  27. L. Sun, N. Tran, C. Liang et al., “Design, synthesis, and evaluations of substituted 3-[(3- or 4- carboxyethylpyrrol-2-yl)methylidenyl]indolin-2-ones as inhibitors of VEGF, FGF, and PDGF receptor tyrosine kinases,” Journal of Medicinal Chemistry, vol. 42, no. 25, pp. 5120–5130, 1999. View at: Publisher Site | Google Scholar
  28. M. A. Thompson, “ArgusLab 4.0.1.,” Planaria Software LLC, Seattle, Wash, USA, 2001, View at: Google Scholar
  29. Z. Zsoldos, D. Reid, A. Simon, S. B. Sadjad, and A. P. Johnson, “eHiTS: a new fast, exhaustive flexible ligand docking system,” Journal of Molecular Graphics and Modelling, vol. 26, no. 1, pp. 198–212, 2007. View at: Publisher Site | Google Scholar
  30. A. Pedretti, L. Villa, and G. Vistoli, “VEGA: a versatile program to convert, handle and visualize molecular structure on Windows-based PCs,” Journal of Molecular Graphics and Modelling, vol. 21, no. 1, pp. 47–49, 2002. View at: Publisher Site | Google Scholar
  31. VEGA ZZ Release, a versatile program to convert, handle and visualize molecular structure on Windows-based PCs,
  32. S. Li, Y. Zheng, and W. Yin, “Three dimensional quantitative structure-activity relationship studies on a series of typical tricycle compounds,” Journal of Beijing University of Chemical Technology, vol. 34, no. 3, pp. 249–253, 2007 (Chinese). View at: Google Scholar
  33. S.-L. Li, M.-Y. He, and H.-G. Du, “3D-QSAR studies on a series of dihydroorotate dehydrogenase inhibitors: analogues of the active metabolite of leflunomide,” International Journal of Molecular Sciences, vol. 12, no. 5, pp. 2982–2993, 2011. View at: Publisher Site | Google Scholar
  34. SPSS software version 16, IBM Corporation, Somers, NY, USA, 2007,

Copyright © 2013 Shunlai Li et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

More related articles

 PDF Download Citation Citation
 Download other formatsMore
 Order printed copiesOrder

Related articles

We are experiencing issues with article search and journal table of contents. We are working on a fix as to remediate it and apologise for the inconvenience.

Article of the Year Award: Outstanding research contributions of 2020, as selected by our Chief Editors. Read the winning articles.