Ma XH et al / Acta Pharmacol Sin 2004 Jul; 25 (7): 950-958
Xiao-hui MA, Xiao-yi ZHANG, Jian-jun TAN, Wei-zu CHEN, Cun-xin WANG2
College of Life Science and Bioengineering, Beijing University of Technology, Beijing 100022, China
1 Project supported by the National Natural Science Foundation of China (No 30170230), the Beijing Natural Science Foundation (No 5032002 and 5042003), and Beijing City Education Committee (No KZ200410005002).
2 Correspondence to Prof Cun-xin WANG. Phn 86-10-6739-2724. Fax 86-10-6739-2724. E-mail cxwang@bjut.edu.cn
Received 2003-10-28 Accepted 2004-03-16
KEY WORDS HIV integrase; inhibitors; quinolines; comparative molecular field analysis; AutoDock
ABSTRACT
AIM: To understand pharmacophore properties of styrylquinoline derivatives and to design inhibitors of HIV-1 integrase. METHODS: Comparative molecular field analysis (CoMFA) was performed to analyze three-dimensional quantitative structure-activity relationship (3D-QSAR) of styrylquinoline derivatives. Thirty-eight compounds were randomly divided into a training set of 28 compounds and a test set of 10 compounds. The stability of 3D-QSAR models was proved by the analysis of cross-validated and non-cross-validated methods. Moreover, the binding mode of these compounds and integrase was constructed by AutoDock program. RESULTS: The CoMFA model of the training compounds was reasonably predicted with cross-validated coefficient (q2) and conventional (r2) values (up to 0.696 and 0.754). Then the model was validated by the test set. The resulting CoMFA maps visualized structural requirements for the biological activity of these inhibitors. Docking results showed that a carboxyl group at C-7 and a hydroxyl group at C-8 in the quinoline subunit, bound closely to the divalent metal cofactor (Mg2+) around the integrase catalytic site. Moreover, there is a linear correlation between the binding energy of the inhibitors with integrase and their inhibitory effect. CONCLUSIONS: The present study indicated that the CoMFA model together with docking results could give us helpful hints for drug design as well as interpretation of the binding affinity between these inhibitors and integrase.
INTRODUCTION
Along with reverse transcriptase (RT) and protease (PR), HIV integrase (IN) is recognized as an important target for treatment of acquired immunodefi ciency syndrome (AIDS). Drugs included in the highly active antiretroviral therapy (HAART)[1] regime are mainly directed against RT and PR. However, toxicities and emergence of drug-resistant virus necessitate the examination of complimentary targets for therapeutic development[2,3]. Accordingly, significant effort is devoted to find inhibitors against the third viral enzyme, IN[4-6].
IN catalyzes the insertion of viral cDNA into the host genome. Integration occurs via a multi-step sequence of reaction, which includes cleavage of a dinucleotide pair from the 3'-end (termed 3'-processing) followed by transfer of the resulting shortened strands into the target DNA (termed strand transfer). Although a large number of compounds have been reported to inhibit HIV-1 integrase in biochemical assays[6-10], only a few exhibit a substantial level of antiviral activity at nontoxic concentrations and distinct selectivity for IN and no clinically useful inhibitor has been developed to date[7,10]. This requires insights into the factors that influence inhibitor's activity and selectivity toward IN to guide drug design.
The present study has reported styrylquinoline derivatives as potent HIV IN inhibitors. These compounds display antiviral activities in a de novo infection assay of CEM cells and are devoid of cytotoxicity[11,12]. Ouali et al have analyzed thermodynamic and geometrical parameters of inhibitors binding to the core domain of RSV[13]. Some research groups have developed pharmacophores of different classes of integrase inhibitors including styrylquinoline[14-17]. Despite the remarkable research work, the binding mode of styryl-quinoline derivatives to IN and the rational design of new inhibitors remain to be elucidated. Consequently, our primary aim is to obtain the CoMFA[18] model to predict the activity of styrylquinoline derivatives and the rational binding mode of styrylquinoline derivatives to IN. CoMFA and docking simulations were extended to styrylquinoline derivatives in this work. We anticipate this study might give significant information for drug design.
MATERIALS AND METHODS
Data sets To develop reliable CoMFA, 38 styryl-quinoline derivatives with diverse structures and distinct activities were selected from two literatures of the same research group[11,12]. The biological activity of each compound expressed as IC50, which is the inhibitory concentration of compounds needed to inhibit 50 % of HIV-1 IN in 3'-processing, was converted to pIC50 (-lgIC50) for the 3D-QSAR analysis. Thirty-eight derivatives were randomly divided into two groups: a training set of 28 compounds and a test set of 10 compounds.
3D-QSAR studies Three-dimensional structures of these inhibitors were built using SYBYL 6.9[19]. The initial structures were minimized using the semi- empirical program MOPAC 6.0 at the AM1 Hamilto-nian[20]. The Mullicken population analysis atomic charges at the AM1 level (AM1-Mul) were considered in the present work. Molecular alignment was carried out using the SYBYL Multifit option. The SYBYL spring constant value was 20 and compound SQ15, one of the most active structures, was used as the alignment template shown in Fig 1.
Fig 1. The compound SQ15 used as a template for molecular alignment (the bold face shows the core structure used to perform molecular alignment) and stereoview of all aligned compounds.
The CoMFA region was defined to extend 5 Å beyond the van der Waals radii of the assembly of superimposed molecules along each of the principal axes of a Cartesian coordinate system. The grid spacing was set to 2Å in all x, y, and z directions. The default sp3 carbon atom with a formal charge of +1|e| was selected as the probe atom. In CoMFA calculation, steric (Lennard- Jones 6-12 potential) and electrostatic (Coulombic potential) field were generated with a distance-dependent dielectric at each lattice point and the maximum values were truncated to 30 kcal/mol. Additionally, H-bonding fields were included in the CoMFA study. Once an H-bond CoMFA column has been created, the acceptor and donor components are nominally a "steric" field type component and an "electrostatic" field type component, respectively.
The correlation of the CoMFA descriptor with corresponding pIC50 was evaluated by the PLS regression analysis module[21] in SYBYL. To choose the optimum number of principal components and check the statistical significance of the models, cross-validations were performed by means of the Leave-one-out (LOO) procedure in the enhanced version of PLS. The optimum number of components for the training set was determined by the highest cross-validated (q2) and the smallest predictive error sum of squares (PRESS). Finally, the PLS analysis of the predictive 3D-QSAR models was repeated without cross-validation for the optimum component of the training set. Conventional correlation coefficient (r2) and its standard error (s) were computed, and the CoMFA coefficient contour plots were generated. CoMFA columns (ie, values at a given lattice point) were filtered with the cutoff of 2.0 kcal/mol to speed up computation. For the final model to be used to predict the test set, PLS fit was done without filtering.
Molecular docking To further understand the binding mode between styrylquinoline derivatives and HIV-1 IN, these compounds were docked to the core domain of IN by the AutoDock3.0 program[22]. The structure of the IN core domain was taken from the Brookhaven Protein Databank (PDB ID: 1QS4)[23] and the A-chain residues of 1QS4 were remained to dock. The missing residues (141-144) were inserted by using the loop search algorithm in SYBYL BIOPOLYMER module. The polar hydrogen atoms and Amber atom charges of A-chain protein were added with SYBYL and the active site magnesium ion was maintained throughout the entire process (with a charge of +2). For ligands, partial atomic charges were added by AM1-Mul charges method with SYBYL.
In docking simulation, the dimensions of the grids were thus 30 Å×30 Å×30 Å with a spacing of 0.375 Å and the center close to OD2 atom of Asp 64. Docking was run using the Lamarckian genetic algorithm and a standard protocol, with a maximum number of energy evaluations of 1.5×106 per run, and the other run parameters were maintained at their default settings. There were 20 runs for each system using the optimized run parameters. The results less than 1.0 Å in positional root-mean-square deviation (rmsd) were clustered together and represented by the result with the most favorable free energy of binding.
RESULTS AND DISCUSSION
CoMFA models CoMFA is the versatile and powerful technique for studying the quantitative structure-activity relationship at 3D level (3D-QSAR). It has been successfully applied to guide the design of bioactive molecules[24,25].
Tab 1 lists the PLS results of the CoMFA model using different field combinations. For CoMFA models, the q2 value changed from 0.597 to 0.621 by adding the H-bonding fields. After region focusing, the q2 value of the CoMFA model with the steric, electrostatic, and H-bonding fields is distinctly increased (q2=0.696) because the weights for those meaningful grid points are increased and the noise is decreased. Consequently, the CoMFA 2 model with the steric, electrostatic and H-bonding fields after region focusing was selected as the best CoMFA model. The non-cross-validated PLS analysis yielded the model with r2 of 0.754 and s of 0.426. Moreover, the best CoMFA model was applied to predict the activities of ten molecules in the test set (r2test=0.633, stest=0.633). Fig 2 shows the prediction results for the test set with the best CoMFA models.
Tab 1. PLS statistic data of CoMFA modelsa obtained from different fields.
| |
CoMFA 1 |
CoMFA 2 |
CoMFA 3b |
|
|
Steric+Ele |
Steric+Elec+ |
Steric +Elec+ |
| |
|
H-bond |
H-bond |
| q2 |
0.597 |
0.621 |
0.696 |
|
PRESS |
0.556 |
0.528 |
0.473 |
|
r2 |
0.765 |
0.725 |
0.754 |
|
s |
0.425 |
0.450 |
0.426 |
|
F |
40.617 |
68.511 |
79.657 |
|
Components |
2 |
2 |
1 |
|
Fraction |
|
|
|
|
Steric |
0.410 |
0.186 |
0.173 |
|
Electrostatic |
0.590 |
0.177 |
0.241 |
|
Acceptor |
|
0.344 |
0.294 |
| Donor |
|
0.294 |
0.291 |
a Using AM1_Mul Partial Charges
and Grid spacing: 1.5 Å.
b CoMFA model generated after region focus. c Ele is the electrostatic
field.
Fig 2. Plots of the predictions of the best CoMFA model for the test-set compounds.
CoMFA contour maps Steric, electrostatic and H-bonding features of the CoMFA model for integration data were shown in Fig 3. These maps showed the changes of molecular fields were associated with differences of biological activity. For reference, SQ15 was displayed inside the field. The model suggested that activity would be favored by the presence of a bulky group near the volume colored green (around the carboxyl group at C-7 in quinoline ring and two hydroxyl groups at C-3' and C-4' in the ancillary phenyl ring), while steric restriction near those colored yellow (around C-8 position). Similarly, positive charge near the blue regions (eg, the region near the H atom in C-7 carboxyl group, N atom of quinoline, and C-3' position of phenyl ring) and negative charge near the red regions (eg, near the carboxyl oxygen atom at C-7) would favor increased activity (Fig 3A). This means that the groups or atoms with positive charge in the blue regions and negative charges in the red regions are very important to ligand binding. For instance, when the C-7 hydrogen atom in compound SQ27 is substituted with carboxylic group to give compound SQ2, inhibitory potency will be improved more than 20-fold. The compounds of SQ24 and SQ25 with a bulky group at C-8 position exhibit a poor antiviral activity (pIC50=4.00).
Fig 3. The CoMFA models for styrylquinoline derivatives. A) The contour plot of the CoMFA steric and electrostatic fields. B) The contour plot of the CoMFA H-bonding fields.
The H-bonding field contributions were shown in Fig 3B. From the fraction of field, the H-bonding acceptor field was slightly more important than the other fields. The contour indicated that more H-bonding acceptor groups in the green region increased potency, whereas more H-bonding acceptor groups in the yellow regions (near hydroxyl groups at C-3', C-4', C-5' and C-8) decreased potency. The H-binding donor favorable areas were marked blue color, while the H-binding donor unfavorable areas were labeled red. The blue region was near the OH of C-7 carboxyl group, namely, the OH might form the stable H-bond with some residues in protein. For example, the methylation of the carboxylic group in compound SQ12 to give compound SQ20 can make inhibitory potency drop 50-fold.
The CoMFA results described similarities and differences between these compounds and correlated them with changes in the antiviral activity. The results reflected structural requirements for biological activity from the inhibitors, hence we further explore the binding mode between styrylquinolines and IN.
Binding mode prediction of styrylquinolines / IN Most of the compounds possess various sites of possible protonation. For instance, SQ12 has protonation sites at N and at the oxygen atoms of C-7, C-8, C-3', and C-4'. As the two pKa values of catechol in aqueous solutions are 9.45 and 12.8 (the second one is questionable)[26], catechol is not ionized in the reaction buffer, and the chemical formula of the catechol moiety of the drugs is shown in Tab 2. In contrast, o-hydroxybenzoic acid has two pKa values 2.98 (CO2H/CO2-) and 12.38 (OH/O-)[27]. According to the spectroscopic studies of Bardez et al[28] on 8-hydroxyquinoline, the proton belongs to the hydroxyl group in the ground state but migrates to N in the excited state. The ground-state pKa of 8-hydroxyquinoline in aqueous solutions at 20 ºC are 5.13 (NH+/N) and 9.89 (OH/O-)[29]. As a consequence, compounds from SQ1 to SQ20 probably have a net charge of -1, SQ38 compound has a charge of -2, and the other has a zero charge. In our work, the binding mode of styrylquinoline compounds with various protonation and integrase were analyzed by AutoDock program.
Tab 2. Structures, HIV-1 IN inhibitory activity, and docking results of styrylquinoline derivatives.
DGb is the estimated binding free energy of the cluster results for the best binding site and is given in kcal/mol; Rank is the scoring of the cluster in 20 docked structures; N is the number of the structures involved in the cluster.
As labeled in Fig 4a, the well-clustered docking results were obtained for most of investigated ligands (SQ1-SQ22, SQ30-SQ38). The docking of these compounds produced the two clusters occupying two separate but overlapping binding regions. The two clusters occupy a narrow binding region formed by Asp64, Ser66, His67, Leu68, Gly70, Asp116, Gln148, Gly149, Glu152, Asn155, Lys156, and Lys159. In the present work, the styrylquinoline inhibitors were different from the cinnamoyl inhibitors studied by Buolamwini et al[30]. The styrylquinoline compounds lie in a nearby coordination to magnesium cation. As shown in Fig 4B, compound SQ15 as a template was displayed in the binding site. The oxygen atoms of C-7 carboxyl group and C-8 hydroxyl group may coordinate with Mg2+ ion within the enzyme catalytic site. This docking mode was the same as the experiment report given by Zouhiri et al[12]. Additionally, the estimated docking energies, ranks and the number of the structures involved in the clusters were calculated by AutoDock and listed in Tab 2. The estimated docking energy correlated well with the biological activity pIC50 in a general sense, and compounds with the higher activity showed the lower AutoDock energy scores (Tab 2). This correlation was more pronounced for molecules that differed mostly by the presence of a carboxyl group at C-7 and a hydroxyl group at C-8, and their absence or methylation. Consequently, we have proposed the two orientations to be probably the best binding mode for the styrylquinoline inhibitors at the IN active site. The docking results together with the 3D-QSAR contour plots clearly demonstrate that the presence of a carboxyl group at C-7, a hydroxyl at C-8 and an aromatic ring is required for the biological activity and the binding mode of these compounds. Additionally, we suggest that adding H-bonding acceptors at C-3 in the quinoline ring would form H-bonds with the surface residues in IN to improve biological activities of styrylquinoline inhibitors.
Fig 4. The binding mode of styrylquinoline derivatives to IN. A) The docking cluster of styrylquinoline compounds in the IN active site. B) Superimposition of one docked structure (SQ15) over physicochemical properties of the IN active site.
CONCLUSION
In this study, the CoMFA models show reasonably predictability to estimate the biological activity of the styrylquinoline compounds. The CoMFA contour maps also visualize structural similarities and differences among the styrylquinoline compounds and show that the functional group and region will affect the antiviral activity of the inhibitors in three-dimensional space. On the other hand, molecular docking simulation shows one potential binding mode of these compounds to IN. As shown in docking results, a carboxyl group at C-7 and a hydroxyl group at C-8 in the quinoline subunit interact with the divalent metal cofactor within the IN catalytic site. The binding mode is highly complementary to the analysis from the experiment. Results from the present study demonstrate the utility of CoMFA 3D-QSAR models and molecular docking as predictors of inhibitor-IN binding affinity. Computer screening base on the results is in progress. In addition, in order to get a more native-like binding mode of these inhibitors with IN, the minimization of docking structures and the CoMFA study with the optimized docking alignment is currently underway.
ACKNOWLEDGMENT We thank Prof Olson for providing us the AutoDock program. We also thank Dr Ting-jun HOU and Yi-hua WANG for helpful suggestions.