ABSTRACT
Objectives:
The present study aimed to establish significant and validated quantitative structure-activity relationship (QSAR) models for neuraminidase inhibitors and correlate their physicochemical, steric, and electrostatic properties with their anti-influenza activity.
Materials and Methods:
We have developed and validated 2D and 3D QSAR models by using multiple linear regression, partial least square regression, and k-nearest neighbor-molecular field analysis methods.
Results:
2D QSAR models had q2: 0.950 and pred_r2: 0.877 and 3D QSAR models had q2: 0.899 and pred_r2: 0.957. These results showed that the models werere predictive.
Conclusion:
Parameters such as hydrogen count and hydrophilicity were involved in 2D QSAR models. The 3D QSAR study revealed that steric and hydrophobic descriptors were negatively contributed to neuraminidase inhibitory activity. The results of this study could be used as platform for design of better anti-influenza drugs.
INTRODUCTION
Quantitative structure-activity relationship (QSAR) is a technique of indirect drug designing. It is a method of quantification of the relationship of structure with biological activities of a set of molecules having common parent structure and useful in lead optimization. Magnitudes of particular physical properties are considered in classical QSAR. Steric, electrostatic, and hydrophobic properties are covered in 3D QSAR. The objective of the current study was to utilize the reported biological data of a series of anti-influenza compounds to develop predictive QSAR models and to explore the relationship between the ligand properties and biological activity.
Influenza virus is the causative agent for the contagious respiratory infectious disease influenza. Influenza A, B, and C are the three types of flu virus. The worst influenza pandemic occurred in 1918, and it caused 40-100 million deaths worldwide.1 Recently new subtypes such as H7N7 and H7N2, H9N2, and H7N9 have also been identified to cause human infection.2,3,4 The four major classes of anti-influenza drugs available now are inhibitors of hemagglutinin, M2 ion channel blockers, inhibitors of viral RNA polymerase, and inhibitors of neuraminidase.5
The number of QSAR studies6,7,8,9,10,11,12 has been reported for various classes of influenza inhibitors. In the current study, we had selected thiazolidine-4-carboxylic acid derivatives to provide structural insight responsible for selectivity of these derivatives toward influenza by QSAR analysis. Due to their high structural diversity and broad-type biological activity, these compounds were selected for the present study. The developed 2D and 3D QSAR models could be used to design new anti-influenza compounds.
MATERIALS AND METHODS
CS Chem Office 2004 (Cambridge Soft Corp., Cambridge, USA) and Vlife MDS 4.3 (VLife Sci. Tech. Lim, Pune, India) are the modeling software used in the present study. The neuraminidase inhibition activity (pIC50: -log10 IC50) of 28 thiazolidine-4-carboxylic acid derivatives (Table 1) was taken from the research reported by Liu et al.13
RESULTS AND DISCUSSION
CONCLUSION
Significant and predictive QSAR models were developed for thiazolidine neuraminidase inhibitor. 2D QSAR model evidenced the influence of structural properties and neuraminidase inhibitory activity of thiazolidines. The engendered 3D QSAR contour maps evidenced the influence of ligand features on the enzyme neuraminidase. It is concluded that modifications in the structure of thiazolidines based on the information obtained from the present study could lead to new thiazolidines with potent neuraminidase inhibitory activity. Further in silico tests, such as molecular docking, and kinetic and dynamic studies can be carried out for a better understanding of the mechanism of action. The field is also further open for designing, synthesis, and biological evaluation of potent anti-influenza virus compounds, pharmacokinetic studies, and clinical studies to establish those molecules as drug.
Energy minimization
Using CS Chem Office, the structure of thiazolidines was sketched by, and the 3D structure optimization was done in Vlife MDS by following the method reported by Veerasamy et al.14 Merck molecular force field energy minimized stable structure of individual compound was stored as Sybyl.mol2 files and used to compute various 2D independent descriptors.
2D QSAR analyses
An auto-scaling method was used to reduce the number of descriptors to 200.14 The data set was split into training set and prediction set by adopting the sphere exclusion (dissimilarity values 2 and 2.5) and random selection methods (10 trials, 70%, 75%, 80%, and 85%).15 2D QSAR equations using multiple linear regression (MLR) and partial least squares regression (PLS) methods were built as per the method reported by Veerasamy et al.14
Model quality and validation
The methods compiled by Veerasamy et al.14 were used to check the model quality and validation of the QSAR models.16
3D QSAR analyses
Compound 23 was used as a scaffold to align the molecules using template alignment method. The method reported by Veerasamy et al.14 was used to generate rectangular grid around the aligned molecules. The selected field descriptors were electrostatic, steric, and hydrophobic. For the electrostatic and steric field, 10.0 and 30.0 kcal/mole were used as the cutoff values.14
“A methyl probe of charge +1 at the lattice points of the grid was used to compute steric, electrostatic and hydrophobic interaction energies”.14 The k-nearest neighbor (kNN)-molecular field analysis and PLS methods with each one of the following variable selection methods (stepwise forward-backward) or (genetic algorithm) or simulated annealing) were used to generate 3D QSAR models. The variable selection methods were discussed somewhere else.14 The methods compiled by Veerasamy et al.14 were used to validate the 3D QSAR models.16
2D QSAR
A data set of 28 thiazolidines and their influenza neuraminidase inhibitory (pIC50) activity in Table 1 was utilized in the present in silico study. Two of the best and significant models obtained by using various feature selection and development methods were equations (1) and (2). The used criteria were 80% random selection, stepwise forward-backward variable selection, and MLR.
pIC50: 2.704+0.157 (±0.004) hydrogen count - 6.833 (±0.477) SK average
Hydrophilicity + 0.314 (±0.047) SsssNE-index - 0.279 (±0.050) SsCH3E-index equation (1)
Test set compounds: 1, 6, 16, 21, 26, 27
n=22, r2: 0.968, r2 se: 0.132, q2: 0.950, q2 se: 0.165, F4,17: 128.955, pred r2: 0.877, pred_r2 se: 0.308, Z score r2: 6.941, Z score Q2: 4.150, best rand R2: 0.518, best rand Q2: 0.158
pIC50: 3.139+1.0825 T_N_N_3 - 0.1862 SsCH3E-index + 0.1566 hydrogens count equation (2)
Test set compounds: 3, 11, 16, 19, 22, 27
n=22, r2 = 0.945, r2 se: 0.161 q2: 0.923, q2 se: 0.190, F3,18: 164.005, pred r2: 0.908, pred_r2 se: 0.281, Z score r2: 8.915, Z score Q2: 7.203, best rand R2: 0.526, best rand Q2: 0.340
Equation (1) could explain 96.8% and predict 87.7%, and equation (2) could explain 94.5% and predict 90.8% of the variance of the influenza virus neuraminidase inhibitory data. Thus, the selected good model was equation (1). The absence of intercorrelation between the descriptors was also observed. The parameters (hydrogen count, SK average hydrophilicity, SsssNE-index, SsCH3E-index) were involved, and the calculated influenza virus neuraminidase inhibitory activity by equation (1) is given in Table 2. The correlation between descriptors in 2D QSAR model equation (1) is given in Table 3. The correlation of experimental and predicted activities is graphically represented in Figure 1.
The good internal prediction of selected model was exhibited by q2: 0.950, and the external prediction power was also confirmed by pred r2: 0.877 (pred r2 >0.6). The low randomized r2 (0.118) and q2 (0.158) values confirmed the robustness of the model, and the results were not due to a chance correlation.
The positive contribution of descriptor hydrogen count in the selected model clearly suggests that influenza virus neuraminidase inhibitory activity could be increased with an increase in the number of hydrogen atoms in a compound. The SK average hydrophilicity is influencing the activity variation and is indirectly proportional to the activity. The SK average hydrophilicity reveals the importance of average hydrophilic value on the Van der Waals surface. The SsssNE-index has positive effect on the activity. It is an electrotopological state index descriptor. The SsssNE-index highlights the significance of the number of nitrogen atoms connected with three single bonds in a molecule. The SsCH3E-index has a positive effect on the activity, and it shows the importance of the number of -CH3 groups connected with one single bond in a molecule.
3D QSAR
The criteria used were 80% random training and test selection method, stepwise forward-backward variable selection, and kNN method.
pIC50: S_775 (-0.229-0.163); H_582 (-0.056-0.011) equation (3)
Test set compounds: 1, 3, 10, 11, 26
k-nearest neighbour: 4
n=22, q2: 0.896, q2 se: 0.232, pred r2: 0.931, pred_r2 se: 0.182
pIC50: H_682 (-0.060-0.016); S_775 (-0.229-0.128) equation (4)
Test set compounds: 1, 3, 10, 11, 26
k-nearest neighbour: 4
n=23, q2: 0.899, q2 se: 0.224, pred r2: 0.957, pred_r2 se: 0.153
Equation (3) could predict 93.1%, and equation (4) could predict 95.7% of the variance of the influenza virus neuraminidase inhibitory data. Thus, the selected good model was equation (4). The parameters involved in the selected model (steric and hydrophobic) and the calculated influenza virus neuraminidase inhibitory activity by equation (4) are given in Table 4 and correlation in Table 5. Figure 2 shows the contribution plot for steric and hydrophobic interactions in lattice. The good internal prediction of the model was confirmed by q2: 0.899. The external prediction power of the model was confirmed by pred r2: 0.957 (pred r2 >0.6).
Hydrophobic descriptors like H_682 with a negative range around the chemical structure of neuraminidase inhibitor indicate that more hydrophobicity is not favorable on those sites for the influenza virus neuraminidase inhibitory activity of the compounds. Steric descriptor like S_775 with a negative range around the chemical structure of neuraminidase inhibitor indicates that the bulky groups are not favorable on those sites for the influenza virus neuraminidase inhibitory activity. Figure 3 shows the plots of predicted vs. observed values of pIC50.