NEURAL NETWORK–BASED SEARCH FOR COX-2 ACTIVE LIGANDS FROM COXIB-LIKE AND SIMILAR COMPOUNDS
Liza Tybaco Billones1*, Alex Cerbito Gonzaga1
|
|
ABSTRACT
The development of novel non-steroidal anti-inflammatory drugs (NSAIDs) free of serious side effects remains an attractive area of research. The availability of hundreds of compounds with known inhibitory activity against COX-2, the intended enzyme target of most NSAIDs, provides an excellent opportunity to explore various quantitative structure-activity relationship models and apply them in the binary classification of compounds. In this work, an artificial neural network or neural net (NN) model was constructed on a dataset consisting of 1380 compounds and 184 attributes, i.e., molecular descriptors. A feedforward NN consisting of 63 input nodes, 1 hidden layer with 33 nodes, and trained on 80% of the dataset by a backpropagation algorithm, has learned after 200 training cycles to classify compounds as active or inactive against COX-2. It has excellent predictive performance (accuracy = 93.5%, AUC = 0.97) on the 20% test set. The neural net classified 875 newly designed variants of COX-2 selective inhibitors and 163 structurally related compounds as active against the COX-2 target. The top hits have superior (or at least comparable) binding affinities compared to the control and possess the desirable properties of an oral drug.
Keywords: Molecular descriptors, NSAID, COX-2 inhibitors, Neural network, Anti-inflammatory
Introduction
The long-established non-steroidal anti-inflammatory drugs (NSAIDs) such as ibuprofen, naproxen, and indomethacin possess adverse effects that include gastrointestinal bleeding, ulceration, and perforation. This is associated with the inhibition of the cytoprotective cyclooxygenase-1 (COX-1) enzyme [1] that is expressed in many tissues. COX-1 is a housekeeping enzyme [2] that mediates important physiological functions such as the production of the mucus lining for inner stomach protection and overall tissue homeostasis [1], and thus dubbed the “good COX”. On the other hand, the inducible isozyme COX-2 is expressed largely at the sites of inflammation and is responsible for the production of pro-inflammatory prostaglandins that cause pain, fever, and inflammation [3], and hence tagged the “bad COX”. Thus, drugs that would selectively inhibit the activity of COX-2 exert analgesic, antipyretic, and anti-inflammatory effects and the side effects associated with COX-1 inhibition [4, 5].
The newer so-called “coxibs” such as celecoxib [6], etoricoxib [7], and parecoxib [8] are remarkably selective to COX-2. But even this group of NSAIDs has an array of undesirable side effects that are specific to a particular drug [9, 10]. The later discovery of COX-2 expression in kidneys, brain, and spinal cord [11, 12] indicated that the prostaglandins derived from COX-2 have beneficial effects including renal function regulation [13] and ulcer healing [14]. For this reason, some selective NSAIDs may also present side effects such as cardiovascular and renal problems [15, 16] depending on the extent of COX-2 inhibition. Thus, the discovery of new classes of anti-inflammatory compounds with optimal COX-2 risk is still an active area of research.
The ligand-based method is an established approach in drug discovery that uncovers new active chemical entities using the structural information of known active compounds [17]. It utilizes quantitative models that relate the observed biological activity to structure-derived molecular properties. The experimental bioactivity can be measured with a continuous variable like IC50 (half maximal inhibitory concentration), EC50 (half maximal effective concentration), or MIC (minimum inhibitory concentration); or a nominal variable like compound classification, i.e., active or inactive. Several statistical techniques and machine learning methods can be applied in classification tasks such as logistic regression [18], discriminant analysis [19], random forest [20], and artificial neural network [21], among others. This work developed a neural network (NN) model that was applied to the Derivatives or “coxib”-like variants, and to the Similars, another set of structurally related compounds.
Materials and Methods
The compounds included in this study were collected from literature as described elsewhere [22, 23]. These published articles on COX inhibitors covered more than two decades of research. The compounds in the dataset were assigned to either the Active or Inactive group depending on their IC50 values, that is, Active if IC50 ≤ 10 μM and Inactive if IC50 >
10 μM.
A personal computer running on Microsoft Windows 7 Professional 64-bit OS using a 3.50-GHz IntelÒ CoreÔ i7-4770K processor with 8.00-GB RAM was used in the creation, manipulation, and optimization of the molecular structures and calculation of their properties. A machine with a MacOS Catalina operating system, 3.1-GHz Dual-Core Intel Core i7 processor, and 16-GB RAM was used in the data analysis and model construction. The chemical structures were drawn using ChemDraw Professional 16.0 (www.perkinelmerinformayics.com) tool, the geometry optimization at the semi-empirical PM3 level was carried out in Spartan 16 (www.wavefun.com), and the calculation of molecular descriptors was performed in both Discovery Studio (DS) version 4.0 (Biovia, Inc.) and Spartan 16. MS Excel was used in organizing the data and RapidMiner Studio 9.7.001 (www.rapidminer.com) in Neural Net modeling and data analyses [24]. The ADMET (absorption, distribution, metabolism, excretion, and toxicity) properties, QED (Quantitative Estimate of Druglikeness) scores, Synthetic Accessibility (SA) scores, and molecular docking data were obtained as described in our previous work [22, 23].
Two sets of compounds were generated where the hits would be determined. The set of Derivatives was composed of 1100 compounds created by performing bioisosteric replacement [25] at selected positions in 5 structures that represent the 5 largest families, i.e. cyclopentenes, imidazolyls, difluorobenzenes, furanyls/thiophenyls, and isoxazoles, of known COX-2 inhibitors [22, 23]. The similarities included the 600 top hits in a similarity search study done in SwissSimilarity (http://www.swisssimilarity.ch).
Results and Discussion
Data was first cleaned by removing duplicate compounds and descriptors that were essentially invariable or had numerous missing items. The final number of compounds was 1380 with 929 actives and 451 inactives, and the number of descriptors was 184. Subsequently, the 80-20 dataset split was applied putting aside 1104 (80%) compounds for model construction and 276 (20%) for model validation. The best NN model was then implemented in the Derivatives and the Similars.
Mimicking biological neural networks, artificial neural networks (ANN) or simply neural nets (NN) are composed of artificial neurons or nodes, which are interconnected by weights that resemble the biological synapses [26]. Excitatory connections are characterized by positive weights while the inhibitory synapses are reflections of negative weights [27]. A linear combination was formed that summed up all inputs, which are modified according to assigned or calculated weights. An activation function, which in many applications was a sigmoid function, was employed to limit the amplitude or keep the output within an acceptable range, i.e. scaled to 0 and 1, or -1 and +1. NN is an adaptive system that uses nonlinear data modeling of complex relationships between inputs and outputs like in this case, between molecular descriptors and compound classification, i.e., active or inactive.
This work applied multilayer perceptron (MLP) that develops a model utilizing a feedforward neural network and backpropagation learning technique [28]. It consists of at least three layers of nonlinearly-activating nodes that are fully connected in a feedforward way [29], that is, each node in one layer connects with a certain weight to every node in the following layer, moving in only one direction from the input nodes through the hidden nodes to the output nodes. The backpropagation algorithm is a supervised learning method involving two phases, propagation and weight update, that are repeated until a good network performance is reached [30]. This algorithm computes the value of some predefined error function by comparing the predicted outcomes with the observed values. The error is then fed back through the network by which the algorithm adjusts the weights such that the error diminishes. This process is repeated for a sufficiently large number of training cycles until this error reaches a certain minimum value and the network has learned a certain target function. In this work, the optimization was stopped when the error decreased to 1 × 10-4.
For hyperparameter tuning, the 80-20 data split was again applied to the dataset intended for model construction, so that a set of 883 compounds was used to build the models that were then tested on the remaining 221 compounds. Among the performance metrics, model specificity was considered also important in that a model should be able to identify potential dropouts and so avoid the costly high attrition rate at the later stages of the drug development. Figure 1 shows the best model constructed using 63 attributes, that allowed a maximum of 0.75 correlation coefficient among the variables. On the validation set, the corresponding best NN model had 96.4% accuracy, 97.3% sensitivity, and 94.4% specificity, outshining both the Multiple Logistic Regression [22] and Random Forest [23] models. Like in the RF study, the models with fewer input nodes, i.e., limited information, gave less accurate predictions. But the inclusion of more but very highly correlated descriptors (r > 0.75) did not further improve the model performance. The best NN model was also indicated with an excellent predictive performance by both ROC-AUC [31] and Cohen’s kappa coefficient [32] with high values of 0.977 and 0.918, respectively (Figure 1b).
|
|
a) |
b) |
Figure 1. Performance Scores of Neural Net models as a function of the correlation coefficient (r) among the independent variables: training cycle = 200, learning rate = 0.01, momentum = 0.9, number of hidden layers = 1; input data was shuffled before learning, the sigmoid function was used as activation function, and optimization was stopped at epsilon error below 1 × 10-4. |
Figure 2 shows that performance scores improve with an increase in the number of training cycles, until the 200th cycle. Setting the training cycle at 200, Figure 3 shows that performance scores fluctuate around the average values of 92.5% accuracy, 94.4% sensitivity, 88.6% specificity, 0.98 AUC, and 0.92 kappa. Nevertheless, a careful examination of the Figure unveiled the best-performing NN, that is the one with 33 nodes in the hidden layer. The effect of the size of the hidden layer was further explored, but more hidden layers beyond what is shown in the graph did not result in an improvement in the model performance.
|
|
a) |
b) |
Figure 2. Performance Scores of Neural Net models as a function of the number of training cycles: r < 0.75, learning rate = 0.01, momentum = 0.9, number of hidden layers = 1; input data was shuffled before learning, the sigmoid function was used as activation function, and optimization was stopped at epsilon error below 1 × 10-4. |
|
|
a) |
b) |
Figure 3. Performance Scores of Neural Net models as a function of the hidden layer size: r < 0.75, learning rate = 0.01, momentum = 0.9, number of hidden layers = 1; input data was shuffled before learning, the sigmoid function was used as activation function, and optimization was stopped at epsilon error below 1 × 10-4. |
Hence, the best NN model (Figure 4a) was that built from 63 input nodes with a single hidden layer of 33 nodes on 200 training runs. The model, as shown in Figure 4b, correctly predicted the classification of the compounds in the test set with 93.5% overall accuracy, 96.2% sensitivity (179 out of 186 actives), and 87.8% specificity (79 out of 90 inactives). The outstanding performance of the model was further demonstrated by the ROC curve that is very close to the top-left corner with the corresponding AUC value that is very close to 1 (0.97) indicating a very good measure of separability between active and inactive.
|
|
a) |
|
|
|
b) |
c) |
Figure 4. a) The best NN model constructed from 63 independent variables using 1 hidden layer with 33 nodes, the closer to 1 (red) the greater the confidence that the compound is active against COX-2; b) the NN model prediction of compound classification, i.e., active/inactive; c) the NN model ROC-AUC plot: training cycle = 200, learning rate = 0.01, momentum = 0.9; input data was shuffled before learning, the sigmoid function was used as activation function, and optimization was stopped at epsilon error below 1 × 10-4. |
When applied to the set of 1100 Derivatives, the NN model classified 875 as actives, i.e., orange to red data points in Figure 5. The average probability of being active 0.95 is slightly higher than the Multiple Logistic Regression (MLogR) model prediction (0.94) [22], and much higher than that of the RF model (0.69) [23]. Consistent with previous studies, nearly all (96%) cyclopentane derivatives are predicted as active.
|
Figure 5. The NN model prediction of compound classification of the Derivatives. The closer to 1 (red) the greater the confidence that the compound is active against COX-2, the closer to 0 (blue) the greater the confidence that the compound is inactive. |
On the Similars, the NN model classified 163 out of 600 (27%) compounds as actives (Figure 6). This is slightly fewer than the 188 compounds (33%) identified by the RF model [23]. However, the mean probability of being active (0.89) of compounds predicted active by NN active is remarkably greater than that in RF (0.61) and can be considered comparable to that of the MLogR (0.92). This is not surprising considering that the weight adjustments during NN modeling utilized a sigmoid activation function similar to the logistic function in MLogR modeling.
Predicted active compounds from the Derivatives and Similars got through additional screening filters such as measures of drug-likeness, ADMETox, and synthetic accessibility. The average Quantitative Estimate of Druglikeness [33] or QED score of active Derivatives was 0.67 and 71% (797) of them have a QED score above 0.5, i.e., druglike. Having an average synthetic accessibility (SA) score of 3.4 which is within the acceptable range (SA score < 6) [34], the chemical synthesis of the active Derivatives is highly feasible. Most of them have optimal aqueous solubility (68%) and intestinal absorption (98%), all are non-mutagens and non-inhibitors of CYP2D6, and mostly are non-carcinogens (90%), though most are hepatotoxic (98%). Likewise, with an average QED of 0.63, the majority (61%) of the predicted active Similars possess druglike properties, i.e., QED score above 0.5, and can be easily synthesized in the laboratory, i.e., all SA scores were less than 6. They are mostly soluble in biological fluid (72%), non-carcinogens (84%), non-mutagens (81%), non-inhibitors of CYP2D6 (92%), and have good intestinal absorption (93%).
|
Figure 6. The NN model prediction of compound classification of the Similars. The closer to 1 (red) the greater the confidence that the compound is active against COX-2, the closer to 0 (blue) the greater the confidence that the compound is inactive. |
The top hits satisfied the criteria, namely (a) PA > 0.7, (b) QED > 0.8, (c) 1 ≤ SAS ≤ 6, (d) 2 ≤ AS ≤ 4, (e) 0 ≤ IA ≤ 1, (f) Non-Carcinogen, (g) Non-Mutagen, and (h) CYP2D6 Non-inhibitor and were ranked according to PA or probability of being active. The top 20 hits, that is, 10 from each set, are listed in Table 1 along with their properties. The top 10 hits from Derivatives were also the same compounds found in the previous MLogR study [22]. However, only 3 (S269, S265, and S499) of the top 10 hits from Similars were featured in that previous account. Among the new top Similars identified here, the diarylmethylenecyclobutenes (S34, S35, S375) appeared to be highly promising with greater binding energies (-8.6, -9.0, -8.8 kcal/mol, respectively) than the control ligands. Aside from being non-carcinogens, non-mutagens, and non-inhibitors of the CYP2D6 enzyme, the top 20 hits have excellent PA, QED, and SA scores, which are much-desired characteristics of potential candidate compounds.
Table 1. Druglikeness, Synthetic Accessibility, and Binding Energy of the Top Hits.
Rank |
Compound |
PA |
QED |
SAS |
BE |
Derivatives |
|||||
1 |
|
1.000 |
0.916 |
3.52 |
-10.0 |
2 |
D251 |
1.000 |
0.937 |
4.11 |
-9.6 |
3 |
D56 |
1.000 |
0.931 |
2.98 |
-9.0 |
4 |
D58 |
1.000 |
0.911 |
3.04 |
-8.6 |
5 |
D73 |
1.000 |
0.905 |
2.91 |
-8.5 |
6 |
D53 |
1.000 |
0.916 |
2.94 |
-8.8 |
7 |
D62 |
1.000 |
0.921 |
2.87 |
-8.9 |
8 |
D51 |
1.000 |
0.942 |
4.18 |
-8.5 |
9 |
D94 |
1.000 |
0.926 |
2.83 |
-8.7 |
10 |
D17 |
0.995 |
0.912 |
2.89 |
-8.7 |
Similars |
|||||
1 |
S269 |
0.999 |
0.879 |
2.33 |
-8.1 |
2 |
S266 |
0.999 |
0.848 |
2.39 |
-8.0 |
3 |
S34 |
0.999 |
0.841 |
3.36 |
-8.6 |
4 |
S264 |
0.999 |
0.848 |
2.40 |
-8.3 |
5 |
S35 |
0.998 |
0.861 |
3.07 |
-9.0 |
6 |
S15 |
0.998 |
0.868 |
2.90 |
-6.2 |
7 |
S265 |
0.998 |
0.879 |
2.29 |
-8.3 |
8 |
S553 |
0.939 |
0.849 |
3.97 |
-6.1 |
9 |
S375 |
0.928 |
0.875 |
2.87 |
-8.8 |
10 |
S499 |
0.820 |
0.855 |
2.84 |
-7.7 |
D=Derivative, S=Similar; PA=Probability(Active); QED= Quantitative Estimate of Druglikeness, 1=most druglike; SAS (Synthetic Accessibility Score): 1=very easy to synthesize, 10=very difficult to synthesize, (1-6 are acceptable); BE=Binding energy (kcal/mol); Molecular docking control: Mefenamic acid (BE = –8.6 kcal/mol) [35], Etoricoxib (BE = –7.8 kcal/mol)
Conclusion
A neural network model was constructed on a dataset consisting of 1380 compounds that have been tested experimentally against COX-2 enzyme and were classified as Active (IC50 ≤ 10 μM) or Inactive (IC50 >
10 μM). Over 180 molecular descriptors served as the independent variables, and 63 of these attributes were used as input in training the NN. The best performing NN consisted of a single hidden layer with 33 nodes and has learned after 200 training cycles at a learning rate of 0.01 and momentum of 0.9. This NN is an outstanding binary classifier as indicated by an excellent classification accuracy of 93.5% in the test set and ROC-AUC of 0.97.
The NN model classified 875 out of 1100 newly designed compounds (Derivatives) and 188 out of 600 structurally related compounds (Similars) as active against the COX-2 enzyme with high probability. The top hits of 10 Derivatives and 10 Similars have excellent drug-likeness and toxicity profiles and can be synthesized without much difficulty. Except for two from the Similars, these hits have superior (or at least comparable) binding affinities with the drug target compared to the control ligands. These top compounds can be prioritized in any subsequent drug discovery endeavor geared towards the development of a new generation of COX-2-acting NSAIDs.
Acknowledgments: This work was fully supported by the Office of the Vice President of Academic Affairs, University of the Philippines System through the Faculty Reps Administrative Staff Development Program (FRASDP).
Conflict of interest: None
Financial support: None
Ethics statement: The study was supported by University of the Philippines Faculty REPS and Administrative Staff Development Program (FRASDP) (2019-0307-0057-3954).
1. Laine L, Takeuchi K, Tarnawski A. Gastric mucosal defense and cytoprotection: bench to bedside. Gastroenterology. 2008;135(1):41-60. doi:10.1053/j.gastro.2008.05.030
2. Gierse JK, Hauser SD, Creely DP, Koboldt C, Rangwala SH, Isakson PC, et al. Expression and selective inhibition of the constitutive and inducible forms of human cyclo-oxygenase. Biochem J. 1995;305(2):479-84. doi:10.1042/bj3050479
3. Seibert K, Zhang Y, Leahy K, Hauser S, Masferrer J, Perkins W, et al. Pharmacological and biochemical demonstration of the role of cyclooxygenase 2 in inflammation and pain. Proc Natl Acad Sci USA. 1994;91(25):12013-7.
4. Wey S, Augustyniak ME, Cochran ED, Ellis JL, Fang X, Garvey DS, et al. Structure-Based Design, Synthesis, and Biological Evaluation of Indomethacin Derivatives as Cyclooxygenase-2 Inhibiting Nitric Oxide Donors. J Med Chem. 2007;50(25):6367-82.
5. Kurumbail RG, Kiefer JR, Marnett LJ. Cyclooxygenase enzymes: catalysis and inhibition. Curr Opin Struct Biol. 2001;11(6):752-60. doi:10.1016/s0959-440x(01)00277-9
6. Penning TD, Talley JJ, Bertenshaw SR, Carter JS, Collins PW, Docter S, et al. Synthesis and biological evaluation of the 1,5-diarylpyrazole class of cyclooxygenase-2 inhibitors: identification of 4-[5-(4-methylphenyl)-3-(trifluoromethyl)-1H-pyrazol-1-yl] benzenesulfonamide (SC-58635, celecoxib). J Med Chem. 1997;40(9):1347-65.
7. Riendeau D, Percival MD, Brideau C, Charleson S, Dubé D, Ethier D, et al. Etoricoxib (MK-0663): preclinical profile and comparison with other agents that selectively inhibit cyclooxygenase-2. J Pharmacol Exp Ther. 2001;296(2):558-66.
8. Talley JJ, Bertenshaw SR, Brown DL, Carter JS, Graneto MJ, Kellog MS, et al. N-[[(5-methyl-3-phenylisoxazol-4-yl)-phenyl]sulfonyl]propanamide, sodium salt, parecoxib sodium: a potent and selective inhibitor of COX-2 for parenteral administration. J Med Chem. 2000;43(9):1661-3.
9. Bally M, Dendukuri N, Rich B, Nadeau L, Helin-Salmivaara A, Garbe E, et al. Risk of acute myocardial infarction with NSAIDs in real world use: bayesian meta-analysis of individual patient data. BMJ. 2017;357:j1909. doi:10.1136/bmj.j1909
10. Lanas A, Chan FK. Peptic ulcer disease. Lancet. 2017;390(10094):613-24. doi:10.1016/S0140-6736(16)32404-7
11. Smith WL, Garavito RM, DeWitt DL. Prostaglandin endoperoxide H synthase (cyclooxygenase)-1 and -2. J Biol Chem. 1996;271(52):33157-60.
12. Hoffmann C. COX-2 in brain and spinal cords Implications for therapeutic use. Curr Med Chem. 2000;7(11):1113-20.
13. Rodriguez F, Llinas MT, Gonzalez JD, Rivera J, Salazar FJ. Renal changes induced by a cyclooxygenase-2 inhibitor during normal and low sodium intake. Hypertension. 2000;36(2):276-81.
14. Tibble J, Sigthorsson G, Caldwell C, Palmer RH, Bjarnason I. Effects of NSAIDs on cryoprobe-induced gastric ulcer healing in rats. Aliment Pharmacol Ther. 2001;15(12):2001-8.
15. Zhang J, Ding EL, Song Y. Adverse effects of cyclooxygenase 2 inhibitors on renal and arrhythmia events: Meta-analysis of randomized trials. J Am Med Assoc. 2006;296(13):1619-32.
16. McGettigan P, Henry D. Cardiovascular risk and inhibition of cyclooxygenase: A systematic review of the observational studies of selective and nonselective inhibitors of cyclooxygenase 2. J Am Med Assoc. 2006;296(13):1633-44.
17. Bacilieri M, Moro S. Ligand-based drug design methodologies in drug discovery process: an overview. Curr Drug Discov Technol. 2006;3(3):155-65. doi:10.2174/1570016306780136781
18. Sperandei S. Understanding logistic regression analysis. Biochem Med (Zagreb). 2014;24(1):12-8. doi:10.11613/BM.2014.003
19. McLachlan GJ. Discriminant Analysis and Statistical Pattern Recognition. 2004, Wiley Interscience.
20. Ho TK. Random decision forests. InProceedings of 3rd international conference on document analysis and recognition 1995 Aug 14 (Vol. 1, pp. 278-282). IEEE.
21. Haykin S. Neural Networks and Learning Machines, 3rd edition, 2008.
22. Billones LT, Gonzaga AC. Multiple Logistic Regression Modeling of Compound Class as Active or Inactive Against COX-2 and Prediction on Designed Coxib Derivatives and Similar Compounds. Chem-Bio Inform J. 2022;22:63-87.
23. Billones LT, Gonzaga AC. Random Forest Modeling of Molecular Descriptors of COX-2 – Targeted Non-Steroidal Anti-inflammatory Drugs (NSAIDs). Pharmacophore. 2022;13(6).
24. Banu GM, Babikar T, Bashier I, Sasikala N. Implementation of Artificial Neural Networks and Decision Tree Algorithms for Heart Disease Diagnosis. Int J Pharm Res Allied Sci. 2019;8(2):110-4.
25. Barillari C, Brown N. Classical bioisosteres. In Bioisosteres in medicinal chemistry; Nathan, B. eds.; Wiley-VCH: Weinheim, 2012, pp15-29.
26. Hopfield JJ. Neural networks and physical systems with emergent collective computational abilities. Proc Natl Acad Sci USA. 1982;79(8):2554-8.
27. Mustafina S, Uspenskaya N, Smirnov D, Yashin D, Mustafina S, Larin O. Forecasting the Growth of Wheat Shoots based on Neural Networks. Entomol Appl Sci Lett. 2020:7(2):70-6.
28. RapidMiner Studio. Available from: https://rapidminer.com
29. Zell A. Simulation of Neural Networks (1st ed.). Addison-Wesley; 1994. p. 73.
30. Goodfellow I, Bengio Y, Courville A. Back-Propagation and other differentiation algorithms. In: Deep Learning. 2016, MIT Press. pp. 200-20.
31. Chan C. What is a ROC Curve and How to Interpret It. Displayr Blog. 2018 [cited 2022 Aug 24]. Available from: https://www.displayr.com/what-is-a-roc-curve-how-to-interpret-it/
32. Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas. 1960;20(1):37-46. doi:10.1177/001316446002000104
33. Bickerton GR, Paolini GV, Besnard J, Muresan S, Hopkins AL. Quantifying the chemical beauty of drugs. Nat Chem. 2012;4(2):90-8. doi:10.1038/nchem.1243
34. Ertl P, Schuffenhauer A. Estimation of synthetic accessibility score of drug-like molecules based on molecular complexity and fragment contributions. J Cheminform. 2009;1:8. doi:10.1186/1758-2946-1-8
35. Orlando BJ, Malkowski MG. Substrate-selective inhibition of cyclooxygeanse-2 by fenamic acid derivatives is dependent on peroxide tone. J Biol Chem. 2016;291(29):15069-81. doi:10.1074/jbc.M116.725713