APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS

Said Moshawih; Long Chiau Ming; Nurolaini Kifli; Hui Poh Goh

Konferans Bildirisi

APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS

Yıl 2023, Cilt: 27 Sayı: Current Research Topıcs In Pharmacy: Pharmacology Debates, 3 - 5, 28.06.2025

Said Moshawih Long Chiau Ming , Nurolaini Kifli Hui Poh Goh

Öz

Drug discovery using advanced computational tools such as machine learning has succeeded in reducing about 40% and 60% of the time and costs required by conventional drug discovery pipelines respectively. In this study we aim at building a combinatorial library of anthraquinone and chalcone derivative and producing a workflow of different screening and scoring methodologies to find hits against cancerrelated proteins, and examine them using molecular dynamic and mechanics simulations. A combinatorial library, consisting of virtual compounds, was synthesized using 20 anthraquinone and 24 chalcone core structures via R-group enumeration methodology. The resulting compounds were optimized to the near drug-likeness properties and the physicochemical descriptors were calculated for all datasets and compared with commercially available databases such as FDA, Non-FDA, and natural products (NPs) datasets from ZINC 15. A workflow of a novel virtual screening and scoring methods was optimized based on the nature of the protein target. As a result; the optimized enumeration resulted in 1,610,268 compounds with NP-Likeness, and synthetic feasibility mean scores close to FDA, Non-FDA, and NPs datasets. The cheminformatic analysis illustrated an overlap between the chemical space of the generated library was more prominent with NPs with the lowest molecular diversity compared with other natural and synthetic drugs databases. Moreover, the consensus scoring methodology that we produced was based on quantitative structure-activity relationship, pharmacophore fitness, shape similarity, and docking scores. The optimized virtual screening for the protein targets was found to be beneficial in the retrospective enrichment studies, as it prioritized true positives in high percentage (ROC curve > 0.9). Compared to all other conventional screening methods individually, consensus scoring outperformed them. It was also found that this method of multistage virtual screening overcome challenges in the training set such as limited number of data points and limited diversity of activity. In molecular mechanic simulations, the range of activity of the experimental datasets plays a crucial role in the nature of the correlation between experimental activity values and binding free energy obtained by MM/GBSA calculations. In conclusion, consensus scoring using z-score fusion method is a beneficial way of virtual screening especially when the training dataset is imbalanced.

Anahtar Kelimeler

Cheminformatics, virtual screening, machine learning, drug design, consensus scoring

Kaynakça

[1] Moshawih S, Goh HP, Kifli N, Idris AC, Yassin H, Kotra V, Goh KW, Liew KB, Ming LC. Synergy between machine learning and natural products cheminformatics: Application to the lead discovery of anthraquinone derivatives. Chem Biol Drug Des. 2022;100(2):185-217. [CrossRef]
[2] Moshawih S, Lim AF, Ardianto C, Goh KW, Kifli N, Goh HP, Jarrar Q, Ming LC. Target-Based Small Molecule Drug Discovery for Colorectal Cancer: A Review of Molecular Pathways and In Silico Studies. Biomolecules. 2022;12(7):878. [CrossRef]
[3] Moshawih S, Hadikhani P, Fatima A, Goh HP, Kifli N, Kotra V, Goh KW, Ming LC. Comparative analysis of an anthraquinone and chalcone derivatives-based virtual combinatorial library. A cheminformatics "proof-of-concept" study. J Mol Graph Model. 2022;117:108307. [CrossRef]

Yıl 2023, Cilt: 27 Sayı: Current Research Topıcs In Pharmacy: Pharmacology Debates, 3 - 5, 28.06.2025

Said Moshawih Long Chiau Ming , Nurolaini Kifli Hui Poh Goh

Öz

Kaynakça

[1] Moshawih S, Goh HP, Kifli N, Idris AC, Yassin H, Kotra V, Goh KW, Liew KB, Ming LC. Synergy between machine learning and natural products cheminformatics: Application to the lead discovery of anthraquinone derivatives. Chem Biol Drug Des. 2022;100(2):185-217. [CrossRef]
[2] Moshawih S, Lim AF, Ardianto C, Goh KW, Kifli N, Goh HP, Jarrar Q, Ming LC. Target-Based Small Molecule Drug Discovery for Colorectal Cancer: A Review of Molecular Pathways and In Silico Studies. Biomolecules. 2022;12(7):878. [CrossRef]
[3] Moshawih S, Hadikhani P, Fatima A, Goh HP, Kifli N, Kotra V, Goh KW, Ming LC. Comparative analysis of an anthraquinone and chalcone derivatives-based virtual combinatorial library. A cheminformatics "proof-of-concept" study. J Mol Graph Model. 2022;117:108307. [CrossRef]

Toplam 3 adet kaynakça vardır.

Ayrıntılar

Birincil Dil	İngilizce
Konular	Eczacılık ve İlaç Bilimleri (Diğer)
Bölüm	Reviews
Yazarlar	Said Moshawih 0000-0003-4840-0460 Long Chiau Ming 0000-0002-6971-1383 Nurolaini Kifli 0000-0002-8817-7956 Hui Poh Goh 0000-0002-0480-399X
Yayımlanma Tarihi	28 Haziran 2025
Yayımlandığı Sayı	Yıl 2023 Cilt: 27 Sayı: Current Research Topıcs In Pharmacy: Pharmacology Debates

Kaynak Göster

APA	Moshawih, S., Ming, L. C., Kifli, N., Goh, H. P. (2025). APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS. Journal of Research in Pharmacy, 27(Current Research Topıcs In Pharmacy: Pharmacology Debates), 3-5.
AMA	Moshawih S, Ming LC, Kifli N, Goh HP. APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS. J. Res. Pharm. Temmuz 2025;27(Current Research Topıcs In Pharmacy: Pharmacology Debates):3-5.
Chicago	Moshawih, Said, Long Chiau Ming, Nurolaini Kifli, ve Hui Poh Goh. “APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS”. Journal of Research in Pharmacy 27, sy. Current Research Topıcs In Pharmacy: Pharmacology Debates (Temmuz 2025): 3-5.
EndNote	Moshawih S, Ming LC, Kifli N, Goh HP (01 Temmuz 2025) APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS. Journal of Research in Pharmacy 27 Current Research Topıcs In Pharmacy: Pharmacology Debates 3–5.
IEEE	S. Moshawih, L. C. Ming, N. Kifli, ve H. P. Goh, “APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS”, J. Res. Pharm., c. 27, sy. Current Research Topıcs In Pharmacy: Pharmacology Debates, ss. 3–5, 2025.
ISNAD	Moshawih, Said vd. “APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS”. Journal of Research in Pharmacy 27/Current Research Topıcs In Pharmacy: Pharmacology Debates (Temmuz 2025), 3-5.
JAMA	Moshawih S, Ming LC, Kifli N, Goh HP. APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS. J. Res. Pharm. 2025;27:3–5.
MLA	Moshawih, Said vd. “APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS”. Journal of Research in Pharmacy, c. 27, sy. Current Research Topıcs In Pharmacy: Pharmacology Debates, 2025, ss. 3-5.
Vancouver	Moshawih S, Ming LC, Kifli N, Goh HP. APPLICATIONS OF MACHINE LEARNING FROM CONSTRUCTING THE DATABASE TO THE LEAD DISCOVERY: PRACTICAL APPROACHES ON CANCER-RELATED PROTEINS. J. Res. Pharm. 2025;27(Current Research Topıcs In Pharmacy: Pharmacology Debates):3-5.