Pharmacophore an International Research Journal
Pharmacophore
Submit Manuscript
Open Access | Published: 2026 - Issue 1

Self-Supervised Molecular Models for P-Glycoprotein Substrate Prediction Using Transporter Assay Data Download PDF


, ,
  1. Department of Pharmaceutical Informatics and AI, Faculty of Pharmacy, University of Copenhagen, Copenhagen, Denmark.
  2. Department of Computational Pharmacology, Faculty of Engineering, Technical University of Denmark, Lyngby, Denmark.
Abstract

P-glycoprotein efflux can strongly constrain oral absorption, brain penetration, and intracellular drug exposure. Computational substrate prediction is therefore an important early filter for molecules likely to face transporter-mediated disposition liabilities. Most transporter models rely on limited labeled assay data and are often trained directly on endpoint-specific measurements. This ignores the broader chemical information contained in large collections of unlabeled molecular structures. This MDL article proposes a self-supervised molecular model for P-glycoprotein substrate prediction. The model pre-trains on large unlabeled chemical databases and is then adapted to a limited set of validated transporter assay labels. A molecular encoder would be pre-trained using contrastive and masked-structure objectives over graph or SMILES representations. The pre-trained encoder would then be coupled to a lightweight classifier for binary substrate prediction using curated P-glycoprotein assay labels.
Conceptually, the self-supervised model would be expected to offer better data efficiency than a model trained only from limited labeled transporter data. Attribution methods could also highlight molecular features associated with P-glycoprotein recognition. Self-supervised molecular learning could make transporter prediction more accessible when labeled assay data are scarce. This approach may support earlier design of molecules with more favorable absorption and distribution profiles.

Cite this article
Vancouver
Andersen T, Nielsen L, Sørensen M. Self-Supervised Molecular Models for P-Glycoprotein Substrate Prediction Using Transporter Assay Data. Pharmacophore. 2026;17(1):91-100. https://doi.org/10.51847/SLmtXHWnZI
APA
Andersen, T., Nielsen, L., & Sørensen, M. (2026). Self-Supervised Molecular Models for P-Glycoprotein Substrate Prediction Using Transporter Assay Data. Pharmacophore, 17(1), 91-100. https://doi.org/10.51847/SLmtXHWnZI

Related articles:
Most viewed articles:
QR code:

Short Link:
Views: 65

Downloads: 24
Quick Access

Associations

Pharmacophore
ISSN: 2229-5402

Copyright © 2026 Pharmacophore. Authors retain copyright of their article if they are accepted for publication.
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.