<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD with MathML3 v1.3 20210610//EN" "JATS-archivearticle1-3-mathml3.dtd"><article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"
  dtd-version="1.3" xml:lang="en" article-type="research-article">
  <?DTDIdentifier.IdentifierValue -//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN?>
  <?DTDIdentifier.IdentifierType public?>
  <?SourceDTD.DTDName JATS-journalpublishing1.dtd?>
  <?SourceDTD.Version 1.2?>
  <?ConverterInfo.XSLTName jats2jats3.xsl?>
  <?ConverterInfo.Version 1?>
  <?properties open_access?>
  <front>
    <journal-meta>
      <journal-id journal-id-type="iso-abbrev">Pharmacophore</journal-id>
      <journal-id journal-id-type="publisher-id">pharmacophorejournal.com</journal-id>
      <journal-id journal-id-type="publisher-id">Pharmacophore</journal-id>
      <journal-title-group>
        <journal-title>Pharmacophore</journal-title>
      </journal-title-group>
      <issn pub-type="epub">2229-5402</issn>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="publisher-id">pharmacophorejournal.com-6853</article-id>
      <article-id pub-id-type="doi">10.51847/vwHtDaETbQ</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Original research</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Regulatory Text Mining System for Pharmaceutical Quality Risk Detection from Guidelines and Deviation Reports</article-title>
      </title-group>
                    <contrib-group>
                      <contrib contrib-type="author">
              <name>
                <surname>Martins</surname>
                <given-names>Bruno</given-names>
              </name>
                              <xref rid="aff1" ref-type="aff">1</xref>
                                                            <xref rid="cor1" ref-type="corresp" />
                          </contrib>
                      <contrib contrib-type="author">
              <name>
                <surname>Pereira</surname>
                <given-names>Lucas</given-names>
              </name>
                              <xref rid="aff1" ref-type="aff">1</xref>
                                        </contrib>
                      <contrib contrib-type="author">
              <name>
                <surname>Azevedo</surname>
                <given-names>Renata</given-names>
              </name>
                              <xref rid="aff2" ref-type="aff">2</xref>
                                        </contrib>
                      <contrib contrib-type="author">
              <name>
                <surname>Costa</surname>
                <given-names>Pedro</given-names>
              </name>
                              <xref rid="aff1" ref-type="aff">1</xref>
                                        </contrib>
                  </contrib-group>
                  <aff id="aff1">
            <label>1</label>Department of Computational Pharmacology, Faculty of Pharmacy, University of Minho, Braga, Portugal.
          </aff>
                  <aff id="aff2">
            <label>2</label>Department of Pharmaceutical Intelligence Systems, Faculty of Pharmacy, University of Porto, Porto, Portugal.
          </aff>
                          <author-notes>
            <corresp id="cor1">
              <bold>Address for correspondence:</bold> Prof. Wael Abu Dayyih, Department of
              Pharmaceutical Chemistry, Faculty of Pharmacy, Mutah University, Al-Karak 61710, Jordan.
                          </corresp>
          </author-notes>
                    <pub-date pub-type="epub">
        <day>28</day>
        <month>04</month>
        <year>2025</year>
      </pub-date>
      <volume>16</volume>
      <issue>2</issue>
      <fpage>32</fpage>
      <lpage>42</lpage>
      <permissions>
        <copyright-statement>
          Copyright: &#x000a9; 2026 Pharmacophore
        </copyright-statement>
        <copyright-year>2026</copyright-year>
        <license>
          <ali:license_ref xmlns:ali="http://www.niso.org/schemas/ali/1.0/"
            specific-use="textmining" content-type="ccbyncsalicense">
            https://creativecommons.org/licenses/by-nc-sa/4.0/</ali:license_ref>
          <license-p>This is an open access journal, and articles are distributed under the terms of
            the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 License, which allows
            others to remix, tweak, and build upon the work non-commercially, as long as appropriate
            credit is given and the new creations are licensed under the identical terms.</license-p>
        </license>
      </permissions>
      <abstract>
        <title>A<sc>BSTRACT</sc></title>
        <p>Pharmaceutical quality relies on the proactive detection of process, product, and compliance risks, yet critical signals are often hidden within unstructured sources such as regulatory guidance, deviation narratives, CAPA records, inspection observations, and other quality documents, which are typically reviewed in a fragmented manner. Traditional quality risk management depends heavily on manual document review and local expertise, making it challenging to identify recurring issues across sites, benchmark internal deviations against external regulatory expectations, or develop a comprehensive view of emerging risks. This article proposes an AI-powered regulatory text mining system that ingests regulatory guidelines, deviation reports, and CAPA records to extract risk entities and their relationships, link them to manufacturing processes, and build a queryable quality-risk knowledge graph. The framework integrates document ingestion, preprocessing, named entity recognition, relation extraction, transformer-based risk classification, knowledge graph construction, and dashboard-based decision support, with human verification to ensure interpretability, auditability, and compliance with regulatory standards. By converting scattered textual information into actionable quality-risk intelligence, the system enables quality teams to anticipate compliance gaps, prioritize CAPA activities, and respond more rapidly to evolving regulatory expectations, shifting pharmaceutical organizations from reactive documentation toward predictive, science-based quality oversight.</p>
      </abstract>
      <kwd-group>
                <kwd>Regulatory text mining</kwd>
                <kwd>Pharmaceutical quality</kwd>
                <kwd>Deviation reports</kwd>
                <kwd>CAPA</kwd>
                <kwd>Natural language processing</kwd>
                <kwd>Knowledge graph</kwd>
              </kwd-group>
    </article-meta>
  </front>
</article>