Journal article

Transforming medical regulations into numbers: vectorizing a decade of medical device regulatory shifts in the USA, EU, and China

Abstract:
Navigating the regulatory frameworks that ensure the safety and efficacy of medical devices can be challenging, especially across different regions. These frameworks often require redundant testing, which slows the delivery of innovations to patients. This study leverages Natural Language Processing (NLP) to analyze 664 regulations and guidelines issued in the USA, EU, and China over the past decade, comprising more than 200 million tokens (individual words and sub-word units processed by BERT’s tokenizer). We categorize regulations into key phases (such as animal studies, clinical trials, and other testing stages) and use Bidirectional Encoder Representations from Transformers (BERT) to perform Named Entity Recognition (NER), identifying key regulatory terms and entities. By converting these texts into numerical representations and segmenting them by phase, jurisdiction, and year, we compare jurisdictional requirements and assess their alignment. Additionally, we apply Latent Dirichlet Allocation (LDA) for theme analysis to observe changes in regulatory focus over time, reflecting evolving priorities and challenges. Our analysis reveals notable semantic similarities and differences between jurisdictions and phases. For instance, the closest alignment in animal study regulations is between China and the USA, with a mean cosine distance of 0.33. These findings highlight the potential of computational methods in regulatory science, offering valuable insights for researchers, policymakers, and industry professionals.
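To make the "mean cosine distance" figure in the abstract concrete, the following is a minimal sketch of how such a value could be computed from two groups of document vectors. It assumes the mean is taken over every cross-jurisdiction pair of vectors; the abstract does not state the exact aggregation, so this is an illustration and not the authors' procedure. The function names and the toy vectors are hypothetical.

```python
import math
from typing import Sequence

Vector = Sequence[float]


def cosine_distance(u: Vector, v: Vector) -> float:
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return 1.0 - dot / (norm_u * norm_v)


def mean_cosine_distance(group_a: Sequence[Vector],
                         group_b: Sequence[Vector]) -> float:
    """Average the cosine distance over every pair (a, b).

    Assumption: all cross-group pairs are weighted equally.
    """
    distances = [cosine_distance(a, b) for a in group_a for b in group_b]
    return sum(distances) / len(distances)


if __name__ == "__main__":
    # Toy 3-dimensional vectors standing in for document embeddings
    # (hypothetical values, not data from the study).
    jurisdiction_a = [[0.9, 0.1, 0.3], [0.8, 0.2, 0.4]]
    jurisdiction_b = [[0.7, 0.3, 0.5], [0.2, 0.9, 0.1]]
    print(round(mean_cosine_distance(jurisdiction_a, jurisdiction_b), 3))
```

A distance of 0 means two vectors point in the same direction and a distance of 1 means they are orthogonal, so under this reading a lower mean indicates closer semantic alignment between two sets of regulations.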
Publication status:
Published
Peer review status:
Peer reviewed

Publisher copy:
10.1145/3793533

Authors

Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
Institution:
University of Oxford
Division:
MPLS
Department:
Engineering Science
Role:
Author
ORCID:
0000-0001-7306-2630


Publisher:
Association for Computing Machinery (ACM)
Journal:
ACM Transactions on Computing for Healthcare
Volume:
7
Issue:
2
Pages:
1-34
Article number:
23
Publication date:
2026-03-17
Acceptance date:
2026-01-12
DOI:
10.1145/3793533
EISSN:
2637-8051
ISSN:
2691-1957


Language:
English
Pubs id:
2366829
Local pid:
pubs:2366829
Deposit date:
2026-03-31