NIHR Biomedical Research Centre: Oxford

Enabling translational research through partnership

NIHR 20th Anniversary NIHR website
NIHR Biomedical Research Centre: Oxford
  • Home
  • About
    • About us
    • Impact
    • Our next BRC
    • Steering Committee
    • Promoting equality, diversity and inclusion in research
    • Current Vacancies
    • Stay in Touch
    • Contact Us
  • Research
    • Research Overview
      • NIHR Clinical Research Facility
      • Ethics in the NIHR BRC: Oxford
      • Health Economics
      • Medical Statistics
    • Cancer
    • Cardiovascular Medicine
    • Digital Health from Hospital to Home
    • Gene and Cell Therapy
    • Genomic Medicine
    • Imaging
    • Inflammation across Tissues
    • Life-saving Vaccines
    • Metabolic Experimental Medicine
    • Modernising Medical Microbiology and Big Infection Diagnostics
      • Theme overview
      • Infections in Oxfordshire Database (IORD)
    • Musculoskeletal
    • Preventive Neurology
    • Respiratory Medicine
    • Surgical Innovation, Technology and Evaluation
    • Translational Data Science
  • Patient & Public Involvement
    • For patients and the public
    • For researchers
    • More information
  • Training
  • Industry & Partnerships
  • News
  • Events
  • Videos

Research Theme

IORD Project

The accuracy of redaction tools for de-identification of medical records

COMPLETED
IORD category: Electronic Health Records
Chief Investigator: Prof David Eyre
Sponsor: OUH
Research location: OUH NHS Trust
Approval date: 18 Oct 2023

To safely use medical records in research via IORD and other similar research databases, the records are first “de-identified”. This means taking out names, dates of birth, addresses, phone numbers, NHS numbers, etc, so that researchers cannot see information that could be used to identify who the record belongs to.

It is easy to do this when this information is stored with an obvious label like “name”. However, rarely information can accidentally be hidden in different parts of the data, like the text of a medical record. For example, a comment about a birthday might be included in the report of a scan or a doctor might write the name of village where a person lives in the text of an entry about how they might have picked up an infection.

There are tools which can be used to find this information. We want to test how effective they are at finding patient identifiers in different kinds of free text so we can take them out. We will test these tools on scan reports, on infection and other lab reports, and patient reviews by infection teams. Two NHS doctors will also review the same text and mark any patient identifiers. Then we can see how well the tools work by comparing them with what the doctors found. This will let us work out if the tools are doing a good job or not, and which is the best to use in future.

See publication: Benchmarking transformer-based models for medical record deidentification: A single centre, multi-specialty evaluation

Modernising Medical Microbiology icon

Modernising Medical Microbiology and Big Infection Diagnostics

  • Theme overview
  • Sub-theme 1: Novel rapid, high-throughput diagnostic workflows for infection
  • Sub-theme 2: Big data-led infection diagnosis and management strategies
  • Contacts
  • Videos
  • News

Infections in Oxfordshire Research Database (IORD)

  • IORD Overview
    • What data is in IORD?
    • Different kinds of data in IORD
    • Who can use the data?
    • How do they get the data?
    • What do they do with the data?
    • What kind of questions has IORD answered? Why is this important?
    • What was it like before IORD?
    • Opting out
  • IORD Application Trajectory
  • IORD Projects
  • IORD Publications
  • IORD Privacy Notice
  • IORD Glossary
  • IORD FAQs
  • IORD Infographics
You are here: Home > IORD > The accuracy of redaction tools for de-identification of medical records

Subscribe to the BRC Oxford Newsletter

Keep informed about the work of the BRC Oxford by subscribing to our Mailchimp e-newsletter. It is produced several times a year and delivers news and information about upcoming events straight to your inbox.

Subscribe Now

Feedback

We’d love to hear your feedback. Please contact us at obrcenquiries@ouh.nhs.uk

BRC Oxford on Social Media

  • Bluesky
  • Facebook
  • LinkedIn
  • YouTube
  • Data Control and Privacy
  • Accessibility
  • Our Partners
  • Disclaimer
  • Contact

Copyright © 2026 NIHR Biomedical Research Centre: Oxford