A sensible censor for sharing medical records

July 24, 2008

(PhysOrg.com) -- Newly developed MIT software will help to allay patients' fears about who has access to their confidential records, facilitating the use of that data for medical research.

In the July 24 issue of the journal BMC Medical Informatics and Decision Making, a team of MIT researchers describes a computer program capable of automatically deleting details from medical records that may identify patients, while leaving important medical information intact.
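To make the idea concrete, here is a minimal sketch of text de-identification in Python. It is not the MIT package or its algorithm: the patterns, the `KNOWN_NAMES` list, the `deidentify` function and the sample note are all hypothetical, chosen only to show how identifying details can be masked while clinical values are left alone.

```python
import re

# Hypothetical list of known names (for example, drawn from patient and staff lists).
KNOWN_NAMES = {"smith", "jones"}

# Hypothetical patterns for illustration only; real notes need far broader coverage.
PATTERNS = [
    ("DATE", re.compile(r"\b\d{1,2}/\d{1,2}/\d{2,4}\b")),
    ("PHONE", re.compile(r"\b\d{3}-\d{3}-\d{4}\b")),
]


def deidentify(text, names=KNOWN_NAMES):
    """Mask known names, then dates and phone numbers, with bracketed labels."""

    def mask_name(match):
        word = match.group(0)
        return "[NAME]" if word.lower() in names else word

    # Step 1: replace any alphabetic word found in the name list.
    text = re.sub(r"\b[A-Za-z]+\b", mask_name, text)

    # Step 2: replace anything matching a date or phone pattern.
    for label, pattern in PATTERNS:
        text = pattern.sub(f"[{label}]", text)
    return text


note = "Pt Smith seen 7/24/2008, BP 120/80. Call 617-555-0100."
print(deidentify(note))
```

In this toy note the blood pressure reading is untouched because it does not fit the date pattern, which is the kind of distinction an automated system must draw at scale.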

Patient records that are to be shared within the research community must have any identifying information removed, according to the U.S. Health Insurance Portability and Accountability Act (HIPAA). However, manual removal of identifying information is prohibitively expensive, time-consuming and prone to error, constraints that have prompted considerable research toward developing automated techniques for "de-identifying" medical records.

The MIT team aimed to solve this problem. "We've developed a free and open-source software package to allow researchers to accurately de-identify text in medical records in a HIPAA-compliant manner," said Gari D. Clifford, a principal research scientist in the Harvard-MIT Division of Health Sciences and Technology (HST) who led the work with Principal Investigator Roger G. Mark, a professor in HST and MIT's Department of Electrical Engineering and Computer Science.

According to Dr. Zohara Cohen, program director at the National Institute of Biomedical Imaging and Bioengineering, sponsor of the work, the information in patients' medical records is a "largely untapped treasure trove" that the biomedical research community could use to better understand diseases and their treatments.

"The automated de-identification software developed under the guidance of Dr. Mark is a big step forward in permitting the widespread sharing of patient information without the risk of compromised privacy and confidentiality," Cohen said.

Clifford, Mark and colleagues tested their censoring software on 1,836 nursing notes (a total of 296,400 words). Using multiple experts and additional algorithms, they replaced all personal information with "fake" data. In their BMC paper, they report that "the software successfully deleted more than 94 percent of the confidential information, while wrongly deleting only 0.2 percent of the useful content. This is significantly better than one expert working alone, at least as good as two trained medical professionals checking each other's work and many, many times faster than either."
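The two percentages reported above correspond to two separate ratios: the share of confidential words that were removed, and the share of non-confidential words that were removed by mistake. The sketch below shows one way such figures could be computed. The `score` function and the token positions are invented for illustration and are not taken from the paper or its dataset.

```python
def score(gold_phi, removed, total_tokens):
    """Return (fraction of confidential tokens removed,
               fraction of non-confidential tokens wrongly removed).

    gold_phi     -- set of token positions experts marked as confidential
    removed      -- set of token positions the software deleted
    total_tokens -- total number of tokens in the text
    """
    correctly_removed = len(gold_phi & removed)
    wrongly_removed = len(removed - gold_phi)
    non_confidential = total_tokens - len(gold_phi)
    return (correctly_removed / len(gold_phi),
            wrongly_removed / non_confidential)


# Invented toy example: 24 tokens, 4 of them confidential.
gold = {1, 5, 6, 9}
removed = {1, 5, 6, 12}
recall, false_removal_rate = score(gold, removed, total_tokens=24)
print(recall, false_removal_rate)
```

In the toy example, three of four confidential tokens are caught and one of 20 ordinary tokens is lost. The reported results correspond to the first ratio exceeding 0.94 and the second being about 0.002.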

The team is providing researchers access to the evaluation dataset together with the software to allow others to improve their systems, and to allow the software to be adapted to other data types that may exhibit different qualities.

Provided by MIT


