A sensible censor for sharing medical records

July 24th, 2008

(PhysOrg.com) -- Newly developed MIT software will help to allay patients' fears about who has access to their confidential records, facilitating the use of that data for medical research.

In the July 24 issue of the journal BMC Medical Informatics and Decision Making, a team of MIT researchers describes a computer program capable of automatically deleting details from medical records that may identify patients, while leaving important medical information intact.

Patient records that are to be shared within the research community must have any identifying information removed, according to the U.S. Health Insurance Portability and Accountability Act (HIPAA). However, manual removal of identifying information is prohibitively expensive, time consuming and prone to error-constraints that have prompted considerable research toward developing automated techniques for "de-identifying" medical records.

The MIT team aimed to solve this problem. "We've developed a free and open-source software package to allow researchers to accurately de-identify text in medical records in a HIPAA-compliant manner," said Gari D. Clifford, a principal research scientist in the Harvard-MIT Division of Health Sciences and Technology (HST) who led the work with Principal Investigator Roger G. Mark, a professor in HST and MIT's Department of Electrical Engineering and Computer Science.

According to Dr. Zohara Cohen, program director at the National Institute of Biomedical Imaging and Bioengineering, sponsor of the work, the information in patients' medical records is a "largely untapped treasure trove" that the biomedical research community could use to better understand diseases and their treatments.

"The automated de-identification software developed under the guidance of Dr. Mark is a big step forward in permitting the widespread sharing of patient information without the risk of compromised privacy and confidentiality," Cohen said.

Clifford, Mark and colleagues tested their censoring software on 1,836 nursing notes (a total of 296,400 words). Using multiple experts and additional algorithms, they replaced all personal information with "fake" data. In their BMC paper, they report that "the software successfully deleted more than 94 percent of the confidential information, while wrongly deleting only 0.2 percent of the useful content. This is significantly better than one expert working alone, at least as good as two trained medical professionals checking each other's work and many, many times faster than either."

The team is providing researchers access to the evaluation dataset together with the software to allow others to improve their systems, and to allow the software to be adapted to other data types that may exhibit different qualities.

Provided by MIT


print this article email this article download pdf blog this article bookmark this article     Digg this Stumble it share on Facebook share on Reddit add to delicious save to Yahoo! bookmarks
not rated yet


July 24th, 2008 all stories
Medicine & Health / Other

Comments: 0
Rank: not rated yet

  • Stumble this up

  • Digg this

  • Share it:
  • share on Facebook
  • share on MySpace
  • share on Slashdot
  • rss-newsfeed
  • share on Google
  • share on Reddit
  • add to delicious
  • save to Yahoo! bookmarks
  • share on Windows Live
  • Add to Mixx!
Rating: not rated yet



  • Physicists Demonstrate Quantum Memory with Matter Qubits
    Physicists Demonstrate Quantum Memory with Matter Qubits
    Physics / General Physics
    created Jul 03, 2009 | popularity 4.4 / 5 (17) | comments 1
  • 'Holey' Nanosheets for Wastewater Dye Removal
    Nanotechnology / Nanomaterials
    created Jul 01, 2009 | popularity 5 / 5 (5) | comments 1
  • Jellyfish Robot Swims Like its Biological Counterpart
    Jellyfish Robot Swims Like its Biological Counterpart
    Electronics / Robotics
    created Jun 26, 2009 | popularity 4.4 / 5 (8) | comments 1
  • Could Maxwell's Demon Exist in Nanoscale Systems?
    Could Maxwell's Demon Exist in Nanoscale Systems?
    Physics / General Physics
    created Jun 24, 2009 | popularity 4.4 / 5 (18) | comments 29
  • Living Safely with Robots, Beyond Asimov's Laws
    Living Safely with Robots, Beyond Asimov's Laws
    Electronics / Robotics
    created Jun 22, 2009 | popularity 4.6 / 5 (52) | comments 40
  • Other News

    Parents' endorsement of vigorous team sports increases children's physical activity, say researchers

    Medicine & Health / Psychology & Psychiatry

    created 33 minutes ago | popularity not rated yet | comments 0

    Parents who value strenuous team sports are more likely to influence their children to join a team or at least participate in some kind of exercise, and spend less time in front of the TV or computer, a new study says.


    Caffeine reverses memory impairment in Alzheimer's mice

    Caffeine reverses memory impairment in Alzheimer's mice

    Medicine & Health / Research

    created 1hour ago | popularity not rated yet | comments 0

    Coffee drinkers may have another reason to pour that extra cup. When aged mice bred to develop symptoms of Alzheimer's disease were given caffeine - the equivalent of five cups of coffee a day - their memory ...


    Researchers find possible environmental causes for Alzheimer's, diabetes

    Medicine & Health / Diseases

    created 1hour ago | popularity not rated yet | comments 0

    A new study by researchers at Rhode Island Hospital have found a substantial link between increased levels of nitrates in our environment and food with increased deaths from diseases, including Alzheimer's, diabetes mellitus ...


    Variations in 5 genes raise risk for most common brain tumors

    Medicine & Health / Genetics

    created 17 hours ago | popularity not rated yet | comments 1

    Common genetic variations spread across five genes raise a person's risk of developing the most frequent type of brain tumor, an international research team reports online in Nature Genetics.


    Wind power may have its own environmental problems

    Medicine & Health / Health

    created 16 hours ago | popularity 3.7 / 5 (6) | comments 4

    Wind power generation is expected to be a clean and environmentally friendly natural energy source, but a new kind of environmental problem has surfaced as infrasonic waves caused by windmills are suspected of causing health ...