A sensible censor for sharing medical records

July 24, 2008

(PhysOrg.com) -- Newly developed MIT software will help to allay patients' fears about who has access to their confidential records, facilitating the use of that data for medical research.

In the July 24 issue of the journal BMC Medical Informatics and Decision Making, a team of MIT researchers describes a computer program capable of automatically deleting details from medical records that may identify patients, while leaving important medical information intact.

Patient records that are to be shared within the research community must have any identifying information removed, according to the U.S. Health Insurance Portability and Accountability Act (HIPAA). However, manual removal of identifying information is prohibitively expensive, time consuming and prone to error-constraints that have prompted considerable research toward developing automated techniques for "de-identifying" medical records.

The MIT team aimed to solve this problem. "We've developed a free and open-source software package to allow researchers to accurately de-identify text in medical records in a HIPAA-compliant manner," said Gari D. Clifford, a principal research scientist in the Harvard-MIT Division of Health Sciences and Technology (HST) who led the work with Principal Investigator Roger G. Mark, a professor in HST and MIT's Department of Electrical Engineering and Computer Science.

According to Dr. Zohara Cohen, program director at the National Institute of Biomedical Imaging and Bioengineering, sponsor of the work, the information in patients' medical records is a "largely untapped treasure trove" that the biomedical research community could use to better understand diseases and their treatments.

"The automated de-identification software developed under the guidance of Dr. Mark is a big step forward in permitting the widespread sharing of patient information without the risk of compromised privacy and confidentiality," Cohen said.

Clifford, Mark and colleagues tested their censoring software on 1,836 nursing notes (a total of 296,400 words). Using multiple experts and additional algorithms, they replaced all personal information with "fake" data. In their BMC paper, they report that "the software successfully deleted more than 94 percent of the confidential information, while wrongly deleting only 0.2 percent of the useful content. This is significantly better than one expert working alone, at least as good as two trained medical professionals checking each other's work and many, many times faster than either."

The team is providing researchers access to the evaluation dataset together with the software to allow others to improve their systems, and to allow the software to be adapted to other data types that may exhibit different qualities.

Provided by MIT


Rank not rated yet
Relevant PhysicsForums posts

More news stories

Complex wiring of the nervous system may rely on a just a handful of genes and proteins

Researchers at the Salk Institute have discovered a startling feature of early brain development that helps to explain how complex neuron wiring patterns are programmed using just a handful of critical genes. ...

Medicine & Health / Research

created 11 hours ago | popularity 4.9 / 5 (9) | comments 1 | with audio podcast

Team isolates nerve cells involved in storing long term memory and gene proteins associated with them

(Medical Xpress) -- A research team in Taiwan has succeeded in isolating two nerve cells in fruit fly brains that are believed to be the major players in allowing for the formation of long term memories. Furthermore, ...

Medicine & Health / Neuroscience

created 17 hours ago | popularity 5 / 5 (4) | comments 2 | with audio podcast report

Seeing colors in music, tasting flavors in shapes may happen in life's early months

Famed violinist Itzhak Perlman sees a deep forest green whenever he plays a B-flat on his Stradivarius' G string. The A on the E string is red.

Medicine & Health / Psychology & Psychiatry

created 18 hours ago | popularity 4.5 / 5 (2) | comments 2 | with audio podcast

Both maternal and paternal age linked to autism

Older maternal and paternal age are jointly associated with having a child with autism, according to a recently published study led by researchers at The University of Texas Health Science Center at Houston (UTHealth).

Medicine & Health / Psychology & Psychiatry

created 15 hours ago | popularity 4.3 / 5 (3) | comments 0 | with audio podcast

New understanding of DNA repair could eventually lead to cancer therapy

A research group in the Faculty of Medicine & Dentistry at the University of Alberta is hoping its latest discovery could one day be used to develop new therapies that target certain types of cancers.

Medicine & Health / Cancer

created 15 hours ago | popularity 4.8 / 5 (6) | comments 0 | with audio podcast


Anonymous knocks CIA website offline (Update)

The website of the Central Intelligence Agency was inaccessible on Friday after the hacker group Anonymous claimed to have knocked it offline.

New error-correcting codes guarantee the fastest possible rate of data transmission

Error-correcting codes are one of the triumphs of the digital age. They’re a way of encoding information so that it can be transmitted across a communication channel — such as an optical fiber o ...

Humans may have helped the decline of African rainforests 3000 years ago

(PhysOrg.com) -- Large areas of rainforests in Central Africa mysteriously disappeared over three thousand years ago, to be replaced by savannas. The prevailing theory has been that the cause was a change ...

Google users warned of threat to smartphone wallets

Users of Google smartphone wallets were being warned on Friday that there is a way to crack pass codes intended to thwart thieves from going on illicit shopping sprees.

New power source discovered

(PhysOrg.com) -- Researchers at the Massachusetts Institute of Technology (MIT) and RMIT University have made a breakthrough in energy storage and power generation.

The power of estrogen -- male snakes attract other males

A new study has shown that boosting the estrogen levels of male garter snakes causes them to secrete the same pheromones that females use to attract suitors, and turned the males into just about the sexiest ...