This is a sophisticated and subtle arena of work in which many people have contributed, and so your questions cannot be answered in just a few sentences. A couple of articles that might help.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2528047/ http://dms.data.jhu.edu/data-management-resources/publish-and-share/de-identify-human-subjects-data/applications-to-assist-in-de-identification-of-human-subjects-data/ Because you didn't mention HIPAA, you need to check into the national/legal environment in which you will be using these notes and see what types of Identifying information pertain to your use case. If it is HIPAA, you can find specific guidance for attributes that should be dummied, excised, or obfuscated. On Tue, Jul 24, 2018 at 12:36 PM, RAJAT TANWAR <[email protected]> wrote: > I was trying to de-identify my clinical notes and i came across cTakes. I > am very impressed with the demo of cTakes. > > I want to install it and De-identify my clinical notes. > > Could i be provided with some assistance on how to approach it. > > Thanks and Regards >
