Hi Leonard,

We usually have to build our own gold standards, depending on what we're
looking for.


For clinical documents we use mtsamples: http://www.mtsamples.com
These are medically transcribed notes from multiple disciplines.  They are
de-identified, but not annotated.

Another option, if you're looking for gold standards, is to check out i2b2:
https://www.i2b2.org/NLP/DataSets/Main.php

I haven't used their datasets, so I'm not exactly sure how to get them, but
I think if you register you might be able to grab datasets for smoking,
medications, and relationships.

Good luck,

Neal



From:   Leonard Jacuzzo <[email protected]>
To:     [email protected]
Date:   06/22/2012 07:06 PM
Subject:        Public Gold Standards?



Hi, I know this is not a UIMA-specific question, but I am exploring NLP and
UIMA.

But I don't have the resources to develop a Medical Gold Standard set of
annotated documents. To do any real exploration, I need one of these.

Does anyone on this list know where I can obtain de-identified gold
standard documents with which to test my setups?


Any help will be greatly appreciated.

Best wishes,
Leonard
