I have encountered a situation in which the cTakes clinical pipeline output
differs between multiple runs on the same text with the same configuration.
The following snippets from a single document are sufficient to demonstrate
the issue:
a gentle curve going into. irrigated with Bacitracin.
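
For anyone trying to narrow down this kind of run-to-run difference, here is a minimal sketch of a comparison helper. It assumes the annotations from each run have already been exported as simple tuples of (begin, end, covered text, concept id). That export step, the tuple layout, the function name `diff_runs`, and the placeholder ids `CUI_A` and `CUI_B` are all assumptions made for illustration; none of them is part of cTakes itself.

```python
from collections import Counter


def diff_runs(run_a, run_b):
    """Return the records that appear in only one of two runs.

    Each run is an iterable of hashable, sortable records, for example
    (begin, end, covered_text, concept_id) tuples. Duplicates are counted,
    so a record emitted twice in one run and once in the other is reported.
    """
    count_a, count_b = Counter(run_a), Counter(run_b)
    only_a = sorted((count_a - count_b).elements())
    only_b = sorted((count_b - count_a).elements())
    return only_a, only_b


# Hypothetical records for the snippet above; not real pipeline output.
run1 = [(9, 14, "curve", "CUI_A"), (42, 52, "Bacitracin", "CUI_B")]
run2 = [(42, 52, "Bacitracin", "CUI_B")]

only_in_1, only_in_2 = diff_runs(run1, run2)
print("only in run 1:", only_in_1)
print("only in run 2:", only_in_2)
```

Comparing as multisets rather than diffing the raw output files keeps the result independent of the order in which annotations are written, so only genuine differences in what was found are reported.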
Hi Bruce,
I would venture to say that this is neither expected nor desired.
Before you fix it (or in addition to a fix), try to run with the new dictionary
lookup. It will have a different behavior, and it will be the default
dictionary lookup in future releases of cTakes – making fixes to
If I understand correctly, I would need new dictionary resources to run the
rare word lookup method.
Where can I find the necessary dictionary(ies) or how do I build them?
Bruce Tietjen
Senior Software Engineer
IMAT Solutions, http://imatsolutions.com
Mobile: 801.634.1547
Good point ...
I tried to check in to SourceForge but had problems. I will try again right
now ...
Building a custom dictionary is possible with the DictionaryTool in the cTakes
sandbox, but that is a different rabbit hole.
-Original Message-
From: Bruce Tietjen
Hi Bruce,
With Pei's help I just updated the SourceForge repo with the cTakes
dictionaries. Check out the artifact ctakes-resources-snomed-rword-hsqldb-2011ab.
Sean
-Original Message-
From: Bruce Tietjen [mailto:bruce.tiet...@perfectsearchcorp.com]
Sent: Wednesday, October 08, 2014 11:52