Johnsd11 commented on issue #61: URL: https://github.com/apache/ctakes/issues/61#issuecomment-2835987432
Depending on the configuration of the dictionary lookup (https://cwiki.apache.org/confluence/display/CTAKES/cTAKES+4.0+-+Fast+Dictionary+Lookup), I think it is entirely possible that you will get some CUI differences under your scenario. For example, if:

* you are comparing dictionary A + B versus dictionary A alone, and
* dictionary A contains "cholecystitis": CUI001, and
* dictionary B contains "acute cholecystitis": CUI002 and "cholecystitis": CUI001, and
* your dictionary lookup is set to "Most Precise Terms Persistence", and
* your source document contains the phrase "acute cholecystitis",

then:

* looking up against dictionaries A + B returns "CUI002" (i.e. only the more precise dictionary B term), and
* looking up against dictionary A alone returns "CUI001".

There are probably other configuration options that can lead to similar slight differences, depending on the specific contents of the reference dictionaries and on the config options in cTAKES, some of which might have defaults in one annotator or another.
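
To make the scenario concrete, here is a rough Python sketch of the idea. It is a toy simulation only, not the cTAKES API: the names `DICT_A`, `DICT_B`, `find_matches`, `most_precise`, and `lookup` are all hypothetical, and the "most precise" rule is assumed to mean "discard any match whose span lies inside a longer match", which may differ in detail from what cTAKES actually does.

```python
# Toy simulation only: NOT the cTAKES API. All names here are hypothetical.
DICT_A = {"cholecystitis": "CUI001"}
DICT_B = {"acute cholecystitis": "CUI002", "cholecystitis": "CUI001"}


def find_matches(text, dictionary):
    """Return (start, end, cui) for every dictionary term found in text."""
    matches = []
    lowered = text.lower()
    for term, cui in dictionary.items():
        start = lowered.find(term)
        while start != -1:
            matches.append((start, start + len(term), cui))
            start = lowered.find(term, start + 1)
    return matches


def most_precise(matches):
    """Drop any match whose span lies inside a strictly longer match's span."""
    kept = []
    for start, end, cui in matches:
        covered = any(
            o_start <= start and end <= o_end and (o_end - o_start) > (end - start)
            for o_start, o_end, _ in matches
        )
        if not covered:
            kept.append((start, end, cui))
    return kept


def lookup(text, *dictionaries):
    """Merge the given dictionaries, match, and keep only the most precise terms."""
    merged = {}
    for d in dictionaries:
        merged.update(d)
    return sorted({cui for _, _, cui in most_precise(find_matches(text, merged))})


text = "Patient presents with acute cholecystitis."
print(lookup(text, DICT_A, DICT_B))  # ['CUI002']
print(lookup(text, DICT_A))          # ['CUI001']
```

With both dictionaries loaded, the shorter "cholecystitis" match is covered by the longer "acute cholecystitis" match and is dropped, so only CUI002 survives. With dictionary A alone there is no longer match to displace it, so CUI001 is returned.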
