Peter, I know Sean is busy this week and he may not see this for a while. But I tried this method over the summer and got it to work so I'm fairly confident that's the right approach still. Some of the details may have changed from two years ago, so I would also check out this directory as a starting point: http://svn.apache.org/viewvc/ctakes/trunk/ctakes-dictionary-lookup-fast-res/src/main/resources/org/apache/ctakes/dictionary/lookup/fast/example/bsv/
Tim ________________________________________ From: Abramowitsch, Peter <[email protected]> Sent: Thursday, January 4, 2018 7:28 AM To: [email protected] Subject: Re: How to use external CSV or BSV in addition to FastUMLS attention Sean [EXTERNAL] Further to my previous message, Sean, I was wondering if you could tell me whether this answer you gave in 2015, is still the right way to do things in ctakes4.x permalink: https://urldefense.proofpoint.com/v2/url?u=http-3A__markmail.org_message_s3ztinppusvsciss&d=DwIFAg&c=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFU&r=Heup-IbsIg9Q1TPOylpP9FE4GTK-OqdTDRRNQXipowRLRjx0ibQrHEo8uYx6674h&m=Xq7U7BTlhofW8xpZfuBKuudNTqry4yt5RzaoBoPLRIg&s=BSEa_ZZMusVnqd2JbfeyoBxsDD1ZdfsHVXO56wR8erA&e= Subject: RE: How to update cTAKES so that new top level categories come out based on local dictionary?<https://urldefense.proofpoint.com/v2/url?u=http-3A__markmail.org_message_s3ztinppusvsciss&d=DwIFAg&c=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFU&r=Heup-IbsIg9Q1TPOylpP9FE4GTK-OqdTDRRNQXipowRLRjx0ibQrHEo8uYx6674h&m=Xq7U7BTlhofW8xpZfuBKuudNTqry4yt5RzaoBoPLRIg&s=BSEa_ZZMusVnqd2JbfeyoBxsDD1ZdfsHVXO56wR8erA&e=> [permalink] <https://urldefense.proofpoint.com/v2/url?u=http-3A__markmail.org_message_s3ztinppusvsciss&d=DwIFAg&c=qS4goWBT7poplM69zy_3xhKwEW14JZMSdioCoppxeFU&r=Heup-IbsIg9Q1TPOylpP9FE4GTK-OqdTDRRNQXipowRLRjx0ibQrHEo8uYx6674h&m=Xq7U7BTlhofW8xpZfuBKuudNTqry4yt5RzaoBoPLRIg&s=BSEa_ZZMusVnqd2JbfeyoBxsDD1ZdfsHVXO56wR8erA&e=> From: Finan, Sean ([email protected]) Date: Oct 6, 2015 2:04:56 pm List: org.apache.incubator.ctakes-dev Regards Peter From: <Abramowitsch>, Peter Abramowitsch <[email protected]<mailto:[email protected]>> Date: Thursday, January 4, 2018 at 12:50 PM To: "[email protected]<mailto:[email protected]>" <[email protected]<mailto:[email protected]>> Subject: How to use external CSV or BSV in addition to FastUMLS Can someone point me to any up-to-date how-tos on how to include external CSV/BSV type resources to add synonyms, and other terms for dictionary lookup to augment the FAST UMLS resources that comes out of the box. Perhaps I have missed something, but looking at the CTakesDictionaryCreator UI, it looks like it is designed only to choose subsets of the UMLS data set rather than allowing one to bring in completely new information sources. I scoured the Marklogic ctakes user archive, but so many of the entries are old and I'm not sure they describe the current way of doing things. The only approach I could see would be to take use the AggregateEngine description and have it point to the CSV annotator, creating a completely new AE but this would build other types of annotation, whereas what I'm thinking about is a case for creating identified mentions such as a DiseaseDisorderMention based on finding an acronym that the UMLS resource doesn't know about, even though the concept in its full textual form is there. I'm sure this is not a unique request and apologize in advance if it has already been answered somewhere - Peter
