Hi Team,

We set up cTAKES4.0.0 as our NLP engine for our profile recently . We have 
faced situations where some of the expected tokens are not picked up by cTAKES 
during clinical text extraction. So our first thought process was to identify 
where the dictionary is configured and how that can be updated. After some code 
analysis  it was found that the dictionary is configured in the  below path 
under ctakes/resources for sources RxNorm and SNOMEDCT_US
[cid:image003.png@01D635F4.428C6D80]

We were able to open the hsqldb using the hsql db gui and found out that some 
of our required entries are already there . So if I come specifically to our 
current problem. The  Pap Smear and Mamogram are two clinical terms which are 
not currently recognized by cTAKES in our profile.

*       If I look into the .script file , Pap Smear and Mammogram/Mammography 
is already present in the .script file and in the respective tables. PFB a 
snapshot as below

  [cid:image006.jpg@01D635F4.428C6D80]                    
[cid:image010.jpg@01D635F4.428C6D80]







But still this was not recogonised by cTAKES. I see there are some filters 
working on top of the available entries in dictionary(ctakes-gui and 
ctake-gui-res). Will that be because of these filters the tokens are not 
recognized as expected. Could you pls. share us what exactly these filters do. 
This will help us in future also when we are trying to add new terms into the 
dictionary



*       What are the steps to do if we need to add/edit entries into the 
existing dictionaries. I see we can add/edit the existing values in .scripts 
files but  our primary doubt is if suppose I have a term 'xyz' to be added to 
dictionary how can I get the CUI and other values like TUI,RINDEX,TCOUNT and 
PREFTERM. Is it fine if I can give any random value for the 
TUI/CUI/RINDEX/TCOUNT. I could also see options to create custom bsv 
dictionaries but couldn't see much documentation for it. Kindly advise which is 
the better option from the below 3.



o   Generate a custom dictionary using METAMORPHOSYS UML installation 
tool(where we provide sources as ICD10,RxNORM,SNOMEDCT_US) and leverage the 
full set of .rrf  files in the meta folder . Is this approach better if the 
entries to be populated are maximal?

o   Add/edit the available dictionary sno_rx_16ab and in that case how to 
provide valid values for each columns like CUI, TUI,RINDEX,TCOUNT and PREFTERM. 
If the entries to be populated are minimal is this approach would be better?.

o   Use a custom bsv , in that case how should we add  values to custom bsv. 
Could you also provide a sample in that case.

I found a Metathesaurus browser in the below url , where I can search for the 
terms and get the CUI  and the respective source like ICD/CPT/MDR. But still I 
was unable to get the other required attributes to  be populated like 
TUI,RINDEX,TCOUNT and PREFTERM. Could you pls. brief what these attributes 
signifies

https://uts.nlm.nih.gov//metathesaurus.html<https://uts.nlm.nih.gov/metathesaurus.html>

Kindly advise us on how to proceed on this and correct us if we went wrong 
somewhere. This would be of great help for us

P.S : We comply with UMLS license


Thanks & Regards
[cid:D3145E69-CD94-48C1-877F-5134EEAFB598]
Abad Ayyub
Vnet: 406170 | Cell : +91-9447379028


This e-mail and any files transmitted with it are for the sole use of the 
intended recipient(s) and may contain confidential and privileged information. 
If you are not the intended recipient(s), please reply to the sender and 
destroy all copies of the original message. Any unauthorized review, use, 
disclosure, dissemination, forwarding, printing or copying of this email, 
and/or any action taken in reliance on the contents of this e-mail is strictly 
prohibited and may be unlawful. Where permitted by applicable law, this e-mail 
and other e-mail communications sent to and from Cognizant e-mail addresses may 
be monitored. This e-mail and any files transmitted with it are for the sole 
use of the intended recipient(s) and may contain confidential and privileged 
information. If you are not the intended recipient(s), please reply to the 
sender and destroy all copies of the original message. Any unauthorized review, 
use, disclosure, dissemination, forwarding, printing or copying of this email, 
and/or any action taken in reliance on the contents of this e-mail is strictly 
prohibited and may be unlawful. Where permitted by applicable law, this e-mail 
and other e-mail communications sent to and from Cognizant e-mail addresses may 
be monitored.

Reply via email to