Hi Ravi,
In the attached LookupDesc_csv_sample.xml, It looks like it is still configured
to use DirectLookupInitializerImpl...
<dictionaryRef idRef="DICT_CSV_SAMPLE" />
<lookupInitializer
className="org.apache.ctakes.dictionary.lookup.ae.DirectLookupInitializerImpl">
</lookupInitializer>
Try:
<lookupInitializer
className="org.apache.ctakes.dictionary.lookup.ae.FirstTokenPermLookupInitializerImpl">
<properties>
<property key="textMetaFields" value="0|1"/>
<property key="maxPermutationLevel" value="7"/>
<property key="windowAnnotations"
value="org.apache.ctakes.typesystem.type.textspan.LookupWindowAnnotation"/>
</properties>
</lookupInitializer>
Instead.
> I also wanted to know if this is the only method to use a non-UMLS vocabulary
> as a dictionary in cTAKES
It should be configurable to use CSV, Lucene, MySQL/HSQLDB, etc. where one can
insert their custom vocabulary, and they and also implement custom lookup
algorithms if needed.
--Pei
From: ravi garg [mailto:[email protected]]
Sent: Wednesday, April 24, 2013 8:29 AM
To: [email protected]
Subject: Re: Regarding Entity Recognition
Hey Sorry for delayed reply.
I believe the changes you are suggesting, are to be made in
LookupDesc_csv_sample.xml. I made those changes but still didn't get the
required results.
I am attaching the files here for reference.
I also wanted to know if this is the only method to use a non-UMLS vocabulary
as a dictionary in cTAKES
Regards,
Ravi Garg
On Tue, Apr 23, 2013 at 1:40 AM, Chen, Pei <[email protected]>
wrote:
Ravi,
Could you please attach the DictionaryLookupAnnotarCSV.xml
In particular, please consider using the FirstTokenPermLookupInitializerImpl vs
DirectLookup.
<lookupInitializer
className="org.apache.ctakes.dictionary.lookup.ae.FirstTokenPermLookupInitializerImpl">
<properties>
<property key="textMetaFields" value="0|1"/>
<property key="maxPermutationLevel" value="7"/>
<property key="windowAnnotations"
value="org.apache.ctakes.typesystem.type.textspan.LookupWindowAnnotation"/>
</properties>
</lookupInitializer>
I hope that helps.
From: ravi garg [mailto:[email protected]]
Sent: Monday, April 22, 2013 4:09 PM
To: [email protected]
Subject: Re: Regarding Entity Recognition
Sorry, But this too doesn't solve the problem
On Tue, Apr 23, 2013 at 1:28 AM, Savova, Guergana
<[email protected]> wrote:
Try adding in the dictionary:
Knee|knee pain|..
The first field is reserved for the first word of the phrase.
Regards,
--Guergana
From: ravi garg [mailto:[email protected]]
Sent: Monday, April 22, 2013 3:37 PM
To: [email protected]
Subject: Re: Regarding Entity Recognition
Hey,
Thanks for reply.
First let me brief you on what configuration I am using. I am using
AggregatePlaintextProcessor.xml with DictionaryLookupAnnotar being
DictionaryLookupAnnotarCSV.xml which reads dictionary from two files i.e one
being the flat dictionary1.csv and another the lucene index one. I have added
knee pain as single term in dictionary1.csv (like knee pain| knee pain) but
still I am not being to get them as single entity. Am I missing something here?
Regards,
Ravi Garg
On Tue, Apr 23, 2013 at 12:49 AM, Chen, Pei <[email protected]>
wrote:
Hi Ravi,
Yes, in your example "knee pain", the default behavior in the dictionary lookup
will create 3 IdentifiedAnnotations
"knee", "pain", as well as "knee pain".
[Assuming the terms exist in the UMLS dictionary]
--Pei
From: ravi garg [mailto:[email protected]]
Sent: Monday, April 22, 2013 3:06 PM
To: [email protected]
Subject: Regarding Entity Recognition
Hey,
First of all Congrats for building such a wonderful software. I am very new to
cTAKES so had a very basic question to ask.
My query is Is it possible to identify multiple words as a single entity, for
eg right now knee pain gets identified as 'knee' and 'pain', but is it possible
to get 'knee pain' as single identity. If so what all changes I have to make to
get going.
--
Ravi Garg
3rd Year
MSc (hons) Biological Sciences
B.E (hons) Computer Science and Engineering
BITS Pilani KK Birla Goa Campus
--
Ravi Garg
3rd Year
MSc (hons) Biological Sciences
B.E (hons) Computer Science and Engineering
BITS Pilani KK Birla Goa Campus
--
Ravi Garg
3rd Year
MSc (hons) Biological Sciences
B.E (hons) Computer Science and Engineering
BITS Pilani KK Birla Goa Campus
--
Ravi Garg
3rd Year
MSc (hons) Biological Sciences
B.E (hons) Computer Science and Engineering
BITS Pilani KK Birla Goa Campus