Hi,

We have a custom decent size dictionary (~1.4M concepts) in a Lucene Index

I'd like to have an implementation of AbstractJCasTermAnnotator, e.g.
DefaultJCas, finding terms from the lucene index directly. I can think on
two options, but I'd like to get everyone's input

1- Create a hsql db containing a dictionary using an approach similar
to org.apache.ctakes.gui.dictionary,DictionaryBuilder and then some sort of
LuceneConceptFactory extending AbstractConceptFactory

2- Creating a new Dictionary Lookup, e.g. LuceneJCasTermAnnotation, similar
to DefaultJCasTermAnnotator with the signature of the findTerms method
something like this

void findTerms( IndexSearcher searcher, List<FastLookupToken> allTokens)

I've seen that for cTakes v3 there was something similar in
the LuceneDictionaryImpl but that doesn't seem to work with the Fast
Dictionary Lookup approach for cTakes-4.0.0

Thanks in advance for any ideas or suggestions!
Iker

Reply via email to