Re: UmlsOverlapLookupAnnotator + BsvRareWordDictionary: # tokens skipped varies? [EXTERNAL]

2018-03-07 Thread Kean Kaufmann
Sean -- Thanks as always!

> It shouldn't be a problem, but is there an eol character after the " II" in your bsv? It shouldn't be necessary, but who knows.

Double checked: yeah, no. Yes, there's an eol; no, that doesn't seem to be it.

> Can you create a Jira item with this information?

Re: UmlsOverlapLookupAnnotator + BsvRareWordDictionary: # tokens skipped varies? [EXTERNAL]

2018-03-07 Thread Finan, Sean
Hi Kean,

It does sound like you are getting some odd results. I will need to look into the code, but I won't have time to do so for a few days. My initial thoughts are below.

>If I add an entry with a comma in it:
>then "chronic kidney disease), stage II" gets picked up, no matter what.

Well

Re: UmlsOverlapLookupAnnotator + BsvRareWordDictionary: # tokens skipped varies?

2018-03-07 Thread Kean Kaufmann
P.S. Extra config bit: I also removed "CD" from the exclusionTags in the UmlsOverlapLookupAnnotator.

On Wed, Mar 7, 2018 at 10:58 AM, Kean Kaufmann wrote:
> Hi Sean,
>
> I'm perplexed. It seems as if the number of tokens that the
> UmlsOverlapLookupAnnotator will skip

UmlsOverlapLookupAnnotator + BsvRareWordDictionary: # tokens skipped varies?

2018-03-07 Thread Kean Kaufmann
Hi Sean,

I'm perplexed. It seems as if the number of tokens that the UmlsOverlapLookupAnnotator will skip varies with the content of the RareWordDictionary.

Here's my setup. I think I've included enough information to replicate my perplexity, if you have time/inclination to do that; let me know