Re: Problem with openNLP Name Finder API....

Jim - FooBar(); Wed, 08 Feb 2012 08:56:40 -0800

aaa ok i see what you mean...but then again if it recognised it as amere token it would not throw "IncompatibleFormat" exceptions but ratherskip it as a token that is not of interest wouldn't it? I don't have anypatches to send you, i just think that not including spaces in the sgmltag is a more wise approach...Unless of course you're extracting thesgml tags via regex...The truth is i've not looked at the source but iwould expect you to use some sort of xml-ish means to extract the sgmltags. If your parser is using regex then i'm sure you have your reasonsfor including the spaces. But anyway, this is a very small problem forme cos i can indeed sort it manually...My big problem still remains!!!

Anyway I'll stop bugging you...the fact that you tried to help means alot and certainly if i sort everything out i'll post what the problemwas for future users...


Cheers,
Jim


On 08/02/12 16:41, Joern Kottmann wrote:

The parsing code for the format expects white space tokenized text. The
<START>  and<END>  tags are handled different and are not
a token in this sense, but when you directly attach it to a word like you
did. acid<START>  then our parsing code just recognize it as a token
and not the tag to mark entity boundaries.

Re: Problem with openNLP Name Finder API....

Reply via email to