W dniu 2013-02-18 20:44, Mike Unwalla pisze: > Marcin, > > Thank you for your clarification about rewriting all tags. I tried to remove > the bug report, but I could not see how to do that. Sorry for the mess. > >> Please describe in detail your input. > The entire input text is only 1 word: testword
Was that only in the example part of the rule or given via the command-line? Then you could have also an EOL character added. Regards, Marcin > > Regards, > > Mike > > -----Original Message----- > From: Marcin Milkowski [mailto:[email protected]] > Sent: 18 February 2013 19:00 > To: [email protected] > Subject: Re: Multiple instances of <S>: update > > W dniu 2013-02-18 18:30, Mike Unwalla pisze: >> With a particular rule, I get multiple instances of postag 'VB': >> <S> You[you/PRP] must[must/MD] check[check/VB, check/VB, check/VB, > check/VB, >> check/VB] this[this/DT] text[text/NN:UN].[./., </S>] > > This is not a bug, this is a feature. You are rewriting all existing > tags to 'VB', so no wonder you end up with multiple instances being the > same. All *different* tags were rewritten using this: > > <disambig><match no="1" postag="VB"/></disambig> > > So this is exactly what should happen. > >> The problem occurs when I add the rule to LT's original disambiguation.xml >> file. >> >> I created a bug report >> > (https://sourceforge.net/tracker/?func=detail&aid=3605206&group_id=110216&at >> id=655717). >> >> Regards, >> >> Mike >> >> -----Original Message----- >> From: Mike Unwalla [mailto:[email protected]] >> Sent: 18 February 2013 11:00 >> To: [email protected] >> Subject: Multiple instances of <S> >> >> Hello, >> >> When LT tags text, the tag <S> shows the start of a sentence, doesn't it? >> >> With one particular disambiguation.xml file, I get unexpected results for >> the tagged text. LT gives multiple instances of the sentence start marker >> <S>, as shown in this output from the GUI: >> <S><S><S><S> testword[</S>testword/TESTPOS] >> >> The first rule in my disambiguation.xml is as follows. (Testrules gives no >> errors.): >> >> <rule id="add_TESTPOS" name="add TESTPOS"> >> <pattern> >> <token>testword</token> >> </pattern> >> <disambig action="add"><wd pos="TESTPOS"/></disambig> >> </rule> >> >> If I put that rule in the LanguageTool disambiguation.xml file, there is >> only one <S> tag, as I expect. > > Without knowing the full *input* sentence, it's hard to say, but if the > testword was the only word in your sentence, then it has <S> by default. > Note that testword itself is tagged only once, so I'm wondering what you > had before 'testword. > >> >> I do not understand: >> 1. How can there be multiple sentence starts? > > Multiple end of lines are enough. > >> 2. Something in my disambiguation.xml makes LT show multiple <S>. But, > this >> is the FIRST rule. How can rules that come after the first rule affect the >> tagging? (The rules "are applied in the order as they appear in the file" >> http://wiki.languagetool.org/developing-a-disambiguator .) > > If there's an end-of-line character, it will be tagged. > >> >> (I think that this is a bug. Probably, I will send more related questions, >> but for now, I want to keep things simple and focus only on one thing at a >> time.) > > I'm not sure it is. Please describe in detail your input. > > Best, > Marcin > >> >> Regards, >> >> Mike Unwalla >> Contact: www.techscribe.co.uk/techw/contact.htm > > > ------------------------------------------------------------------------------ > The Go Parallel Website, sponsored by Intel - in partnership with Geeknet, > is your hub for all things parallel software development, from weekly thought > leadership blogs to news, videos, case studies, tutorials, tech docs, > whitepapers, evaluation guides, and opinion stories. Check out the most > recent posts - join the conversation now. http://goparallel.sourceforge.net/ > _______________________________________________ > Languagetool-devel mailing list > [email protected] > https://lists.sourceforge.net/lists/listinfo/languagetool-devel > > ------------------------------------------------------------------------------ Everyone hates slow websites. So do we. Make your web apps faster with AppDynamics Download AppDynamics Lite for free today: http://p.sf.net/sfu/appdyn_d2d_feb _______________________________________________ Languagetool-devel mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/languagetool-devel
