Bugs item #3605206, was opened at 2013-02-18 09:23
Message generated for change (Tracker Item Submitted) made by mikeunwalla
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=655717&aid=3605206&group_id=110216

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Problem with English
Group: 2.0
Status: Open
Resolution: None
Priority: 5
Private: No
Submitted By: MikeUnwalla (mikeunwalla)
Assigned to: Marcin Miłkowski (milek_pl)
Summary: Multiple instances of the same postag for 1 token

Initial Comment:
With a particular rule, I get 5 instances of postag 'VB' for 1 token:
<S> You[you/PRP] must[must/MD] check[check/VB, check/VB, check/VB, check/VB, 
check/VB] this[this/DT] text[text/NN:UN].[./., </S>] 

This is the rule that gives multiple postags. I have simplified the rule as 
much as possible so that there are no distractions:
    <rule id="test" name="test">
      <pattern>
        <token/>
        <marker>
          <token>check</token>
        </marker>
      </pattern>
      <disambig><match no="1" postag="VB"/></disambig>
    </rule>

With an 'ambiguous' example in the rule 
(http://wiki.languagetool.org/developing-a-disambiguator#toc11), testrules 
gives this error (line breaks removed):
Exception in thread "main" junit.framework.AssertionFailedError: The output 
form for the rule test in the example: You must <marker>check</marker> the 
sentence.: check[check/JJ,check/NN:UN,check/UH,check/VB,check/VBP] -> 
check[check/VB] is different than expected (expected check[check/VB] but got 
check[check/VB,check/VB,check/VB,check/VB,check/VB]). The token has been 
changed by the disambiguator:

A similar problem occurs with other postags and with other words. The problem 
appears to be only with words that are both nouns and verbs, such as ‘check’, 
‘test’, and ‘oil’.


----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=655717&aid=3605206&group_id=110216

------------------------------------------------------------------------------
The Go Parallel Website, sponsored by Intel - in partnership with Geeknet, 
is your hub for all things parallel software development, from weekly thought 
leadership blogs to news, videos, case studies, tutorials, tech docs, 
whitepapers, evaluation guides, and opinion stories. Check out the most 
recent posts - join the conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Languagetool-commits mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/languagetool-commits

Reply via email to