So in JLanguageToolTest.testAnalyzedSentence() (line 133), the test
expects the reading test­ed/null for the token with a soft hyphen, but
I don't really understand this logic.
The last related commit I found was a refactoring
(f44e90d1f00c87ef3728d3ac7d8502add8311142).
As I understand it, we use a token reading with a null tag to show that
we could not tag the token. But what's the value of an additional
reading with a null tag when the token already has other readings?
I don't mind adjusting my rules but I'd like to fully understand the
logic before I do so :)
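
For reference, the regex vs. for loop/indexOf idea from the quoted
message below could look roughly like this. This is only a sketch, not
the actual patch: the class name, method names and the IGNORED_CHARS
constant are made up for illustration, and the only ignored character
assumed here is the soft hyphen (U+00AD).

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of LanguageTool.
public class IgnoredCharsSketch {

    // Assumption: the characters to ignore, here only the soft hyphen.
    private static final String IGNORED_CHARS = "\u00AD";

    // Regex variant: a character class with the same single character.
    private static final Pattern IGNORED_PATTERN = Pattern.compile("[\u00AD]");

    // Removes ignored characters using the precompiled regex.
    static String stripWithRegex(String text) {
        return IGNORED_PATTERN.matcher(text).replaceAll("");
    }

    // Removes ignored characters with a plain loop and indexOf.
    static String stripWithLoop(String text) {
        StringBuilder sb = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (IGNORED_CHARS.indexOf(c) < 0) {
                sb.append(c);
            }
        }
        return sb.toString();
    }
}
```

Both variants should give the same result; which one is faster would
still need the performance test Daniel asks for.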

Thanks
Andriy

2015-01-18 14:26 GMT-05:00 Daniel Naber <daniel.na...@languagetool.org>:
> On 2015-01-16 05:19, Andriy Rysin wrote:
>
>> Here's a PoC patch to allow ignoring arbitrary characters when
>> analyzing text.
>
> Some ideas:
>
> -add "@since 2.9" to the new public methods
> -a test case would be nice
>
> About the "if( false )": do all the tests still work with that? Maybe
> have a look in the file history to see what it is for.
>
>> We could change regex into array of chars and use for loop/indexOf
>> (this should still be faster than regexp for several characters).
>
> I don't think this is necessary, unless a performance test shows that
> it's worth it.
>
> Regards
>   Daniel
>
>
> ------------------------------------------------------------------------------
> New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
> GigeNET is offering a free month of service with a new server in Ashburn.
> Choose from 2 high performing configs, both with 100TB of bandwidth.
> Higher redundancy.Lower latency.Increased capacity.Completely compliant.
> http://p.sf.net/sfu/gigenet
> _______________________________________________
> Languagetool-devel mailing list
> Languagetool-devel@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/languagetool-devel

