On 2015-10-09 07:32, Dominique Pellé wrote:

> I suppose that I care more than most because I only use LT to check
> text files where the situation is frequent.

I think normalizing the text makes sense if:
1) single line breaks get removed from plain text files (but not double 
spaces)
2) this normalization doesn't happen in LT core, but in the command-line 
client

My understanding is that's not enough for your use case as you use 
spaces for indentation? For me, this sounds like a general input format 
issue, just like people want to use LT to check LaTeX. We cannot support 
that in the core, but if we find a way to do it outside that would be 
okay for me. We just need to avoid becoming a parser for every format 
out there.

We already have the concept of annotated text[1], I think this could be 
used to check plain text files. "\n" is then markup just like "<h1>" is 
markup in XML. So we don't need normalization in that sense, but we need 
to parse the input.

[1] 
https://languagetool.org/development/api/org/languagetool/markup/AnnotatedText.html

Regards
  Daniel


------------------------------------------------------------------------------
_______________________________________________
Languagetool-devel mailing list
Languagetool-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/languagetool-devel

Reply via email to