On 10/03/2017 05:26 PM, Krunose wrote:

> Writer shouldn't show different word count for odt file and different for 
> docx file. Same for plain text and HTML.

This gets into when presentation markup, structural markup, and
syntactical markup are treated as words that are explicitly content, and
when they are ignored, because they are background noise.

Think of it this way.
* Is white space background noise, or significant content?

If the algorithm treats white space as background noise, it will produce
a different count, than if treats white space as significant content.

One other potential issue, is legitimate, explicit content, being
flagged as background noise, because it contains a sub-string that can
be confused with markup.

jonathon

-- 
To unsubscribe e-mail to: users+unsubscr...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted

Reply via email to