https://bugzilla.wikimedia.org/show_bug.cgi?id=27773

           Summary: Length of dump text and length field in API do not
                    match
           Product: XML Snapshots
           Version: unspecified
          Platform: All
        OS/Version: All
            Status: NEW
          Keywords: analytics
          Severity: enhancement
          Priority: Normal
         Component: General
        AssignedTo: ar...@wikimedia.org
        ReportedBy: dvanli...@gmail.com
                CC: tf...@wikimedia.org, aaron.halfa...@gmail.com


Length of dump text and length field in API do not match (even after UF8
encoding) due to inconsistent line break characters and beginning/ending
whitespace. 
Note that this results in false negatives when detecting identity reverts

Current workaround:
Strip whitespace from the beginning/end and replace all "\r\n" (windows
linebreak) with "\n".  With this approach, you get acceptable (99%), but still
imperfect consistency between API and dump.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to