[
https://issues.apache.org/jira/browse/PDFBOX-568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12802943#action_12802943
]
Mykola Gurov commented on PDFBOX-568:
-------------------------------------
The change in revision 889724 has suppressed test failures for *all* the
files. I guess this wasn't the intention?
~/pdfbox $ cat test/input/simple-openoffice.pdf.txt
##I am a simple pdf.
~/pdfbox $ echo garbage > test/input/simple-openoffice.pdf.txt
~/pdfbox $ cat test/input/simple-openoffice.pdf.txt
garbage
~/pdfbox $ ant testextract
Buildfile: build.xml
...
testextract:
[junit] Testsuite: org.apache.pdfbox.util.TestTextStripper
[junit] Tests run: 1, Failures: 0, Errors: 0, Time elapsed: 10.478 sec
[junit]
[junit] Testcase: testExtract took 10.474 sec
BUILD SUCCESSFUL
Total time: 22 seconds
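
For illustration, here is a minimal sketch of the behaviour one would expect
from the test: any difference between the expected and the extracted text
should surface as a failure instead of only being logged. This is not the
actual TestTextStripper code; the class and method names below are
hypothetical, and plain AssertionError is used so the sketch does not depend
on JUnit.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch, not PDFBox code: compare expected and actual lines
// and fail loudly on the first difference.
class ExtractionCheckSketch {

    /** Returns the index of the first differing line, or -1 if both lists are equal. */
    static int firstMismatch(List<String> expected, List<String> actual) {
        int common = Math.min(expected.size(), actual.size());
        for (int i = 0; i < common; i++) {
            if (!expected.get(i).equals(actual.get(i))) {
                return i;
            }
        }
        // Same prefix: they still differ if one list is longer than the other.
        return expected.size() == actual.size() ? -1 : common;
    }

    /** Throws AssertionError when the lists differ, so the build cannot pass silently. */
    static void check(String file, List<String> expected, List<String> actual) {
        int index = firstMismatch(expected, actual);
        if (index >= 0) {
            throw new AssertionError(
                "Line mismatch for file " + file + " at line " + (index + 1));
        }
    }

    public static void main(String[] args) {
        List<String> actual = Arrays.asList("##I am a simple pdf.");
        check("simple-openoffice.pdf", Arrays.asList("##I am a simple pdf."), actual); // passes
        check("simple-openoffice.pdf", Arrays.asList("garbage"), actual); // throws AssertionError
    }
}
```

With a check like this, the "garbage" expectation from the session above
would fail the build rather than report BUILD SUCCESSFUL.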
> testextract failure on Linux and Mac OS X
> -----------------------------------------
>
> Key: PDFBOX-568
> URL: https://issues.apache.org/jira/browse/PDFBOX-568
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 0.8.0-incubator
> Reporter: Jukka Zitting
>
> As discussed on the mailing list, the extraction test case seems to fail on
> non-Windows platforms.
> The troublesome test file is sample_fonts_solidconvertor.pdf, and the
> textextract.log file says the following (^@ is U+0000 and � is U+FFFD):
> Lines differ at index expected:46-253 actual:46-65533
> FAILURE: Line mismatch for file sample_fonts_solidconvertor.pdf at expected
> line: 8 at actual line: 8
> expected line was: "^...@v^@e...@r^@d...@a^@n...@a^@:^@ ^...@t^@o...@t^@o^@
> ^...@j^@e^@ ^...@p^@o...@k^@u...@s^@n...@ý^@ ^...@t^@e...@x^@t^@ ^...@s^@ ^A"
> actual line was: "^...@v^@e...@r^@d...@a^@n...@a^@:^@ ^...@t^@o...@t^@o^@
> ^...@j^@e^@ ^...@p^@o...@k^@u...@s^@n...@�^@ ^...@t^@e...@x^@t^@ ^...@s^@ ^A"
> Lines differ at index expected:4-253 actual:4-65533
> FAILURE: Line mismatch for file sample_fonts_solidconvertor.pdf at expected
> line: 10 at actual line: 10
> expected line was: "^ay^...@ý^@�...@í^@é"
> actual line was: "^ay^...@�^@�...@�^@�"
> Lines differ at index expected:52-253 actual:52-65533
> FAILURE: Line mismatch for file sample_fonts_solidconvertor.pdf at expected
> line: 11 at actual line: 11
> expected line was: "^...@s^@a...@n^@s^@ ^...@s^@e...@r^@i...@f^@:^@
> ^...@t^@o...@t^@o^@ ^...@j^@e^@ ^...@p^@o...@k^@u...@s^@n...@ý^@
> ^...@t^@e...@x^@t^@ ^...@s^@ ^A"
> actual line was: "^...@s^@a...@n^@s^@ ^...@s^@e...@r^@i...@f^@:^@
> ^...@t^@o...@t^@o^@ ^...@j^@e^@ ^...@p^@o...@k^@u...@s^@n...@�^@
> ^...@t^@e...@x^@t^@ ^...@s^@ ^A"
> Lines differ at index expected:4-253 actual:4-65533
> FAILURE: Line mismatch for file sample_fonts_solidconvertor.pdf at expected
> line: 13 at actual line: 13
> expected line was: "^ay^...@ý^@�...@í^@é"
> actual line was: "^ay^...@�^@�...@�^@�"
> Preparing to parse sample_fonts_solidconvertor.pdf for sorted test
> Lines differ at index expected:46-253 actual:46-65533
> FAILURE: Line mismatch for file sample_fonts_solidconvertor.pdf at expected
> line: 8 at actual line: 8
> expected line was: "^...@v^@e...@r^@d...@a^@n...@a^@:^@ ^...@t^@o...@t^@o^@
> ^...@j^@e^@ ^...@p^@o...@k^@u...@s^@n...@ý^@ ^...@t^@e...@x^@t^@ ^...@s^@ ^A"
> actual line was: "^...@v^@e...@r^@d...@a^@n...@a^@:^@ ^...@t^@o...@t^@o^@
> ^...@j^@e^@ ^...@p^@o...@k^@u...@s^@n...@�^@ ^...@t^@e...@x^@t^@ ^...@s^@ ^A"
> Lines differ at index expected:0-253 actual:0-65533
> FAILURE: Line mismatch for file sample_fonts_solidconvertor.pdf at expected
> line: 10 at actual line: 10
> expected line was: "^...@ý^@�...@í^@é"
> actual line was: "^...@�^@�...@�^@�"
> Lines differ at index expected:52-253 actual:52-65533
> FAILURE: Line mismatch for file sample_fonts_solidconvertor.pdf at expected
> line: 11 at actual line: 11
> expected line was: "^...@s^@a...@n^@s^@ ^...@s^@e...@r^@i...@f^@:^@
> ^...@t^@o...@t^@o^@ ^...@j^@e^@ ^...@p^@o...@k^@u...@s^@n...@ý^@
> ^...@t^@e...@x^@t^@ ^...@s^@ ^A"
> actual line was: "^...@s^@a...@n^@s^@ ^...@s^@e...@r^@i...@f^@:^@
> ^...@t^@o...@t^@o^@ ^...@j^@e^@ ^...@p^@o...@k^@u...@s^@n...@�^@
> ^...@t^@e...@x^@t^@ ^...@s^@ ^A"
> Lines differ at index expected:4-253 actual:4-65533
> FAILURE: Line mismatch for file sample_fonts_solidconvertor.pdf at expected
> line: 13 at actual line: 13
> expected line was: "^a~^...@ý^@�...@í^@é"
> actual line was: "^a~^...@�^@�...@�^@�"
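
The numbers in the log above are code points: 253 is U+00FD (ý) and 65533 is
U+FFFD, the Unicode replacement character. The log does not state the cause,
so the following is only a sketch of one plausible way such a pair can arise,
under the assumption that the same byte gets decoded with different charsets
on different platforms. The class and method names are made up for the
example.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo: the same byte decodes to different code points
// depending on the charset used.
class ReplacementCharDemo {

    /** Decodes the bytes with the given charset and returns the first code point. */
    static int firstCodePoint(byte[] bytes, Charset charset) {
        return new String(bytes, charset).codePointAt(0);
    }

    public static void main(String[] args) {
        byte[] yAcute = { (byte) 0xFD }; // 'ý' encoded as ISO-8859-1

        // Decoded with the charset it was written in: U+00FD.
        System.out.println(firstCodePoint(yAcute, StandardCharsets.ISO_8859_1)); // 253

        // 0xFD is not valid UTF-8, so the decoder substitutes U+FFFD.
        System.out.println(firstCodePoint(yAcute, StandardCharsets.UTF_8)); // 65533
    }
}
```

If something like this is what happens here, naming the charset explicitly
wherever the test reads or writes text would make the result independent of
the platform default.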