Hi all,
the pdfToTextFilter does not appear to respect the text encoding. For example, if
I have a PDF with German umlaut characters, I can do this:
<pdfVerifyText text="ü"/>
Here the encoding is dealt with correctly. However, if I do this:
<applyFilters>
<pdfToTextFilter lineSep=" "/>
</applyFilters>
<verifyText text="ü"/>
Then I get a failure, because the text extracted from the PDF is in
ISO-8859-1 encoding while my webtest is in UTF-8.
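To illustrate the kind of mismatch I suspect, here is a minimal Python sketch.
It is not WebTest code and the variable names are only for illustration; it
just shows that "ü" is a different byte sequence in the two encodings, so a
comparison fails whenever one side is decoded with the wrong charset. I am
assuming this is what happens somewhere between the filter and verifyText,
but I have not confirmed it.

```python
# The character the test expects to find.
expected = "ü"

# The same character as raw bytes in each encoding.
latin1_bytes = expected.encode("iso-8859-1")  # one byte: 0xFC
utf8_bytes = expected.encode("utf-8")         # two bytes: 0xC3 0xBC

# Decoding with the wrong charset no longer yields "ü".
latin1_read_as_utf8 = latin1_bytes.decode("utf-8", errors="replace")
utf8_read_as_latin1 = utf8_bytes.decode("iso-8859-1")

print(latin1_bytes, utf8_bytes)
print(latin1_read_as_utf8 == expected)  # False
print(utf8_read_as_latin1 == expected)  # False

# Decoding with the matching charset round-trips correctly.
print(latin1_bytes.decode("iso-8859-1") == expected)  # True
```

Either wrong-charset direction would make a text comparison on "ü" fail, which
matches what I see after the filter is applied.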
Is this a bug? Or am I doing something wrong?
Ulrich
_______________________________________________
WebTest mailing list
http://lists.canoo.com/mailman/listinfo/webtest