> However, if you follow the instructions at > http://wiki.scribus.net/index.php/Web_optimised_PDF , you will find that > (apart from compressing the PDF files, which you were not asking for) the > text extracted by pdftotext now becomes an almost perfect representation > of the original text. > I haven't investigated this in detail, and there may be encoding issues, > etc, but I found the results striking. >
Well it does work very fine indeed! So sla -> pdf -> ps -> pdf -> txt is a perfect process.
