On Monday, 28 March 2011, Tim Brody wrote:
> On Fri, 2011-03-25 at 20:43 +0000, Albert Astals Cid wrote:
> > On Friday, 25 March 2011, you wrote:
> > > On Fri, 25 Mar 2011 19:02:46 +0000, Albert Astals Cid <[email protected]>
> > >
> > > NB I just tried extracting from a Word-generated PDF and TextOutputDev
> > > didn't see the line with the diacritic at all.
> >
> > And are you sure it's not a Word fault?
>
> (What tool do you use to de-compress/analyse PDFs?)

I usually use a custom patch I have lying around for poppler to decompress the
streams, or podofo/podofobrowser.

> Here's the PDF file generated with Word 2010:
> http://users.ecs.soton.ac.uk/tdb2/ms_word_accents.pdf

I don't have Acrobat X at hand, but Acrobat 9 doesn't seem to be able to
extract text from that PDF either, so I would not worry about it.

Albert

> _______________________________________________
> poppler mailing list
> [email protected]
> http://lists.freedesktop.org/mailman/listinfo/poppler
