A user of our software pointed me to this piece of software, which I think might occasionally come in handy:
http://www.mobipocket.com/dev/pdf2xml/

It appears that it can deal with embedded fonts, and it apparently produces excellently structured output.

Unfortunately I have not managed to compile it, although I thought it should be straightforward. There is no makefile, just a short explanation for compiling, which I seem to have completely misunderstood. What I did at the top level of the source directory was:

    gcc -I xpdf -I xpdf/fofi -I xpdf/goo -I xpdf/xpdf -I image/zlib -I image/png -I /usr/include pdf2xml.cpp

But this was obviously wrong. It produced a huge pile of error messages suggesting that it could not find all kinds of libpng-related things. I have, of course, installed libpoppler-dev and libpng-dev.

I am aware of a whole bunch of public domain texts which are only published in PDF format, but which would suddenly become accessible to us with such a tool.

Peter
_______________________________________________
sword-devel mailing list: [email protected]
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page
