Thanks Paul.

For future reference: I ended up installing a demo version of a Windows-only
Pdf editor (Infix Pdf Editor) in my VirtualBox environment. The demo is
actually perfectly functional---you cannot save without getting the pdf
files watermarked, but saving  was useless to me. You can select any text
snippet and obtain font information including typeface and size (size is
rounded up, which made for some interesting headscratching before I realized
it. I was converting a book from TeX point sizes to Adobe points (bp).) I
think that's similar to what you can do in Acrobat Pro, but I 've never
tried it.
And no---I did not intend to wite any Java code. I am too old for learning
how to write Java....
Cheers,

Stefano





On Wed, Dec 15, 2010 at 5:35 PM, Paul Rubin <ru...@msu.edu> wrote:

> Not sure how much help this will be, but pdfedit can convert a PDF file to
> XML
> format.  Here's a sample font tag from the XML output of a PDF file:
>
> <font basename="VYKOIK+NimbusRomNo9L-Regu"
> origname="VYKOIK+NimbusRomNo9L-Regu"
> embeddedfontname="VYKOIK+NimbusRomNo9L-Regu" tag="F67" serif="false"
> symbolic="true" italic="false" bold="false" ascent="0.678" descent="-0.216"
> writemode="horizontal" fonttype="Type1 (8bit)">
>
> You can identify the font from the basename attribute, and weight/slant
> from
> obvious attributes.  Size is tricky.  The difference ascent-descent maps to
> the
> font size, but the conversion is not entirely obvious to me.
>
> I also think there are some Java code libraries out there that can parse
> XML
> files, including font size, but I don't know off-hand which they are (and
> wasn't
> sure in any case that you wanted to write Java code).
>
> /Paul
>
>

Reply via email to