https://bugs.documentfoundation.org/show_bug.cgi?id=153888

V Stuart Foote <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |[email protected]

--- Comment #6 from V Stuart Foote <[email protected]> ---
Created attachment 185656
  --> https://bugs.documentfoundation.org/attachment.cgi?id=185656&action=edit
sample doc with the 1.6 and 1.4 sample PDFs as pdfium filter imports

First thing, LibreOffice is *NOT* a PDF editor.

That said there are two import filter paths for handling the PS markup embedded
in PDF. 

One, using the pdfium libs will fully parse PDF pages rending each full page to
a very high fidelity against the source PDF as a raster image of appropriate
size and scale.  The attached is a two page ODG Draw document with the two
"input" PDF inserted each to a page.

The second, a C++ PDF filter parses the individual elements described in the
PDF and lays them out (to Draw, Writer, or Impress canvas depending on filter
selected) as sets of drawing objects--Text boxes, shapes, grids, raster images.

The text taken from the PS text runs that have no contextual syntax, will be
assembled into drawing object text boxes.  Fonts that are not installed to
system, or that have an unrecognizable title in the PDF, will receive some
other fall-back font at some arbitrary size.  Multiple text box objects will be
assembled onto single text box runs--but beyond that there is no reference to
the source material used to prepare the PDF.  In Draw, text runs from those
individual text boxes can be "consolidated" (see bug 32249) into a single text
box, and the resulting text formatted. Or copied and pasted into a proper
paragraph object depending on need.  The point is, for this filter import the
resulting content that LibreOffice extracts from the PDF is not intended to
have high fidelity to the original source used to generate the PDF.

You can have high fidelity with the pdfium based filter, or you can extract
some percentage of the PDF content and render to drawing shape/text--but you
can't do both with LibreOffice.

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to