Hi Kent,

On Jul 25, 4:27 pm, Kent Tenney <[email protected]> wrote:
> - use (for example) pdfminer to pull text from the pdf,
>   it's pdf2text script will do a page split. This results in
>   plain text which has lots of advantages over rendered pages.

I used "pdftotext -layout" for this, but within Leo a python module
makes of course more sense, even if it is much slower.

I really want rendered PDF pages, however, because of tables, images
etc, which and PDF to text conversion cannot handle properly.

> Maybe the OpenOffice (LibreOffice) Python bindings could help.
> OO has great pdf capability.

Under Debian/Ubuntu the binding between Python and OO is simply done
by the uno module, but at least on Windows you need to run the Python
that comes with OOo - other distributions won't work. Since it is a
pain to convert the OOo Python to be used as the system Python, this
practically limits such a solution to Linux. It may also work on the
Mac, but a colleague had lots of problems finding the right
combination of Python, uno module and OOo, as the uno module did not
get distributed with OOo regularly. Even if you could control OOo from
Leo, this would not give you a PDF viewer within Leo.

Also, I think a solution based on the poppler library has a better
chance to cover also upcoming versions of PDF, as it seems to be well
supported and is the basis of several viewers.
So, I find it would probably best to look into the Qt widget you
mentioned.

- Josef

-- 
You received this message because you are subscribed to the Google Groups 
"leo-editor" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/leo-editor?hl=en.

Reply via email to