Hi Kent, On Jul 25, 4:27 pm, Kent Tenney <[email protected]> wrote: > - use (for example) pdfminer to pull text from the pdf, > it's pdf2text script will do a page split. This results in > plain text which has lots of advantages over rendered pages.
I used "pdftotext -layout" for this, but within Leo a python module makes of course more sense, even if it is much slower. I really want rendered PDF pages, however, because of tables, images etc, which and PDF to text conversion cannot handle properly. > Maybe the OpenOffice (LibreOffice) Python bindings could help. > OO has great pdf capability. Under Debian/Ubuntu the binding between Python and OO is simply done by the uno module, but at least on Windows you need to run the Python that comes with OOo - other distributions won't work. Since it is a pain to convert the OOo Python to be used as the system Python, this practically limits such a solution to Linux. It may also work on the Mac, but a colleague had lots of problems finding the right combination of Python, uno module and OOo, as the uno module did not get distributed with OOo regularly. Even if you could control OOo from Leo, this would not give you a PDF viewer within Leo. Also, I think a solution based on the poppler library has a better chance to cover also upcoming versions of PDF, as it seems to be well supported and is the basis of several viewers. So, I find it would probably best to look into the Qt widget you mentioned. - Josef -- You received this message because you are subscribed to the Google Groups "leo-editor" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit this group at http://groups.google.com/group/leo-editor?hl=en.
