El dijous, 14 d’abril de 2022, a les 13:16:58 (CEST), Pablo Rodríguez va escriure: > Dear list, > > I use the Python bindings for Poppler (through GObject introspection) to > extract some metadata from PDF documents. > > Here is a minimal script: > > > import sys > > import os > > import gi > > gi.require_version('Poppler', '0.18') > > from gi.repository import Poppler > > gi.require_version('Gst', '1.0') > > from gi.repository import Gst > > Gst.init(sys.argv) > > pdf = "a.pdf" > > uri = Gst.filename_to_uri(os.path.abspath(pdf)) > > doc = Poppler.Document.new_from_file(uri, None) > > title = doc.get_title() > > print(title) > > > Is there a way that I can extract the /Lang value from the /Catalog > dictionary? (Attached PDF document with that entry.)
No, thought it should be relatively easy to add support for it. Will you contribute a patch? Cheers, Albert > > I’m afraid I searched https://lazka.github.io/pgi-docs/, but I wasn’t > able to find anything that could give the language from the document. > > Many thanks for your help, > > Pablo >