Package: python3-rdflib
Version: 4.2.1-2
Severity: normal
Tags: upstream

Dear Maintainer,

The following script won't parse a local RDFa file :
from rdflib import Graph

g = Graph()
g.parse("papier-icssea2011.html", format="rdfa")

len(g) # prints 2

import pprint
for stmt in g:
    pprint.pprint(stmt)


It seems that provides an explanation, as patching
/usr/lib/python3/dist-packages/rdflib/plugins/parsers/pyRdfa/__init__.py
by commenting line 448 (not setting  self.charset = 'utf-8' when
provided a local file) will solve the issue.

I guess a better fix upstream should be needed though

Hope this helps.

Best regards,

-- System Information:
Debian Release: stretch/sid
  APT prefers stable-updates
  APT policy: (500, 'stable-updates'), (500, 'testing')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 4.8.0-2-amd64 (SMP w/8 CPU cores)
Locale: LANG=fr_FR.utf8, LC_CTYPE=fr_FR.utf8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages python3-rdflib depends on:
ii  python3-isodate    0.5.4-1
ii  python3-pyparsing  2.1.10+dfsg1-1
pn  python3:any        <none>

Versions of packages python3-rdflib recommends:
ii  python3-html5lib       0.999999999-1
ii  python3-sparqlwrapper  1.7.6-2

Versions of packages python3-rdflib suggests:
ii  python-rdflib-doc  4.2.1-2

-- no debconf information

Reply via email to