On 12/10/2023 20:20, Steve Vestal wrote:
I couldn't resist trying https://purl.obolibrary.org/obo/foodon.owl as a stress test for what we are doing.  We're on Jena 4.5.0 and I'm getting

Not in RDF/XML format due to exception org.apache.jena.riot.RiotException [line: 110334, col: 72] Invalid byte 2 of 2-byte UTF-8 sequence.
("Not in RDF/XML format due to..." does not appear to be a Jena message)

At that location:

"...(/ˈærɪkə/ or /əˈriːkə/)..."
                        ^
(This email is UTF-8)

Line/column for encoding problems aren't always right but it looks like it is here.

Works for me in 3.17.0, 4.5.0, 5.0.0-dev

JVM_ARGS="-DentityExpansionLimit=2000000" riot --validate --count foodon.owl

Could this be due to my Jena version or Eclipse or Windows or UTF-8?

Windows most likely.
It can happen if the data has been piped at the command line.

    Andy


On 10/12/2023 1:42 PM, Andy Seaborne wrote:
Thanks. It parses OK.

On Thu, 12 Oct 2023, 19:36 Jim Balhoff, <balh...@gmail.com> wrote:

On Oct 6, 2023, at 3:46 AM, Andy Seaborne <a...@apache.org> wrote:


On 28/06/2023 09:26, Damion Dooley wrote:
I’m using RIOT to parse a large food ontology in owl rdf/xml format.
Damion,

Is that data publicly available?

There's a new RDF/XML parser for Jena in the pipeline and I'd like to
try it out on real data.

Andy,

Damion is active in FOODON, so that may be the ontology to try:
http://obofoundry.org/ontology/foodon.html

The ontology is at https://purl.obolibrary.org/obo/foodon.owl

- Jim



Reply via email to