It is complaining about the file name IRI.  According to the error,
Jena wants IRIs (and thus filenames) to be in Unicode Normal Form C
[1]. I don't know if there is any such requirement in the IRI standard?
I found the following issue which was discussing to turn the "warnings"
off: https://issues.apache.org/jira/browse/JENA-864 but they weren't.

The characters in the filename that are not in "Normal Form C" are
those with oxia, according to the normalisation procedures all oxia
should be replaced with tonos because they look very similar.  Of
course, normalisation is a loss of information.

I guess Andy will know more (maybe also why it is an error in the
filename and only a warning in the jsonld itself)

[1] https://unicode.org/reports/tr15/

On Sun, 2024-12-08 at 08:56 +0100, zPlus wrote:
> Apache Jena RIOT version 5.2.0
> Debian GNU/Linux 12 (bookworm)
> 
> 
> I have a file called
> "Εισαγωγή_στον_στοχαστικό_λογισμό_2015_Χελιώτης.jsonld",
> that I'm trying to load into a TDB2 database (content of the file
> below), but it
> does not pass validation:
> 
> 
> riot --validate
> ../data/nodes/Εισαγωγή_στον_στοχαστικό_λογισμό_2015_Χελιώτης.jsonld 
> org.apache.jena.irix.IRIException:
> <file:///storage/980/data/nodes/Εισαγωγή_στον_στοχαστικό_λογισμό_2015
> _Χελιώτης.j
> sonld> Code: 46/NOT_NFC in PATH: The IRI is not in Unicode Normal
> Form C.
>       at
> org.apache.jena.irix.IRIProviderJenaIRI.exceptions(IRIProviderJenaIRI
> .java:272)
>       at
> org.apache.jena.irix.IRIProviderJenaIRI.newIRIxJena(IRIProviderJenaIR
> I.java:155)
>       at
> org.apache.jena.irix.IRIProviderJenaIRI.newIRIxJena(IRIProviderJenaIR
> I.java:151)
>       at
> org.apache.jena.irix.IRIProviderJenaIRI$IRIxJena.resolve(IRIProviderJ
> enaIRI.java
> :86)
>       at org.apache.jena.irix.IRIs.toBase(IRIs.java:106)
>       at
> org.apache.jena.riot.RDFParserBuilder.build(RDFParserBuilder.java:668
> )
>       at riotcmd.CmdLangParse.parseRIOT(CmdLangParse.java:388)
>       at riotcmd.CmdLangParse.parseFile(CmdLangParse.java:337)
>       at riotcmd.CmdLangParse.exec$(CmdLangParse.java:234)
>       at riotcmd.CmdLangParse.exec(CmdLangParse.java:174)
>       at org.apache.jena.cmd.CmdMain.mainMethod(CmdMain.java:87)
>       at org.apache.jena.cmd.CmdMain.mainRun(CmdMain.java:56)
>       at org.apache.jena.cmd.CmdMain.mainRun(CmdMain.java:43)
>       at riotcmd.riot.main(riot.java:29)
> 
> 
> However, just by renaming the file to something else such as
> "file.json"
> everything works as aspected. There seems to be an issue with the
> file name, but
> I cannot understand what's wrong about it. I'm wondering if this is a
> bug, or if
> I'm missing something obvious.
> 
> 
> {
>     "@context": {
>         "library": "dokk:vocab:library:",
>         "license": "dokk:vocab:license:",
>         "license:licensed_under": { "@type": "@id" },
>         "license:translation_of": { "@type": "@id" },
>         "blob": "dokk:vocab:blob:"
>     },
>     "@type": [ "library:Item" ],
>     "@id": "dokk:Εισαγωγή_στον_στοχαστικό_λογισμό_2015_Χελιώτης",
>     "library:website":
> "https://repository.kallipos.gr/handle/11419/4143";,
>     "library:author": [ "Δημήτρης Χελιώτης" ],
>     "license:licensed_under": [ "dokk:license:CC-BY-NC-ND-3.0" ],
>     "library:title": "Εισαγωγή στον στοχαστικό λογισμό",
>     "blob:at": {
>         "@id":
> "file:/pdf/Εισαγωγή_στον_στοχαστικό_λογισμό_2015_Χελιώτης.pdf",
>         "blob:primary_source":
> "
> https://repository.kallipos.gr/bitstream/11419/4143/4/StochastikosLogi
> smos.pdf"
> ,
>         "blob:retrieval_date": "2023-11-07"
>     }
> }
> 
> 

Reply via email to