Hello,
I noticed some weird behaviour when trying to import items into DSpace using
command line and simple archive format. I noticed that if I have bitstreams,
whose names contain both SPACEs and scandinavian special characters, import
fails, when the OS cannot find the bitstream in question.
For instance, a bitstream name with space is ok:
Adding item from directory item_002
Loading dublin core from ark/item_002/dublin_core.xml
...
Processing contents file: ark/item_002/contents
Bitstream: Digigraduille uusi prosessi.pdf
Bitstream name with 'ä' (a+uml) is ok:
Adding item from directory item_005
Loading dublin core from ark/item_005/dublin_core.xml
...
Bitstream: Käisä1.pdf
But this is not ok:
Adding item from directory item_006
Loading dublin core from ark/item_006/dublin_core.xml
...
java.io.FileNotFoundException: ark/item_006/Kirjastoelämää Bolognassa.pdf (No
such file or directory)
...
java.io.FileNotFoundException: ark/item_006/Kirjastoelämää Bolognassa.pdf (No
such file or directory)
stracing the import gives the underlying error:
21515 open("ark/item_006/Kirjastoel\303\244m\303\244\303\244 Bolognassa.pdf",
O_RDONLY) = -1 ENOENT (No such file or directory)
I'm using RHEL 7.2, with LANG=en_US.UTF-8. I'm not sure whether is some
operating system (or even filesystem? XFS) specific behaviour, or if the java
is the culprit, or if this could be helped with some Java IO magic (and thus
worth opening a ticket). I tested this with DSpace 6.0, but I think this would
happen with other versions as well.
Ilja Sidoroff
Information Systems Specialist
Helsinki University Library
--
You received this message because you are subscribed to the Google Groups
"DSpace Technical Support" group.
To unsubscribe from this group and stop receiving emails from it, send an email
to [email protected].
To post to this group, send email to [email protected].
Visit this group at https://groups.google.com/group/dspace-tech.
For more options, visit https://groups.google.com/d/optout.