DSpace Devs,
I believe I've found one, possibly two, harvesting-related bugs in
DSpace 3.2. I manage a meta-repository which harvests metadata and
bitstream references from eleven separate institutional repositories.
Of those repositories, five are running DSpace 3.2, two are running
DSpace 1.8, and three others are running Digital Commons (for which
bitstream references cannot be harvested). I've confirmed that the
behavior in question occurs on all the IRs running DSpace 3.2, but does
not occur on either of the IRs running DSpace 1.8.
The nature of the bugs is as follows:
Within the ORE.xml of item records, the value for
"atom:entry/oreatom:triples/rdf:Description/@rdf:about" should (as I
understand it) follow a pattern of approximately:
"http://[domain].[ext]/[site-prefix?]/bitstream/[handle]/[item-number]/[bitstream-number]/[filename]"
However, records I harvest from repositories using dspace 3.2 have one of
the following two problems:
1) '%2F' (a URL-encoded slash) instead of '/' between [handle] and [item-number].
2) The literal string 'bitstream' instead of [filename].
This results in values for the @rdf:about field like the following:
"http://[domain].[ext]/[site-prefix?]/bitstream/[handle]%2F[item-number]/[bitstream-number]/bitstream"
One last note of import: although the first problem seems to always be
present, the filename is not always replaced by the string "bitstream".
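
For anyone who wants to check their own harvested records, the two
symptoms above can be detected with a few lines of Python. This is only
a minimal sketch, not DSpace code: the function names are hypothetical,
the namespace URIs are assumed to be the standard Atom, ORE Atom and RDF
ones, and only rdf:about values whose path contains "/bitstream/" are
examined.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlsplit

# Assumed namespace URIs (standard Atom, ORE Atom and RDF namespaces).
NS = {
    "atom": "http://www.w3.org/2005/Atom",
    "oreatom": "http://www.openarchives.org/ore/atom/",
    "rdf": "http://www.w3.org/1999/02/22-rdf-syntax-ns#",
}
RDF_ABOUT = "{%s}about" % NS["rdf"]


def rdf_about_values(ore_xml):
    """Return every oreatom:triples/rdf:Description/@rdf:about value."""
    root = ET.fromstring(ore_xml)
    descriptions = root.findall(".//oreatom:triples/rdf:Description", NS)
    return [d.get(RDF_ABOUT) for d in descriptions if d.get(RDF_ABOUT)]


def diagnose(url):
    """List which of the two reported symptoms a bitstream URL shows."""
    path = urlsplit(url).path  # urlsplit leaves percent-escapes untouched
    if "/bitstream/" not in path:
        return []  # not a bitstream reference; nothing to check
    problems = []
    if "%2f" in path.lower():
        problems.append("encoded-slash")
    if path.rstrip("/").rsplit("/", 1)[-1] == "bitstream":
        problems.append("filename-replaced")
    return problems


def report(ore_xml):
    """Map each affected rdf:about value to its list of symptoms."""
    found = {}
    for url in rdf_about_values(ore_xml):
        problems = diagnose(url)
        if problems:
            found[url] = problems
    return found
```

Calling report() on the text of a harvested ORE.xml returns a dict that
is empty when no bitstream reference shows either symptom.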
I've compiled a list of one example from each of our harvested
institutions. Please keep in mind the issue(s) seem to be present only
in version 3.2 of DSpace. Also, the repository doing the harvesting is
running 1.8.3.
Any assistance with resolving these issues would be greatly appreciated.
Sincerely,
- Patrick E.
----- -----
Albany State University (dspace 3.2):
item - http://gaknowledge.org/DRI/handle/10675.1/55
mets.xml - http://gaknowledge.org/metadata/handle/10675.1/55/mets.xml
ore.xml - http://gaknowledge.org/bitstream/handle/10675.1/55/ORE.xml?sequence=1
Coastal College of Georgia (dspace 3.2):
item - http://gaknowledge.org/DRI/handle/10675.4/11
mets.xml - http://gaknowledge.org/metadata/handle/10675.4/11/mets.xml
ore.xml - http://gaknowledge.org/bitstream/handle/10675.4/11/ORE.xml?sequence=1
Columbus State University (dspace 3.2):
item - http://gaknowledge.org/DRI/handle/META/37373
mets.xml - http://gaknowledge.org/metadata/handle/META/37373/mets.xml
ore.xml - http://gaknowledge.org/bitstream/handle/META/37373/ORE.xml?sequence=1
Georgia Gwinnett College (dspace 3.2):
item - http://gaknowledge.org/DRI/handle/10675.3/31
mets.xml - http://gaknowledge.org/metadata/handle/10675.3/31/mets.xml
ore.xml - http://gaknowledge.org/bitstream/handle/10675.3/31/ORE.xml?sequence=1
Georgia Institute of Technology (dspace 1.8):
item - http://gaknowledge.org/DRI/handle/1853/49228
mets.xml - http://gaknowledge.org/metadata/handle/1853/49228/mets.xml
ore.xml - http://gaknowledge.org/bitstream/handle/1853/49228/ORE.xml?sequence=1
Georgia Regents University (dspace 3.2):
item - http://gaknowledge.org/DRI/handle/10675.2/381
mets.xml - http://gaknowledge.org/metadata/handle/10675.2/381/mets.xml
ore.xml - http://gaknowledge.org/bitstream/handle/10675.2/381/ORE.xml?sequence=1
Georgia Southern University (digital commons):
item - http://gaknowledge.org/DRI/handle/META/47921
mets.xml - http://gaknowledge.org/metadata/handle/META/47921/mets.xml
ore.xml - n/a
Georgia State University (digital commons):
item - http://gaknowledge.org/DRI/handle/META/48086
mets.xml - http://gaknowledge.org/metadata/handle/META/48086/mets.xml
ore.xml - n/a
Kennesaw State University (digital commons):
item - http://gaknowledge.org/DRI/handle/META/49849
mets.xml - http://gaknowledge.org/metadata/handle/META/49849/mets.xml
ore.xml - n/a
Valdosta State University (dspace 1.8):
item - http://gaknowledge.org/DRI/handle/10428/1186
mets.xml - http://gaknowledge.org/metadata/handle/10428/1186/mets.xml
ore.xml - http://gaknowledge.org/bitstream/handle/10428/1186/ORE.xml?sequence=1
--
P. Kieran Etienne
Systems Analyst
Georgia Institute of Technology
Atlanta, GA 30332
404.385.8121