[Dspace-tech] SRW/Lucene/Dspace - dc schema customization, index and search

2011-03-16 Thread Kocisky
Hi all !

i'm quite new to the dspace/lucene/SRW environment. I've set up a
dspace 1.7 instance with the SRW module. I'm now able to do the
following queries:

http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys=

with the default fields in the SRW search interface:

cql.resultSetId
cql.serverChoice
dc.contributor
dc.creator
dc.description.abstract
dc.description.sponsorship
dc.description.statementofresponsibility
dc.description.tableofcontents
dc.format.mimetype
dc.identifier
dc.relation.ispartofseries
dc.subject
dc.title

after i've added some additional fields in the dc schema (from
http://10.0.102.11:8080/xmlui/admin/metadata-registry): dc.test.kind
and dc.test.content (if more easy it wouldn't be a problem to create
them in separate new schema). This because i would like to use those
fields from an external application (for internal purposes).

Following the documentation i've tried to configure SRW/Dspace with
the following configuration:

dspace/config/DSpace.SRWDatabase.props

indexSynonym.dc.creator=author
indexSynonym.dc.contributor=author
indexSynonym.dc.description.statementofresponsibility=author
indexSynonym.dc.title=title
indexSynonym.dc.subject=keyword
indexSynonym.dc.description.abstract=abstract
indexSynonym.dc.description.tableofcontents=abstract
indexSynonym.dc.relation.ispartofseries=series
indexSynonym.dc.format.mimetype=mime
indexSynonym.dc.description.sponsorship=sponsor
indexSynonym.dc.identifier=identifier
indexSynonym.cql.serverChoice=default

indexSynonym.dc.test.kind=kind
indexSynonym.dc.test.content=content


dspace.cfg

# Fields to Index for Search #

# DC metadata elements.qualifiers to be indexed for search
# format: - search.index.[number] = [search field]:element.qualifier
# - * used as wildcard

###  changing these will change your search results, ###
###  but will NOT automatically change your search displays  ###

search.index.1 = author:dc.contributor.*
search.index.2 = author:dc.creator.*
search.index.3 = title:dc.title.*
search.index.4 = keyword:dc.subject.*
search.index.5 = abstract:dc.description.abstract
search.index.6 = author:dc.description.statementofresponsibility
search.index.7 = series:dc.relation.ispartofseries
search.index.8 = abstract:dc.description.tableofcontents
search.index.9 = mime:dc.format.mimetype
search.index.10 = sponsor:dc.description.sponsorship
search.index.11 = identifier:dc.identifier.*
search.index.12 = language:dc.language.iso
search.index.13 = kind:dc.test.kind
search.index.14 = content:dc.test.content

then i've tried to rebuild the index but i couldn't find the script
/[dspace]/bin/index-init, i've check also in [dspace-source] but
nothing, should i download something separately?
i was looking at this documentation:
https://wiki.duraspace.org/display/DSDOC/Configuration#Configuration-ConfiguringLuceneSearchIndexes

Now what i would like to do is to search with SRW in the Dspace/Lucene
index with the customized metadata, for example by
dc.test.kind=video with a query like this:

http://localhost:8080/SRW/search/DSpace/?query=dc.test.kind+%3D+%22video%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys=

but i'm definitively doing something wrong because it's not working!! :)
i'm i missing something? is there some interesting
documentation/tutorial on this subject?

thank you!!
Kocisky

--
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] SRW/Lucene/Dspace - dc schema customization, index and search

2011-03-16 Thread Kocisky
When i'm fetching a record :

http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys=

it fetches also the customized dc metadata:

Results for Search: dc.identifier = 11
Result Set Identifier: hqwurd
Records found: 1
Record: 1 of 1
Schema: Dublin Core
dc:contributor.author:  Administrator
dc:date.accessioned:2011-03-10T19:02:35Z
dc:date.available:  2011-03-10T19:02:35Z
dc:date.issued: 2011-03-10
dc:identifier.uri:  http://hdl.handle.net/123456789/11
dc:description.provenance:  Submitted by admin (ad...@domain.org) on
2011-03-10T19:02:35Z No. of bitstreams: 1 Dovetail.Tasting.pdf: 234082
bytes, checksum: bd7989f5f5d9b5c95ffef358b2a67fa4 (MD5)
dc:description.provenance:  Made available in DSpace on
2011-03-10T19:02:35Z (GMT). No. of bitstreams: 1 Dovetail.Tasting.pdf:
234082 bytes, checksum: bd7989f5f5d9b5c95ffef358b2a67fa4 (MD5)
dc:language.iso:en_US
dc:subject: Books/Canon/Misc
dc:subject: People
dc:title:   Development Item 1
dc:test.kind:   video
dc:test.content:rafting

hope it helps,
K

On 16 March 2011 15:19, Kocisky koci...@autistici.org wrote:
 Hi all !

 i'm quite new to the dspace/lucene/SRW environment. I've set up a
 dspace 1.7 instance with the SRW module. I'm now able to do the
 following queries:

 http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys=

 with the default fields in the SRW search interface:

 cql.resultSetId
 cql.serverChoice
 dc.contributor
 dc.creator
 dc.description.abstract
 dc.description.sponsorship
 dc.description.statementofresponsibility
 dc.description.tableofcontents
 dc.format.mimetype
 dc.identifier
 dc.relation.ispartofseries
 dc.subject
 dc.title

 after i've added some additional fields in the dc schema (from
 http://10.0.102.11:8080/xmlui/admin/metadata-registry): dc.test.kind
 and dc.test.content (if more easy it wouldn't be a problem to create
 them in separate new schema). This because i would like to use those
 fields from an external application (for internal purposes).

 Following the documentation i've tried to configure SRW/Dspace with
 the following configuration:

 dspace/config/DSpace.SRWDatabase.props

 indexSynonym.dc.creator=author
 indexSynonym.dc.contributor=author
 indexSynonym.dc.description.statementofresponsibility=author
 indexSynonym.dc.title=title
 indexSynonym.dc.subject=keyword
 indexSynonym.dc.description.abstract=abstract
 indexSynonym.dc.description.tableofcontents=abstract
 indexSynonym.dc.relation.ispartofseries=series
 indexSynonym.dc.format.mimetype=mime
 indexSynonym.dc.description.sponsorship=sponsor
 indexSynonym.dc.identifier=identifier
 indexSynonym.cql.serverChoice=default

 indexSynonym.dc.test.kind=kind
 indexSynonym.dc.test.content=content


 dspace.cfg

 # Fields to Index for Search #

 # DC metadata elements.qualifiers to be indexed for search
 # format: - search.index.[number] = [search field]:element.qualifier
 # - * used as wildcard

 ###  changing these will change your search results, ###
 ###  but will NOT automatically change your search displays  ###

 search.index.1 = author:dc.contributor.*
 search.index.2 = author:dc.creator.*
 search.index.3 = title:dc.title.*
 search.index.4 = keyword:dc.subject.*
 search.index.5 = abstract:dc.description.abstract
 search.index.6 = author:dc.description.statementofresponsibility
 search.index.7 = series:dc.relation.ispartofseries
 search.index.8 = abstract:dc.description.tableofcontents
 search.index.9 = mime:dc.format.mimetype
 search.index.10 = sponsor:dc.description.sponsorship
 search.index.11 = identifier:dc.identifier.*
 search.index.12 = language:dc.language.iso
 search.index.13 = kind:dc.test.kind
 search.index.14 = content:dc.test.content

 then i've tried to rebuild the index but i couldn't find the script
 /[dspace]/bin/index-init, i've check also in [dspace-source] but
 nothing, should i download something separately?
 i was looking at this documentation:
 https://wiki.duraspace.org/display/DSDOC/Configuration#Configuration-ConfiguringLuceneSearchIndexes

 Now what i would like to do is to search with SRW in the Dspace/Lucene
 index with the customized metadata, for example by
 dc.test.kind=video with a query like this:

 http://localhost:8080/SRW/search/DSpace/?query=dc.test.kind+%3D+%22video%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys=

 but i'm definitively doing something wrong because it's not working!! :)
 i'm i missing something? is there some interesting
 documentation/tutorial on this 

Re: [Dspace-tech] SRW/Lucene/Dspace - dc schema customization, index and search

2011-03-16 Thread Kocisky
ok, never mind... the right command is [dspace]/bin/dspace index-init

and now:

http://10.0.102.11:8080/SRW/search/DSpace/?query=dc.test.kind+%3D+%22video%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys=

works ! nice...
K

On 16 March 2011 15:26, Kocisky koci...@autistici.org wrote:
 When i'm fetching a record :

 http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys=

 it fetches also the customized dc metadata:

 Results for Search: dc.identifier = 11
 Result Set Identifier: hqwurd
 Records found: 1
 Record: 1 of 1
 Schema: Dublin Core
 dc:contributor.author:  Administrator
 dc:date.accessioned:    2011-03-10T19:02:35Z
 dc:date.available:      2011-03-10T19:02:35Z
 dc:date.issued:         2011-03-10
 dc:identifier.uri:      http://hdl.handle.net/123456789/11
 dc:description.provenance:      Submitted by admin (ad...@domain.org) on
 2011-03-10T19:02:35Z No. of bitstreams: 1 Dovetail.Tasting.pdf: 234082
 bytes, checksum: bd7989f5f5d9b5c95ffef358b2a67fa4 (MD5)
 dc:description.provenance:      Made available in DSpace on
 2011-03-10T19:02:35Z (GMT). No. of bitstreams: 1 Dovetail.Tasting.pdf:
 234082 bytes, checksum: bd7989f5f5d9b5c95ffef358b2a67fa4 (MD5)
 dc:language.iso:        en_US
 dc:subject:     Books/Canon/Misc
 dc:subject:     People
 dc:title:       Development Item 1
 dc:test.kind:   video
 dc:test.content:        rafting

 hope it helps,
 K

 On 16 March 2011 15:19, Kocisky koci...@autistici.org wrote:
 Hi all !

 i'm quite new to the dspace/lucene/SRW environment. I've set up a
 dspace 1.7 instance with the SRW module. I'm now able to do the
 following queries:

 http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys=

 with the default fields in the SRW search interface:

 cql.resultSetId
 cql.serverChoice
 dc.contributor
 dc.creator
 dc.description.abstract
 dc.description.sponsorship
 dc.description.statementofresponsibility
 dc.description.tableofcontents
 dc.format.mimetype
 dc.identifier
 dc.relation.ispartofseries
 dc.subject
 dc.title

 after i've added some additional fields in the dc schema (from
 http://10.0.102.11:8080/xmlui/admin/metadata-registry): dc.test.kind
 and dc.test.content (if more easy it wouldn't be a problem to create
 them in separate new schema). This because i would like to use those
 fields from an external application (for internal purposes).

 Following the documentation i've tried to configure SRW/Dspace with
 the following configuration:

 dspace/config/DSpace.SRWDatabase.props

 indexSynonym.dc.creator=author
 indexSynonym.dc.contributor=author
 indexSynonym.dc.description.statementofresponsibility=author
 indexSynonym.dc.title=title
 indexSynonym.dc.subject=keyword
 indexSynonym.dc.description.abstract=abstract
 indexSynonym.dc.description.tableofcontents=abstract
 indexSynonym.dc.relation.ispartofseries=series
 indexSynonym.dc.format.mimetype=mime
 indexSynonym.dc.description.sponsorship=sponsor
 indexSynonym.dc.identifier=identifier
 indexSynonym.cql.serverChoice=default

 indexSynonym.dc.test.kind=kind
 indexSynonym.dc.test.content=content


 dspace.cfg

 # Fields to Index for Search #

 # DC metadata elements.qualifiers to be indexed for search
 # format: - search.index.[number] = [search field]:element.qualifier
 # - * used as wildcard

 ###  changing these will change your search results, ###
 ###  but will NOT automatically change your search displays  ###

 search.index.1 = author:dc.contributor.*
 search.index.2 = author:dc.creator.*
 search.index.3 = title:dc.title.*
 search.index.4 = keyword:dc.subject.*
 search.index.5 = abstract:dc.description.abstract
 search.index.6 = author:dc.description.statementofresponsibility
 search.index.7 = series:dc.relation.ispartofseries
 search.index.8 = abstract:dc.description.tableofcontents
 search.index.9 = mime:dc.format.mimetype
 search.index.10 = sponsor:dc.description.sponsorship
 search.index.11 = identifier:dc.identifier.*
 search.index.12 = language:dc.language.iso
 search.index.13 = kind:dc.test.kind
 search.index.14 = content:dc.test.content

 then i've tried to rebuild the index but i couldn't find the script
 /[dspace]/bin/index-init, i've check also in [dspace-source] but
 nothing, should i download something separately?
 i was looking at this documentation:
 https://wiki.duraspace.org/display/DSDOC/Configuration#Configuration-ConfiguringLuceneSearchIndexes

 Now what i would like to do is to search with SRW in the Dspace/Lucene
 index with the customized metadata, for example by
 

Re: [Dspace-tech] SRW/Lucene/Dspace - dc schema customization, index and search

2011-03-16 Thread helix84
On Wed, Mar 16, 2011 at 20:58, Kocisky koci...@autistici.org wrote:
 ok, never mind... the right command is [dspace]/bin/dspace index-init

You were quick to figure it out yourself :)
I updated the documentation.

Regards,
~~helix84

--
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech