[Dspace-tech] SRW/Lucene/Dspace - dc schema customization, index and search
Hi all ! i'm quite new to the dspace/lucene/SRW environment. I've set up a dspace 1.7 instance with the SRW module. I'm now able to do the following queries: http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys= with the default fields in the SRW search interface: cql.resultSetId cql.serverChoice dc.contributor dc.creator dc.description.abstract dc.description.sponsorship dc.description.statementofresponsibility dc.description.tableofcontents dc.format.mimetype dc.identifier dc.relation.ispartofseries dc.subject dc.title after i've added some additional fields in the dc schema (from http://10.0.102.11:8080/xmlui/admin/metadata-registry): dc.test.kind and dc.test.content (if more easy it wouldn't be a problem to create them in separate new schema). This because i would like to use those fields from an external application (for internal purposes). Following the documentation i've tried to configure SRW/Dspace with the following configuration: dspace/config/DSpace.SRWDatabase.props indexSynonym.dc.creator=author indexSynonym.dc.contributor=author indexSynonym.dc.description.statementofresponsibility=author indexSynonym.dc.title=title indexSynonym.dc.subject=keyword indexSynonym.dc.description.abstract=abstract indexSynonym.dc.description.tableofcontents=abstract indexSynonym.dc.relation.ispartofseries=series indexSynonym.dc.format.mimetype=mime indexSynonym.dc.description.sponsorship=sponsor indexSynonym.dc.identifier=identifier indexSynonym.cql.serverChoice=default indexSynonym.dc.test.kind=kind indexSynonym.dc.test.content=content dspace.cfg # Fields to Index for Search # # DC metadata elements.qualifiers to be indexed for search # format: - search.index.[number] = [search field]:element.qualifier # - * used as wildcard ### changing these will change your search results, ### ### but will NOT automatically change your search displays ### search.index.1 = author:dc.contributor.* search.index.2 = author:dc.creator.* search.index.3 = title:dc.title.* search.index.4 = keyword:dc.subject.* search.index.5 = abstract:dc.description.abstract search.index.6 = author:dc.description.statementofresponsibility search.index.7 = series:dc.relation.ispartofseries search.index.8 = abstract:dc.description.tableofcontents search.index.9 = mime:dc.format.mimetype search.index.10 = sponsor:dc.description.sponsorship search.index.11 = identifier:dc.identifier.* search.index.12 = language:dc.language.iso search.index.13 = kind:dc.test.kind search.index.14 = content:dc.test.content then i've tried to rebuild the index but i couldn't find the script /[dspace]/bin/index-init, i've check also in [dspace-source] but nothing, should i download something separately? i was looking at this documentation: https://wiki.duraspace.org/display/DSDOC/Configuration#Configuration-ConfiguringLuceneSearchIndexes Now what i would like to do is to search with SRW in the Dspace/Lucene index with the customized metadata, for example by dc.test.kind=video with a query like this: http://localhost:8080/SRW/search/DSpace/?query=dc.test.kind+%3D+%22video%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys= but i'm definitively doing something wrong because it's not working!! :) i'm i missing something? is there some interesting documentation/tutorial on this subject? thank you!! Kocisky -- Colocation vs. Managed Hosting A question and answer guide to determining the best fit for your organization - today and in the future. http://p.sf.net/sfu/internap-sfd2d ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech
Re: [Dspace-tech] SRW/Lucene/Dspace - dc schema customization, index and search
When i'm fetching a record : http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys= it fetches also the customized dc metadata: Results for Search: dc.identifier = 11 Result Set Identifier: hqwurd Records found: 1 Record: 1 of 1 Schema: Dublin Core dc:contributor.author: Administrator dc:date.accessioned:2011-03-10T19:02:35Z dc:date.available: 2011-03-10T19:02:35Z dc:date.issued: 2011-03-10 dc:identifier.uri: http://hdl.handle.net/123456789/11 dc:description.provenance: Submitted by admin (ad...@domain.org) on 2011-03-10T19:02:35Z No. of bitstreams: 1 Dovetail.Tasting.pdf: 234082 bytes, checksum: bd7989f5f5d9b5c95ffef358b2a67fa4 (MD5) dc:description.provenance: Made available in DSpace on 2011-03-10T19:02:35Z (GMT). No. of bitstreams: 1 Dovetail.Tasting.pdf: 234082 bytes, checksum: bd7989f5f5d9b5c95ffef358b2a67fa4 (MD5) dc:language.iso:en_US dc:subject: Books/Canon/Misc dc:subject: People dc:title: Development Item 1 dc:test.kind: video dc:test.content:rafting hope it helps, K On 16 March 2011 15:19, Kocisky koci...@autistici.org wrote: Hi all ! i'm quite new to the dspace/lucene/SRW environment. I've set up a dspace 1.7 instance with the SRW module. I'm now able to do the following queries: http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys= with the default fields in the SRW search interface: cql.resultSetId cql.serverChoice dc.contributor dc.creator dc.description.abstract dc.description.sponsorship dc.description.statementofresponsibility dc.description.tableofcontents dc.format.mimetype dc.identifier dc.relation.ispartofseries dc.subject dc.title after i've added some additional fields in the dc schema (from http://10.0.102.11:8080/xmlui/admin/metadata-registry): dc.test.kind and dc.test.content (if more easy it wouldn't be a problem to create them in separate new schema). This because i would like to use those fields from an external application (for internal purposes). Following the documentation i've tried to configure SRW/Dspace with the following configuration: dspace/config/DSpace.SRWDatabase.props indexSynonym.dc.creator=author indexSynonym.dc.contributor=author indexSynonym.dc.description.statementofresponsibility=author indexSynonym.dc.title=title indexSynonym.dc.subject=keyword indexSynonym.dc.description.abstract=abstract indexSynonym.dc.description.tableofcontents=abstract indexSynonym.dc.relation.ispartofseries=series indexSynonym.dc.format.mimetype=mime indexSynonym.dc.description.sponsorship=sponsor indexSynonym.dc.identifier=identifier indexSynonym.cql.serverChoice=default indexSynonym.dc.test.kind=kind indexSynonym.dc.test.content=content dspace.cfg # Fields to Index for Search # # DC metadata elements.qualifiers to be indexed for search # format: - search.index.[number] = [search field]:element.qualifier # - * used as wildcard ### changing these will change your search results, ### ### but will NOT automatically change your search displays ### search.index.1 = author:dc.contributor.* search.index.2 = author:dc.creator.* search.index.3 = title:dc.title.* search.index.4 = keyword:dc.subject.* search.index.5 = abstract:dc.description.abstract search.index.6 = author:dc.description.statementofresponsibility search.index.7 = series:dc.relation.ispartofseries search.index.8 = abstract:dc.description.tableofcontents search.index.9 = mime:dc.format.mimetype search.index.10 = sponsor:dc.description.sponsorship search.index.11 = identifier:dc.identifier.* search.index.12 = language:dc.language.iso search.index.13 = kind:dc.test.kind search.index.14 = content:dc.test.content then i've tried to rebuild the index but i couldn't find the script /[dspace]/bin/index-init, i've check also in [dspace-source] but nothing, should i download something separately? i was looking at this documentation: https://wiki.duraspace.org/display/DSDOC/Configuration#Configuration-ConfiguringLuceneSearchIndexes Now what i would like to do is to search with SRW in the Dspace/Lucene index with the customized metadata, for example by dc.test.kind=video with a query like this: http://localhost:8080/SRW/search/DSpace/?query=dc.test.kind+%3D+%22video%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys= but i'm definitively doing something wrong because it's not working!! :) i'm i missing something? is there some interesting documentation/tutorial on this
Re: [Dspace-tech] SRW/Lucene/Dspace - dc schema customization, index and search
ok, never mind... the right command is [dspace]/bin/dspace index-init and now: http://10.0.102.11:8080/SRW/search/DSpace/?query=dc.test.kind+%3D+%22video%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys= works ! nice... K On 16 March 2011 15:26, Kocisky koci...@autistici.org wrote: When i'm fetching a record : http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys= it fetches also the customized dc metadata: Results for Search: dc.identifier = 11 Result Set Identifier: hqwurd Records found: 1 Record: 1 of 1 Schema: Dublin Core dc:contributor.author: Administrator dc:date.accessioned: 2011-03-10T19:02:35Z dc:date.available: 2011-03-10T19:02:35Z dc:date.issued: 2011-03-10 dc:identifier.uri: http://hdl.handle.net/123456789/11 dc:description.provenance: Submitted by admin (ad...@domain.org) on 2011-03-10T19:02:35Z No. of bitstreams: 1 Dovetail.Tasting.pdf: 234082 bytes, checksum: bd7989f5f5d9b5c95ffef358b2a67fa4 (MD5) dc:description.provenance: Made available in DSpace on 2011-03-10T19:02:35Z (GMT). No. of bitstreams: 1 Dovetail.Tasting.pdf: 234082 bytes, checksum: bd7989f5f5d9b5c95ffef358b2a67fa4 (MD5) dc:language.iso: en_US dc:subject: Books/Canon/Misc dc:subject: People dc:title: Development Item 1 dc:test.kind: video dc:test.content: rafting hope it helps, K On 16 March 2011 15:19, Kocisky koci...@autistici.org wrote: Hi all ! i'm quite new to the dspace/lucene/SRW environment. I've set up a dspace 1.7 instance with the SRW module. I'm now able to do the following queries: http://localhost:8080/SRW/search/DSpace/?query=dc.identifier+%3D+%2211%22version=1.1operation=searchRetrieverecordSchema=info%3Asrw%2Fschema%2F1%2Fdc-v1.1maximumRecords=10startRecord=1resultSetTTL=300recordPacking=xmlrecordXPath=sortKeys= with the default fields in the SRW search interface: cql.resultSetId cql.serverChoice dc.contributor dc.creator dc.description.abstract dc.description.sponsorship dc.description.statementofresponsibility dc.description.tableofcontents dc.format.mimetype dc.identifier dc.relation.ispartofseries dc.subject dc.title after i've added some additional fields in the dc schema (from http://10.0.102.11:8080/xmlui/admin/metadata-registry): dc.test.kind and dc.test.content (if more easy it wouldn't be a problem to create them in separate new schema). This because i would like to use those fields from an external application (for internal purposes). Following the documentation i've tried to configure SRW/Dspace with the following configuration: dspace/config/DSpace.SRWDatabase.props indexSynonym.dc.creator=author indexSynonym.dc.contributor=author indexSynonym.dc.description.statementofresponsibility=author indexSynonym.dc.title=title indexSynonym.dc.subject=keyword indexSynonym.dc.description.abstract=abstract indexSynonym.dc.description.tableofcontents=abstract indexSynonym.dc.relation.ispartofseries=series indexSynonym.dc.format.mimetype=mime indexSynonym.dc.description.sponsorship=sponsor indexSynonym.dc.identifier=identifier indexSynonym.cql.serverChoice=default indexSynonym.dc.test.kind=kind indexSynonym.dc.test.content=content dspace.cfg # Fields to Index for Search # # DC metadata elements.qualifiers to be indexed for search # format: - search.index.[number] = [search field]:element.qualifier # - * used as wildcard ### changing these will change your search results, ### ### but will NOT automatically change your search displays ### search.index.1 = author:dc.contributor.* search.index.2 = author:dc.creator.* search.index.3 = title:dc.title.* search.index.4 = keyword:dc.subject.* search.index.5 = abstract:dc.description.abstract search.index.6 = author:dc.description.statementofresponsibility search.index.7 = series:dc.relation.ispartofseries search.index.8 = abstract:dc.description.tableofcontents search.index.9 = mime:dc.format.mimetype search.index.10 = sponsor:dc.description.sponsorship search.index.11 = identifier:dc.identifier.* search.index.12 = language:dc.language.iso search.index.13 = kind:dc.test.kind search.index.14 = content:dc.test.content then i've tried to rebuild the index but i couldn't find the script /[dspace]/bin/index-init, i've check also in [dspace-source] but nothing, should i download something separately? i was looking at this documentation: https://wiki.duraspace.org/display/DSDOC/Configuration#Configuration-ConfiguringLuceneSearchIndexes Now what i would like to do is to search with SRW in the Dspace/Lucene index with the customized metadata, for example by
Re: [Dspace-tech] SRW/Lucene/Dspace - dc schema customization, index and search
On Wed, Mar 16, 2011 at 20:58, Kocisky koci...@autistici.org wrote: ok, never mind... the right command is [dspace]/bin/dspace index-init You were quick to figure it out yourself :) I updated the documentation. Regards, ~~helix84 -- Colocation vs. Managed Hosting A question and answer guide to determining the best fit for your organization - today and in the future. http://p.sf.net/sfu/internap-sfd2d ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech