create a non-Porter Stemming analyzer for DSpace
------------------------------------------------

                 Key: DS-849
                 URL: https://jira.duraspace.org/browse/DS-849
             Project: DSpace
          Issue Type: Improvement
          Components: Documentation, DSpace API
    Affects Versions: 1.7.0, 1.6.2, 1.6.1, 1.6.0, 1.5.2, 1.5.1, 1.5.0
            Reporter: Hardy Pottinger
         Attachments: DS-disable-porter-stemming.patch

For some use cases for DSpace, the index produced by the standard search 
analyzer (org.dspace.search.DSAnalyzer) produces unsatisfactorily imprecise 
results. Creating an alternate analyzer, which omits PorterStemFilter, will be 
helpful in those use cases. See these threads for more of the backstory:

http://comments.gmane.org/gmane.comp.db.dspace.user/13404
http://comments.gmane.org/gmane.comp.db.dspace.user/13407
http://comments.gmane.org/gmane.comp.db.dspace.user/13420
http://comments.gmane.org/gmane.comp.db.dspace.user/13427

I'm attaching a patch, but it's more of a kit. You must first copy 
[dspace-src]/dspace-api/src/main/java/org/dspace/search/DSAnalyzer.java to 
[dspace-src]/dspace-api/src/main/java/org/dspace/search/DSNonStemmingAnalyzer.java,
 then you can apply the patch.

After patching, you must alter your dspace.cfg file, uncommenting and changing 
the search.analyzer line so that it reads:

search.analyzer = org.dspace.search.DSNonStemmingAnalyzer

Then, do the following:

 * stop Tomcat (taking down your DSpace instance)
 * re-index all content in your DSpace by running:
       [dspace]/bin/dspace index-init
 * start Tomcat 
 * test

All credit for this work goes to Tim Donohue and Stuart Yeates, I just put the 
pieces together into this patch and ticket.

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
https://jira.duraspace.org/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

------------------------------------------------------------------------------
Colocation vs. Managed Hosting
A question and answer guide to determining the best fit
for your organization - today and in the future.
http://p.sf.net/sfu/internap-sfd2d
_______________________________________________
Dspace-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-devel

Reply via email to