[ https://issues.apache.org/jira/browse/LUCENE-882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Daniel Naber updated LUCENE-882: -------------------------------- Attachment: lucene-spellchecker.diff don't store but only index ngrams > Spellchecker doesn't need to store ngrams > ----------------------------------------- > > Key: LUCENE-882 > URL: https://issues.apache.org/jira/browse/LUCENE-882 > Project: Lucene - Java > Issue Type: Improvement > Components: Other > Affects Versions: 2.1 > Reporter: Daniel Naber > Attachments: lucene-spellchecker.diff > > > The spellchecker in contrib stores the ngrams although this doesn't seem to > be necessary. This patch changes that, I will commit it unless someone > objects. This improves indexing speed and index size. Some numbers on a small > test I did: > Input of the original index: 2200 text files, index size 5.3 MB, indexing > took 17 seconds > Spell index before patch: about 60.000 documents, index size 13 MB, indexing > took 62 seconds > Spell index after patch: about 60.000 documents, index size 6.3 MB, indexing > took 52 seconds > BTW, the test case fails even before this patch. I'll probaby submit another > issue about how to fix that. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]