[ https://issues.apache.org/jira/browse/SOLR-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Robert Muir updated SOLR-1859:
------------------------------

    Attachment: SOLR-1859.patch

Attached is a patch. I fixed every instance for general types like "text" in every schema file I could find, including test ones and commented-out instances. All tests pass.

> speed up indexing for example schema
> ------------------------------------
>
>                 Key: SOLR-1859
>                 URL: https://issues.apache.org/jira/browse/SOLR-1859
>             Project: Solr
>          Issue Type: Task
>          Components: Schema and Analysis
>            Reporter: Robert Muir
>            Assignee: Robert Muir
>             Fix For: 3.1
>
>         Attachments: SOLR-1859.patch
>
>
> The example schema should use the Lucene core PorterStemmer (coded in Java by Martin Porter)
> instead of the Snowball one, which is auto-generated code.
> Although we have sped up the Snowball stemmer, it's still pretty slow, and the example should be fast.
> Below is the output of ant test -Dtestcase=TestIndexingPerformance -Dargs="-server -Diter=100000"
> These results are consistent with large document indexing times that I have seen on large English
> collections with Lucene: we double indexing speed.
> {noformat}
> solr1.5branch:
> iter=100000 time=5841 throughput=17120
> iter=100000 time=5839 throughput=17126
> iter=100000 time=6017 throughput=16619
> trunk (unpatched):
> iter=100000 time=4132 throughput=24201
> iter=100000 time=4142 throughput=24142
> iter=100000 time=4151 throughput=24090
> trunk (patched):
> iter=100000 time=2998 throughput=33355
> iter=100000 time=3021 throughput=33101
> iter=100000 time=3006 throughput=33266
> {noformat}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
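
The "we double indexing speed" claim can be checked against the benchmark figures quoted in the issue. The following is a minimal sketch, not part of the patch: it averages the three throughput values reported for each build and compares them. The dictionary keys and function names (`THROUGHPUT`, `mean_throughput`, `speedup`) are made up for this illustration, and it assumes the three runs per build are directly comparable.

```python
from statistics import mean

# Throughput figures copied from the benchmark output quoted in the issue.
# The keys are labels chosen for this sketch, not names used by Solr.
THROUGHPUT = {
    "solr1.5branch": [17120, 17126, 16619],
    "trunk_unpatched": [24201, 24142, 24090],
    "trunk_patched": [33355, 33101, 33266],
}


def mean_throughput(runs):
    """Average throughput over the reported runs of one build."""
    return mean(runs)


def speedup(baseline, candidate):
    """Ratio of mean throughput: candidate build over baseline build."""
    return mean_throughput(THROUGHPUT[candidate]) / mean_throughput(THROUGHPUT[baseline])


if __name__ == "__main__":
    for baseline in ("solr1.5branch", "trunk_unpatched"):
        ratio = speedup(baseline, "trunk_patched")
        print(f"trunk_patched vs {baseline}: {ratio:.2f}x")
```

On these numbers the patched trunk comes out at roughly 1.96x the 1.5 branch and roughly 1.38x the unpatched trunk, so the doubling appears to be measured against the 1.5 branch rather than against unpatched trunk.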