[jira] Assigned: (SOLR-1876) Convert all tokenstreams and tests to use CharTermAttribute

2010-04-11 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reassigned SOLR-1876: - Assignee: Robert Muir Convert all tokenstreams and tests to use CharTermAttribute

[jira] Assigned: (SOLR-1874) optimize patternreplacefilter

2010-04-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reassigned SOLR-1874: - Assignee: Robert Muir optimize patternreplacefilter -

[jira] Resolved: (SOLR-1874) optimize patternreplacefilter

2010-04-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved SOLR-1874. --- Resolution: Fixed Committed revision 932752. optimize patternreplacefilter

[jira] Created: (SOLR-1876) Convert all tokenstreams and tests to use CharTermAttribute

2010-04-10 Thread Robert Muir (JIRA)
Convert all tokenstreams and tests to use CharTermAttribute --- Key: SOLR-1876 URL: https://issues.apache.org/jira/browse/SOLR-1876 Project: Solr Issue Type: Task Components:

[jira] Updated: (SOLR-1876) Convert all tokenstreams and tests to use CharTermAttribute

2010-04-10 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1876: -- Attachment: SOLR-1876.patch This patch does the following: * Converts all tokenstreams to use

[jira] Created: (SOLR-1874) optimize patternreplacefilter

2010-04-09 Thread Robert Muir (JIRA)
optimize patternreplacefilter - Key: SOLR-1874 URL: https://issues.apache.org/jira/browse/SOLR-1874 Project: Solr Issue Type: Improvement Components: Schema and Analysis Affects Versions: 3.1

[jira] Updated: (SOLR-1874) optimize patternreplacefilter

2010-04-09 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1874: -- Attachment: SOLR-1874.patch optimize patternreplacefilter -

[jira] Commented: (SOLR-1869) RemoveDuplicatesTokenFilter doest have expected behaviour

2010-04-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854983#action_12854983 ] Robert Muir commented on SOLR-1869: --- bq. this all started because the highlighter was

[jira] Commented: (SOLR-1869) RemoveDuplicatesTokenFilter doest have expected behaviour

2010-04-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854676#action_12854676 ] Robert Muir commented on SOLR-1869: --- Joe, the initialization is the same. I simply prefer

[jira] Updated: (SOLR-1869) RemoveDuplicatesTokenFilter doest have expected behaviour

2010-04-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1869: -- Issue Type: New Feature (was: Bug) RemoveDuplicatesTokenFilter doest have expected behaviour

[jira] Commented: (SOLR-1869) RemoveDuplicatesTokenFilter doest have expected behaviour

2010-04-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1869?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12854712#action_12854712 ] Robert Muir commented on SOLR-1869: --- bq. The CharArrayMap is more performant in lookup,

[jira] Created: (SOLR-1865) ignore byte-order markers in SolrResourceLoader

2010-04-05 Thread Robert Muir (JIRA)
ignore byte-order markers in SolrResourceLoader --- Key: SOLR-1865 URL: https://issues.apache.org/jira/browse/SOLR-1865 Project: Solr Issue Type: Improvement Reporter: Robert Muir

[jira] Updated: (SOLR-1865) ignore byte-order markers in SolrResourceLoader

2010-04-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1865: -- Attachment: SOLR-1865.patch attached is a patch to ignore BOM's at the beginning of files loaded with

[jira] Commented: (SOLR-1860) improve stopwords list handling

2010-04-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12853684#action_12853684 ] Robert Muir commented on SOLR-1860: --- bq. Either we can setup a simple export and

[jira] Commented: (SOLR-1852) enablePositionIncrements=true can cause searches to fail when they are parsed as phrase queries

2010-04-02 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852811#action_12852811 ] Robert Muir commented on SOLR-1852: --- Committed the test to trunk: revision 930262.

[jira] Commented: (SOLR-1860) improve stopwords list handling

2010-04-02 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852978#action_12852978 ] Robert Muir commented on SOLR-1860: --- A third idea from Hoss Man: We should make it easy

[jira] Commented: (SOLR-1859) speed up indexing for example schema

2010-04-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852375#action_12852375 ] Robert Muir commented on SOLR-1859: --- Any objections? If not I would like to commit later

[jira] Resolved: (SOLR-1859) speed up indexing for example schema

2010-04-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved SOLR-1859. --- Resolution: Fixed Committed revision 930050. speed up indexing for example schema

[jira] Created: (SOLR-1860) improve stopwords list handling

2010-04-01 Thread Robert Muir (JIRA)
improve stopwords list handling --- Key: SOLR-1860 URL: https://issues.apache.org/jira/browse/SOLR-1860 Project: Solr Issue Type: Improvement Components: Schema and Analysis Affects Versions: 3.1

[jira] Assigned: (SOLR-1740) ShingleFilterFactory improvements

2010-04-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reassigned SOLR-1740: - Assignee: Robert Muir ShingleFilterFactory improvements -

[jira] Commented: (SOLR-1740) ShingleFilterFactory improvements

2010-04-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852686#action_12852686 ] Robert Muir commented on SOLR-1740: --- Now that we are on Lucene 3.1, it seems like it would

[jira] Commented: (SOLR-1312) BufferedTokenStream should use new Lucene 2.9 TokenStream API

2010-04-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852687#action_12852687 ] Robert Muir commented on SOLR-1312: --- Hello, I recommend we cancel this issue. No Solr

[jira] Updated: (SOLR-1740) ShingleFilterFactory improvements

2010-04-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1740: -- Attachment: SOLR-1740.patch Steven's patch, synced to trunk. I plan to commit shortly, thanks for the

[jira] Updated: (SOLR-1740) ShingleFilterFactory improvements

2010-04-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1740: -- Affects Version/s: (was: 1.5) 3.1 Fix Version/s: 3.1

[jira] Resolved: (SOLR-1740) ShingleFilterFactory improvements

2010-04-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1740?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved SOLR-1740. --- Resolution: Fixed Committed revision 930163. Thanks Steven! ShingleFilterFactory improvements

[jira] Created: (SOLR-1857) cleanup and sync analysis with lucene trunk

2010-03-31 Thread Robert Muir (JIRA)
cleanup and sync analysis with lucene trunk --- Key: SOLR-1857 URL: https://issues.apache.org/jira/browse/SOLR-1857 Project: Solr Issue Type: Task Components: Schema and Analysis Affects

[jira] Updated: (SOLR-1857) cleanup and sync analysis with lucene trunk

2010-03-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1857: -- Attachment: SOLR-1857.patch attached is a regrettably large patch to sync us up, and clean things up a

[jira] Commented: (SOLR-1857) cleanup and sync analysis with lucene trunk

2010-03-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852079#action_12852079 ] Robert Muir commented on SOLR-1857: --- if no one objects, I would like to commit in a day or

[jira] Assigned: (SOLR-1857) cleanup and sync analysis with lucene trunk

2010-03-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reassigned SOLR-1857: - Assignee: Robert Muir cleanup and sync analysis with lucene trunk

[jira] Commented: (SOLR-1857) cleanup and sync analysis with lucene trunk

2010-03-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852213#action_12852213 ] Robert Muir commented on SOLR-1857: --- bq. I just did a 5 min review, not line-by-line, but

[jira] Assigned: (SOLR-1852) enablePositionIncrements=true can cause searches to fail when they are parsed as phrase queries

2010-03-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reassigned SOLR-1852: - Assignee: Robert Muir enablePositionIncrements=true can cause searches to fail when they are

[jira] Commented: (SOLR-1852) enablePositionIncrements=true can cause searches to fail when they are parsed as phrase queries

2010-03-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852216#action_12852216 ] Robert Muir commented on SOLR-1852: --- I'm afraid of WDF, but I don't think I am the only

[jira] Commented: (SOLR-1852) enablePositionIncrements=true can cause searches to fail when they are parsed as phrase queries

2010-03-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12852234#action_12852234 ] Robert Muir commented on SOLR-1852: --- Peter it is... but admittedly it has not been in

[jira] Created: (SOLR-1859) speed up indexing for example schema

2010-03-31 Thread Robert Muir (JIRA)
speed up indexing for example schema Key: SOLR-1859 URL: https://issues.apache.org/jira/browse/SOLR-1859 Project: Solr Issue Type: Task Components: Schema and Analysis Reporter:

[jira] Updated: (SOLR-1859) speed up indexing for example schema

2010-03-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1859: -- Attachment: SOLR-1859.patch attached is a patch. I fixed every instance for general types like text in

[jira] Updated: (SOLR-1852) enablePositionIncrements=true can cause searches to fail when they are parsed as phrase queries

2010-03-28 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1852: -- Attachment: SOLR-1852_testcase.patch attached is a testcase demonstrating the bug. The problem is that

[jira] Resolved: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-03-27 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved SOLR-1710. --- Resolution: Fixed Fix Version/s: 3.1 Assignee: Mark Miller This was resolved in

[jira] Resolved: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-03-27 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved SOLR-1657. --- Resolution: Fixed Fix Version/s: 3.1 Assignee: Mark Miller This was resolved in

[jira] Resolved: (SOLR-1706) wrong tokens output from WordDelimiterFilter depending upon options

2010-03-27 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved SOLR-1706. --- Resolution: Fixed Fix Version/s: 3.1 Assignee: Mark Miller This was resolved in

[jira] Resolved: (SOLR-1820) Remove custom greek/russian charsets encoding

2010-03-27 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved SOLR-1820. --- Resolution: Fixed Fix Version/s: 3.1 Assignee: Robert Muir This was resolved in

[jira] Commented: (SOLR-1852) enablePositionIncrements=true can cause searches to fail when they are parsed as phrase queries

2010-03-27 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12850612#action_12850612 ] Robert Muir commented on SOLR-1852: --- bq. The changes in the patch originate at SOLR-1706

[jira] Commented: (SOLR-1852) enablePositionIncrements=true can cause searches to fail when they are parsed as phrase queries

2010-03-27 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1852?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12850613#action_12850613 ] Robert Muir commented on SOLR-1852: --- ok, so your bug relates somehow to how the

[jira] Commented: (SOLR-1835) speed up and improve tests

2010-03-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12848586#action_12848586 ] Robert Muir commented on SOLR-1835: --- committed revision 926470 to newtrunk. if you have

[jira] Updated: (SOLR-1835) speed up and improve tests

2010-03-22 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1835: -- Attachment: SOLR-1835_parallel.patch attached is a patch to parallelize the tests... improvements can

[jira] Updated: (SOLR-1835) speed up and improve tests

2010-03-22 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1835: -- Attachment: SOLR-1835_parallel.patch updated patch: * doesnt do parallel for the -Dtestcase= case, but

[jira] Updated: (SOLR-1835) speed up and improve tests

2010-03-22 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1835: -- Attachment: SOLR-1835_parallel.patch attached is a new patch: * the output from multiple threads is no

[jira] Updated: (SOLR-1835) speed up and improve tests

2010-03-22 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1835: -- Attachment: SOLR-1835_parallel.patch there was a stray slash in the previous version. this caused some

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-03-15 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12845301#action_12845301 ] Robert Muir commented on SOLR-1804: --- I wonder if you guys have any insight why the results

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-03-15 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12845451#action_12845451 ] Robert Muir commented on SOLR-1804: --- Hi Stanislaw: Correct, I did not upgrade anything

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-03-15 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12845455#action_12845455 ] Robert Muir commented on SOLR-1804: --- Grant I am concerned about a possible BW break in

[jira] Commented: (SOLR-1804) Upgrade Carrot2 to 3.2.0

2010-03-15 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1804?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12845474#action_12845474 ] Robert Muir commented on SOLR-1804: --- Thanks for the confirmation the clusters are ok.

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-03-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Attachment: SOLR-1657_synonyms_ugly_slow.patch A very very ugly, very slow, but simple and conservative

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-03-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Attachment: SOLR-1657_synonyms_ugly_slightly_less_slow.patch attached is a less slow version of the

[jira] Created: (SOLR-1820) Remove custom greek/russian charsets encoding

2010-03-14 Thread Robert Muir (JIRA)
Remove custom greek/russian charsets encoding - Key: SOLR-1820 URL: https://issues.apache.org/jira/browse/SOLR-1820 Project: Solr Issue Type: Task Components: Schema and Analysis

[jira] Updated: (SOLR-1820) Remove custom greek/russian charsets encoding

2010-03-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1820: -- Attachment: SOLR-1820.patch Attached is a patch that removes the deprecates bits. If you try to specify

[jira] Commented: (SOLR-1821) Failing testGetDateFormatEvaluator in TestEvaluatorBag

2010-03-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1821?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12845144#action_12845144 ] Robert Muir commented on SOLR-1821: --- Nice, fixes the issue. Can you commit this? It would

[jira] Assigned: (SOLR-1821) Failing testGetDateFormatEvaluator in TestEvaluatorBag

2010-03-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir reassigned SOLR-1821: - Assignee: Robert Muir Failing testGetDateFormatEvaluator in TestEvaluatorBag

[jira] Resolved: (SOLR-1821) Failing testGetDateFormatEvaluator in TestEvaluatorBag

2010-03-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1821?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved SOLR-1821. --- Resolution: Fixed Fix Version/s: 1.5 Committed revision 922991. Thanks Chris! Failing

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-03-13 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Attachment: SOLR-1657_part2.patch Here's a separate patch (_part2.patch) for all the remaining

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-03-13 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Description: org.apache.solr.analysis: -BufferedTokenStream- - -CommonGramsFilter- -

[jira] Created: (SOLR-1813) Support Arabic PDF extraction

2010-03-08 Thread Robert Muir (JIRA)
Support Arabic PDF extraction - Key: SOLR-1813 URL: https://issues.apache.org/jira/browse/SOLR-1813 Project: Solr Issue Type: Improvement Components: contrib - Solr Cell (Tika extraction) Affects

[jira] Updated: (SOLR-1813) Support Arabic PDF extraction

2010-03-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1813: -- Attachment: SOLR-1813.patch attached is a patch with a testcase. i can shrink the icu4j jar file if

[jira] Updated: (SOLR-1813) Support Arabic PDF extraction

2010-03-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1813: -- Attachment: arabic.pdf the pdf file for contrib/extraction/src/test/resources/arabic.pdf Support

[jira] Updated: (SOLR-1813) Support Arabic PDF extraction

2010-03-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1813: -- Attachment: icu4j-4_2_1.jar the icu4j jar file that goes in contrib/extraction/lib Support Arabic PDF

[jira] Created: (SOLR-1760) convert synonymsfilter to new tokenstream API

2010-02-05 Thread Robert Muir (JIRA)
convert synonymsfilter to new tokenstream API - Key: SOLR-1760 URL: https://issues.apache.org/jira/browse/SOLR-1760 Project: Solr Issue Type: Task Components: Schema and Analysis

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-02-04 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Attachment: SOLR-1657.patch Chris's patch, except also implement BufferedTokenStream. its marked

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-02-04 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Description: org.apache.solr.analysis: -BufferedTokenStream- - -CommonGramsFilter- -

[jira] Commented: (SOLR-1670) synonymfilter/map repeat bug

2010-02-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12829092#action_12829092 ] Robert Muir commented on SOLR-1670: --- bq. Order of overlapping tokens is unimportant in

[jira] Commented: (SOLR-1670) synonymfilter/map repeat bug

2010-02-03 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12829097#action_12829097 ] Robert Muir commented on SOLR-1670: --- bq. Not at the semantic level (for overlapping

[jira] Commented: (SOLR-1670) synonymfilter/map repeat bug

2010-02-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12828204#action_12828204 ] Robert Muir commented on SOLR-1670: --- bq. I left in place the existing test method, which

[jira] Commented: (SOLR-1670) synonymfilter/map repeat bug

2010-01-31 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12806833#action_12806833 ] Robert Muir commented on SOLR-1670: --- Steven, i don't have a problem with your patch (I do

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-26 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12805187#action_12805187 ] Robert Muir commented on SOLR-1677: --- bq. 2) Perhaps you should read the StopFilter example

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-20 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12802979#action_12802979 ] Robert Muir commented on SOLR-1677: --- bq. The point I was trying to make is that the types

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-14 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12800518#action_12800518 ] Robert Muir commented on SOLR-1677: --- bq. The implication i got from Robert was that there

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-11 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798921#action_12798921 ] Robert Muir commented on SOLR-1677: --- {quote} which is why i think it's a bad idea to have

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-11 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798936#action_12798936 ] Robert Muir commented on SOLR-1677: --- bq. WTF?!?! ... now i feel like you are just messing

[jira] Created: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
convert worddelimiterfilter to new tokenstream API -- Key: SOLR-1710 URL: https://issues.apache.org/jira/browse/SOLR-1710 Project: Solr Issue Type: Improvement Components: Schema and

[jira] Updated: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1710: -- Attachment: SOLR-1710.patch convert worddelimiterfilter to new tokenstream API

[jira] Commented: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798199#action_12798199 ] Robert Muir commented on SOLR-1657: --- Yonik, I agree, this is almost what the current patch

[jira] Updated: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1710: -- Attachment: SOLR-1710.patch for the 'wdf is only modifying single word with punctuation', don't

[jira] Commented: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798239#action_12798239 ] Robert Muir commented on SOLR-1710: --- Yonik, thanks. Again i have a hesitation: the

[jira] Commented: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798241#action_12798241 ] Robert Muir commented on SOLR-1710: --- Chris, not really, if you see the description i say:

[jira] Updated: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1710: -- Description: This one was a doozy, attached is a patch to convert it to the new tokenstream API. Some

[jira] Commented: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798248#action_12798248 ] Robert Muir commented on SOLR-1710: --- Chris, no problem, I created this confusion until the

[jira] Commented: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798255#action_12798255 ] Robert Muir commented on SOLR-1657: --- bq. Not sure... I guess it depends on the attribute

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Description: org.apache.solr.analysis: BufferedTokenStream - -CommonGramsFilter- -

[jira] Commented: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798261#action_12798261 ] Robert Muir commented on SOLR-1710: --- thanks in advance chris, I will help with testing and

[jira] Commented: (SOLR-1710) convert worddelimiterfilter to new tokenstream API

2010-01-08 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12798268#action_12798268 ] Robert Muir commented on SOLR-1710: --- chris yeah, its supposed to be similar to

[jira] Commented: (SOLR-1706) wrong tokens output from WordDelimiterFilter when english possessives are in the text

2010-01-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12797829#action_12797829 ] Robert Muir commented on SOLR-1706: --- its not just the concatenation, but also the subword

[jira] Updated: (SOLR-1706) wrong tokens output from WordDelimiterFilter depending upon options

2010-01-07 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1706: -- Description: below you can see that when I have requested to only output numeric concatenations (not

[jira] Commented: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-01-06 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12797161#action_12797161 ] Robert Muir commented on SOLR-1657: --- Hello, I am working on WordDelimiterFilter and I have

[jira] Created: (SOLR-1706) wrong tokens output from WordDelimiterFilter when english possessives are in the text

2010-01-06 Thread Robert Muir (JIRA)
wrong tokens output from WordDelimiterFilter when english possessives are in the text - Key: SOLR-1706 URL: https://issues.apache.org/jira/browse/SOLR-1706 Project:

[jira] Commented: (SOLR-1706) wrong tokens output from WordDelimiterFilter when english possessives are in the text

2010-01-06 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12797366#action_12797366 ] Robert Muir commented on SOLR-1706: --- by the way, i do not have a patch here. i am putting

[jira] Commented: (SOLR-1706) wrong tokens output from WordDelimiterFilter when english possessives are in the text

2010-01-06 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12797466#action_12797466 ] Robert Muir commented on SOLR-1706: --- ok i narrowed this one down some, appears to be

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2010-01-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Description: org.apache.solr.analysis: BufferedTokenStream - -CommonGramsFilter- -

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12796862#action_12796862 ] Robert Muir commented on SOLR-1677: --- bq. Oh come on now ... that's not really a fair

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-05 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12796965#action_12796965 ] Robert Muir commented on SOLR-1677: --- {quote} No, he uses an OS where he can upgrade

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-04 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12796136#action_12796136 ] Robert Muir commented on SOLR-1677: --- {quote} User Carl helpfully replies... That was

[jira] Commented: (SOLR-1677) Add support for o.a.lucene.util.Version for BaseTokenizerFactory and BaseTokenFilterFactory

2010-01-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1677?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12795760#action_12795760 ] Robert Muir commented on SOLR-1677: --- bq. But as i said: i don't see any compelling need

[jira] Updated: (SOLR-1657) convert the rest of solr to use the new tokenstream API

2009-12-23 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-1657: -- Attachment: SOLR-1657.patch converts CommonGramsFilter, CommonGramsQueryFilter, and

  1   2   3   >