[jira] [Commented] (LUCENE-2956) Support updateDocument() with DWPTs

2011-04-13 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019234#comment-13019234 ] Simon Willnauer commented on LUCENE-2956: - bq. Though it worries me a little how

Re: Patch for http_proxy support in solr-ruby client

2011-04-13 Thread Duncan Robertson
Hi Otis, The fork you're talking about is mine! But the repo I forked is not official, so I am trying to find out where the official version is so I can patch it. D On 13/04/2011 04:45, Otis Gospodnetic otis_gospodne...@yahoo.com wrote: Hi, Hm, maybe you are asking where solr-ruby actually

[jira] [Updated] (LUCENE-2956) Support updateDocument() with DWPTs

2011-04-13 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-2956: Attachment: LUCENE-2956.patch here is an updated patch fixing some spellings, adds atomic

[HUDSON] Lucene-Solr-tests-only-trunk - Build # 7061 - Failure

2011-04-13 Thread Apache Hudson Server
Build: https://hudson.apache.org/hudson/job/Lucene-Solr-tests-only-trunk/7061/ 14 tests failed. REGRESSION: org.apache.lucene.index.TestIndexReaderReopen.testThreadSafety Error Message: Error occurred in thread Thread-110:

TestIndexWriterDelete#testUpdatesOnDiskFull can false fail

2011-04-13 Thread Simon Willnauer
In TestIndexWriterDelete#testUpdatesOnDiskFull especially between line 538 and 553 we could get a random exception from the MockDirectoryWrapper which makes the test fail since we are not catching / expecting those exceptions. I can make this fail on trunk even in 1000 runs but on realtime it

[jira] [Commented] (LUCENE-2956) Support updateDocument() with DWPTs

2011-04-13 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019298#comment-13019298 ] Simon Willnauer commented on LUCENE-2956: - I committed that patch and merged with

Re: Numerical ids for terms?

2011-04-13 Thread Toke Eskildsen
On Tue, 2011-04-12 at 11:41 +0200, Gregor Heinrich wrote: Hi -- has there been any effort to create a numerical representation of Lucene indices. That is, to use the Lucene Directory backend as a large term-document matrix at index level. As this would require bijective mapping between

[jira] [Commented] (LUCENE-3018) Lucene Native Directory implementation need automated build

2011-04-13 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019326#comment-13019326 ] Simon Willnauer commented on LUCENE-3018: - varun, pastebin links are not ideal

Re: Patch for http_proxy support in solr-ruby client

2011-04-13 Thread Erik Hatcher
Duncan - I'm the original creator of solr-ruby and put it under Solr's svn. But many folks are now using RSolr, and even in our own (JRuby-based product) we use simply Net::HTTP and not a library like solr-ruby or RSolr. I don't personally have incentive to continue to maintain solr-ruby,

[jira] [Commented] (LUCENE-3018) Lucene Native Directory implementation need automated build

2011-04-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019329#comment-13019329 ] Uwe Schindler commented on LUCENE-3018: --- Hi, I suggest using ANT Contrib for

[jira] [Commented] (LUCENE-3018) Lucene Native Directory implementation need automated build

2011-04-13 Thread Varun Thacker (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019356#comment-13019356 ] Varun Thacker commented on LUCENE-3018: --- I made the mistake of adding the

Re: [HUDSON] Lucene-trunk - Build # 1528 - Still Failing

2011-04-13 Thread Michael McCandless
GC overhead limit exceeded... Mike http://blog.mikemccandless.com On Tue, Apr 12, 2011 at 10:43 PM, Apache Hudson Server hud...@hudson.apache.org wrote: Build: https://hudson.apache.org/hudson/job/Lucene-trunk/1528/ 1 tests failed. REGRESSION:  

Re: TestIndexWriterDelete#testUpdatesOnDiskFull can false fail

2011-04-13 Thread Michael McCandless
+1 Mike http://blog.mikemccandless.com On Wed, Apr 13, 2011 at 5:58 AM, Simon Willnauer simon.willna...@googlemail.com wrote: In TestIndexWriterDelete#testUpdatesOnDiskFull especially between line 538 and 553 we could get a random exception from the MockDirectoryWrapper which makes the test

[jira] [Commented] (LUCENE-2956) Support updateDocument() with DWPTs

2011-04-13 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019370#comment-13019370 ] Jason Rutherglen commented on LUCENE-2956: -- Simon, nice work. I agree with

[HUDSON] Lucene-Solr-tests-only-3.x - Build # 7062 - Failure

2011-04-13 Thread Apache Hudson Server
Build: https://hudson.apache.org/hudson/job/Lucene-Solr-tests-only-3.x/7062/ 1 tests failed. REGRESSION: org.apache.lucene.collation.TestCollationKeyAnalyzer.testThreadSafe Error Message: Java heap space Stack Trace: java.lang.OutOfMemoryError: Java heap space at

need help in constructing a query

2011-04-13 Thread Ramamurthy, Premila
I need help constructing a Solr query. I need the values for a field, but only values that do not contain an embedded space: the value of the indexed field should not have an embedded space. Please help. Thanks, Premila

[jira] [Commented] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Tommaso Teofili (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019377#comment-13019377 ] Tommaso Teofili commented on SOLR-2436: --- Hello Koji, your patch seems fine to me from

[jira] [Updated] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Tommaso Teofili (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tommaso Teofili updated SOLR-2436: -- Attachment: SOLR-2436-3.patch move uimaConfig to under the uima's update processor in

Re: Patch for http_proxy support in solr-ruby client

2011-04-13 Thread Duncan Robertson
Thanks Erik, I hadn't seen RSolr and it looks like it fixes all the problems I was having. Maybe rather than keeping many solutions, I'll just take a look at this one. Duncan On 13/04/2011 14:51, Erik Hatcher erik.hatc...@gmail.com wrote: Duncan - I'm the original creator of solr-ruby and

GSoC: LUCENE-2308: Separately specify a field's type

2011-04-13 Thread Nikola Tanković
Hi all, if everything goes well I'll be delighted to be part of this project this summer together with my assigned mentor Mike. My task will be to introduce new classes to Lucene core which will make it possible to separate a Field's Lucene properties from its value (

[jira] [Created] (LUCENE-3026) smartcn analysis throw NullPointer exception when the length of analysed text over 32767

2011-04-13 Thread wangzhenghang (JIRA)
smartcn analysis throw NullPointer exception when the length of analysed text over 32767 Key: LUCENE-3026 URL: https://issues.apache.org/jira/browse/LUCENE-3026

[jira] [Updated] (LUCENE-3026) smartcn analysis throw NullPointer exception when the length of analysed text over 32767

2011-04-13 Thread wangzhenghang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangzhenghang updated LUCENE-3026: -- Description: That's all because of org.apache.lucene.analysis.cn.smart.hhmm.SegGraph's

[jira] [Commented] (LUCENE-2312) Search on IndexWriter's RAM Buffer

2011-04-13 Thread Jason Rutherglen (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019391#comment-13019391 ] Jason Rutherglen commented on LUCENE-2312: -- In the current patch, I'm copying

Re: Numerical ids for terms?

2011-04-13 Thread Gregor Heinrich
Thanks Toke and Kirill -- I guess that's the way to go (at least until v4.0). Best regards gregor On 4/13/11 3:42 PM, Toke Eskildsen wrote: On Tue, 2011-04-12 at 11:41 +0200, Gregor Heinrich wrote: Hi -- has there been any effort to create a numerical representation of Lucene indices. That

[jira] [Updated] (SOLR-64) strict hierarchical facets

2011-04-13 Thread Relephant (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-64?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Relephant updated SOLR-64: -- Attachment: SOLR-64_3.1.0.diff Hi all, we have just tried to apply solr-64 to 3.1. Attached SOLR-64_3.1.0.diff.

[jira] [Updated] (SOLR-64) strict hierarchical facets

2011-04-13 Thread Relephant (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-64?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Relephant updated SOLR-64: -- Attachment: (was: SOLR-64_3.1.0.diff) strict hierarchical facets --

[jira] [Updated] (SOLR-64) strict hierarchical facets

2011-04-13 Thread Relephant (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-64?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Relephant updated SOLR-64: -- Attachment: SOLR-64_3.1.0.patch strict hierarchical facets -- Key:

[jira] [Issue Comment Edited] (SOLR-64) strict hierarchical facets

2011-04-13 Thread Relephant (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-64?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019399#comment-13019399 ] Relephant edited comment on SOLR-64 at 4/13/11 4:04 PM: Hi all, we

[jira] [Commented] (LUCENE-2939) Highlighter should try and use maxDocCharsToAnalyze in WeightedSpanTermExtractor when adding a new field to MemoryIndex as well as when using CachingTokenStream

2011-04-13 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-2939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019421#comment-13019421 ] Mark Miller commented on LUCENE-2939: - Okay - I'm going to commit to trunk shortly.

PayloadProcessorProvider Usage

2011-04-13 Thread Shai Erera
Hey, In Lucene 3.1 we've introduced PayloadProcessorProvider which allows you to rewrite payloads of terms during merge. The main scenario is when you merge indexes, and you want to rewrite/remap payloads of the incoming indexes, but one can certainly use it to rewrite the payloads of a term, in

[jira] [Commented] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019465#comment-13019465 ] Uwe Schindler commented on SOLR-2436: - I just looked at the patch, is the

[jira] [Commented] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019470#comment-13019470 ] Uwe Schindler commented on SOLR-2436: - Here is the new way to load XML from

[jira] [Issue Comment Edited] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019470#comment-13019470 ] Uwe Schindler edited comment on SOLR-2436 at 4/13/11 6:14 PM: --

[jira] [Commented] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019473#comment-13019473 ] Uwe Schindler commented on SOLR-2436: - Maybe we should add my last comment into the

[jira] [Commented] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019474#comment-13019474 ] Mark Miller commented on SOLR-2436: --- bq. Maybe we should add my last comment into the

[jira] [Commented] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Mark Miller (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019476#comment-13019476 ] Mark Miller commented on SOLR-2436: --- Or perhaps we need a utility method and pointer to

Re: An IDF variation with penalty for very rare terms

2011-04-13 Thread Marvin Humphrey
On Wed, Apr 13, 2011 at 01:01:09AM +0400, Earwin Burrfoot wrote: Excuse me for going somewhat off-topic, but has anybody ever seen/used -subj-? Something that looks like http://dl.dropbox.com/u/920413/IDFplusplus.png Traditional log(N/x) tail, but when nearing zero freq, instead of

[HUDSON] Lucene-Solr-tests-only-3.x - Build # 7075 - Failure

2011-04-13 Thread Apache Hudson Server
Build: https://hudson.apache.org/hudson/job/Lucene-Solr-tests-only-3.x/7075/ 1 tests failed. REGRESSION: org.apache.lucene.collation.TestCollationKeyAnalyzer.testThreadSafe Error Message: Java heap space Stack Trace: java.lang.OutOfMemoryError: Java heap space at

[jira] [Updated] (LUCENE-3026) smartcn analyzer throw NullPointer exception when the length of analysed text over 32767

2011-04-13 Thread wangzhenghang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangzhenghang updated LUCENE-3026: -- Summary: smartcn analyzer throw NullPointer exception when the length of analysed text over

[jira] [Commented] (LUCENE-3026) smartcn analyzer throw NullPointer exception when the length of analysed text over 32767

2011-04-13 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019636#comment-13019636 ] Robert Muir commented on LUCENE-3026: - This sounds like a bug, do you want to try

[jira] [Commented] (LUCENE-3022) DictionaryCompoundWordTokenFilter Flag onlyLongestMatch has no affect

2011-04-13 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019637#comment-13019637 ] Robert Muir commented on LUCENE-3022: - This sounds like a bug, do you want to try

[jira] [Commented] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019649#comment-13019649 ] Koji Sekiguchi commented on SOLR-2436: -- Hi Uwe, The problematic snippet regarding

[jira] [Commented] (SOLR-2436) move uimaConfig to under the uima's update processor in solrconfig.xml

2011-04-13 Thread Koji Sekiguchi (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-2436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019652#comment-13019652 ] Koji Sekiguchi commented on SOLR-2436: -- The patch looks good, Tommaso! If it is going

[jira] [Created] (SOLR-2467) Custom analyzer load exceptions are not logged.

2011-04-13 Thread Alexander Kistanov (JIRA)
Custom analyzer load exceptions are not logged. --- Key: SOLR-2467 URL: https://issues.apache.org/jira/browse/SOLR-2467 Project: Solr Issue Type: Bug Affects Versions: 3.1

[HUDSON] Lucene-Solr-tests-only-3.x - Build # 7082 - Failure

2011-04-13 Thread Apache Hudson Server
Build: https://hudson.apache.org/hudson/job/Lucene-Solr-tests-only-3.x/7082/ No tests ran. Build Log (for compile errors): [...truncated 118 lines...]

[jira] [Commented] (LUCENE-3026) smartcn analyzer throw NullPointer exception when the length of analysed text over 32767

2011-04-13 Thread wangzhenghang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13019671#comment-13019671 ] wangzhenghang commented on LUCENE-3026: --- It's done smartcn analyzer throw

[jira] [Updated] (LUCENE-3026) smartcn analyzer throw NullPointer exception when the length of analysed text over 32767

2011-04-13 Thread wangzhenghang (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-3026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] wangzhenghang updated LUCENE-3026: -- Attachment: LUCENE-3026.patch smartcn analyzer throw NullPointer exception when the length