date:20130510


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653587#comment-13653587
 ] 

Commit Tag Bot commented on SOLR-4760:
--

[lucene_solr_4_3 commit] elyograg
http://svn.apache.org/viewvc?view=revisionrevision=1480893

SOLR-4760: Include core name in logs when loading schema. (merge trunk r1480882)

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Make tests.failfast default to true

I thought to make the change in LuceneTestCase.

Shai


On Fri, May 10, 2013 at 8:56 AM, Dawid Weiss
dawid.we...@cs.put.poznan.plwrote:

 I don't care much. If you do though, you can specify your own override
 in any of these locations?

   property file=${user.home}/lucene.build.properties/
   property file=${user.home}/build.properties/
   property file=${basedir}/build.properties/
   property file=${common.dir}/build.properties/

 Or is this something you'd like to promote to the top level?

 Dawid

 On Fri, May 10, 2013 at 7:20 AM, Shai Erera ser...@gmail.com wrote:
  Hi
 
  Can we default tests.failfast to true? I don't know how common it is to
 run
  with -Dtests.iters=N, not specify -Dtests.maxfailures AND wanting to see
 all
  N tests finish.
 
  At any rate, if someone wishes to, he can always specify
  -Dtests.failfast=false. It now depends what is the default behavior we
 want
  to have.
 
  Any thoughts?
 
  Shai

 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (SOLR-4760) Improve logging messages during startup to better identify core


 [ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shawn Heisey resolved SOLR-4760.


Resolution: Fixed

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Compressed field reader and memory problems with atomic updates

2013-05-10 Thread Savia Beson

Erick, 
I would try to limit the number of threads to aleviate the problem, honestly, 
gong much above #CPU cores rarely makes things better. Just let excessive 
update commands wait in some queue.   We impose hard limits intercepting update 
chain and configure container thread pool to have smaller number of long liven 
threads ( longer TTL times to reduce churn).  Is there a simple way to do it on 
jetty?



On May 9, 2013, at 10:28 PM, Erick Erickson erickerick...@gmail.com wrote:

 Adrien:
 
 Yeah, this is getting warmer. Some of the
 CompressingStoredFieldsReader objects are 240M. The documents aren't
 nearly that large I don't think (but I'll verify).
 
 But still, over 700 of these objects live at once? I _think_ I'm
 seeing the number go up significantly when the number of indexing
 threads increases, but that's a bit of indirect evidence. My other
 question would be whether you'd expect the number of these objects to
 go up as the number of segments goes up, i.e. I assume they're
 per-segment
 
 So the pattern here is atomic updates on documents where some of the
 fields get quite large. So the underlying reader has to decompress
 these a lot. Do you have any suggestions how to mitigate this? Other
 than don't do thatG
 
 Thanks,
 Erick
 
 On Thu, May 9, 2013 at 6:45 AM, Adrien Grand jpou...@gmail.com wrote:
 Hi Erick,
 
 The stored fields reader reuses the buffer used for decompression across
 calls in the same thread, so I'm thinking that this kind of behaviour could
 happen if some documents are very large. Is it the case?
 
 Adrien
 
 Le 8 mai 2013 17:22, Erick Erickson erickerick...@gmail.com a écrit :
 
 I'm seeing a case where (reported to me, not on my personal machine)
 where Solr's heap is being exhausted apparently by compressed field
 reader. Here's a sample line from a summary of a heap dump:
 
 CompressingStoredFieldReader 735 instances
 
 where the CompressingStoredFieldReader instances are taking up 7.5G of
 heap space (out of 8 allocated). The GC cycles are a pattern I've seen
 before
 gc for a long time, recover a few meg
 run for a _very_ short time
 gc again, recover a few meg
 
 This app happens to be doing a lot of atomic updates, running with CMS
 collector, java 1.7. There is very little querying going on, and it is
 happening on several different machines. Solr 4.2.
 
 Now, there is a bit of custom update code involved, we're testing whether
 1 if not compressing the stored fields changes things
 2 if disabling the custom code changes things
 3 if Solr 4.3 changes things.
 4 whether memory usage grows over time linearly or spikes (pretty
 sure the former).
 
 This is an index-heavy/query-light application, and we're hosing a
 _bunch_ of documents at the indexing process. But still, assuming that
 the memory reporting above is accurate, each
 CompressingStoredFieldReader is apparently taking up 10M each and
 there are over 700 of them. The documents are being batched (don't
 quite know how many per batch, will find out).
 
 Mainly, I'm asking if this rings any bells. I don't see anything on a
 quick look at the JIRAs.
 
 Thanks,
 Erick
 
 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org
 
 
 
 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org
 


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4795) Sub shard leader should not accept any updates from parent after it goes active


[ 
https://issues.apache.org/jira/browse/SOLR-4795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653593#comment-13653593
 ] 

Commit Tag Bot commented on SOLR-4795:
--

[lucene_solr_4_3 commit] shalin
http://svn.apache.org/viewvc?view=revisionrevision=1480895

SOLR-4795: Sub shard leader should not accept any updates from parent after it 
goes active

 Sub shard leader should not accept any updates from parent after it goes 
 active
 ---

 Key: SOLR-4795
 URL: https://issues.apache.org/jira/browse/SOLR-4795
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: 4.3
Reporter: Shalin Shekhar Mangar
Assignee: Shalin Shekhar Mangar
Priority: Minor
 Fix For: 4.4

 Attachments: SOLR-4795.patch


 The sub shard leaders accept any update from a parent as long as their range 
 is a subset of parent's range. A sub shard can continue to accept updates 
 from parent even after it has gone active which can lead to inconsistencies. 
 This affects partial updates (think counters) the most but can also lead to 
 out-of-order updates to the same document.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Created] (SOLR-4807) zkcli - log4j is not configured, so no logging happens

Shawn Heisey created SOLR-4807:
--

 Summary: zkcli - log4j is not configured, so no logging happens
 Key: SOLR-4807
 URL: https://issues.apache.org/jira/browse/SOLR-4807
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: 4.3
Reporter: Shawn Heisey
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1


When executing the zkcli script from the 4.3.0 release, a log4j error message 
is displayed about no appenders, because there is no log4j configuration.  As a 
result of this problem, none of the normal zkcli stuff gets logged.

A minimal config needs to be present so that it behaves like it did before the 
logging change.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Assigned] (SOLR-4807) zkcli - log4j is not configured, so no logging happens


 [ 
https://issues.apache.org/jira/browse/SOLR-4807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shawn Heisey reassigned SOLR-4807:
--

Assignee: Shawn Heisey

 zkcli - log4j is not configured, so no logging happens
 --

 Key: SOLR-4807
 URL: https://issues.apache.org/jira/browse/SOLR-4807
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: 4.3
Reporter: Shawn Heisey
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1


 When executing the zkcli script from the 4.3.0 release, a log4j error message 
 is displayed about no appenders, because there is no log4j configuration.  As 
 a result of this problem, none of the normal zkcli stuff gets logged.
 A minimal config needs to be present so that it behaves like it did before 
 the logging change.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4807) zkcli - log4j is not configured, so no logging happens


[ 
https://issues.apache.org/jira/browse/SOLR-4807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653595#comment-13653595
 ] 

Shawn Heisey commented on SOLR-4807:


Fix Version/s includes 4.3.1, as this will probably be a small and safe fix.


 zkcli - log4j is not configured, so no logging happens
 --

 Key: SOLR-4807
 URL: https://issues.apache.org/jira/browse/SOLR-4807
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: 4.3
Reporter: Shawn Heisey
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1


 When executing the zkcli script from the 4.3.0 release, a log4j error message 
 is displayed about no appenders, because there is no log4j configuration.  As 
 a result of this problem, none of the normal zkcli stuff gets logged.
 A minimal config needs to be present so that it behaves like it did before 
 the logging change.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4807) zkcli - log4j is not configured, so no logging happens


[ 
https://issues.apache.org/jira/browse/SOLR-4807?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653599#comment-13653599
 ] 

Shawn Heisey commented on SOLR-4807:


I've assigned the issue to myself, but anyone who wants it is free to take it.


 zkcli - log4j is not configured, so no logging happens
 --

 Key: SOLR-4807
 URL: https://issues.apache.org/jira/browse/SOLR-4807
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: 4.3
Reporter: Shawn Heisey
Assignee: Shawn Heisey
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1


 When executing the zkcli script from the 4.3.0 release, a log4j error message 
 is displayed about no appenders, because there is no log4j configuration.  As 
 a result of this problem, none of the normal zkcli stuff gets logged.
 A minimal config needs to be present so that it behaves like it did before 
 the logging change.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4798) shard splitting does not respect router (docs with a compositeId may not go to correct shard)


[ 
https://issues.apache.org/jira/browse/SOLR-4798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653598#comment-13653598
 ] 

Commit Tag Bot commented on SOLR-4798:
--

[lucene_solr_4_3 commit] shalin
http://svn.apache.org/viewvc?view=revisionrevision=1480897

SOLR-4798: use correct router during index splitting

 shard splitting does not respect router (docs with a compositeId may not go 
 to correct shard)
 -

 Key: SOLR-4798
 URL: https://issues.apache.org/jira/browse/SOLR-4798
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: 4.3
Reporter: Yonik Seeley
Assignee: Yonik Seeley
 Fix For: 4.3.1

 Attachments: SOLR-4798.patch


 Lowest level shard splitting code needs to somehow use the router for the 
 collection.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4797) Sub shards have wrong hash range in cluster state except when using PlainIdRouter


[ 
https://issues.apache.org/jira/browse/SOLR-4797?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653604#comment-13653604
 ] 

Commit Tag Bot commented on SOLR-4797:
--

[lucene_solr_4_3 commit] shalin
http://svn.apache.org/viewvc?view=revisionrevision=1480898

SOLR-4797: Shard splitting creates sub shards which have the wrong hash range 
in cluster state. This happens when numShards is not a power of two and router 
is compositeId

 Sub shards have wrong hash range in cluster state except when using 
 PlainIdRouter
 -

 Key: SOLR-4797
 URL: https://issues.apache.org/jira/browse/SOLR-4797
 Project: Solr
  Issue Type: Bug
  Components: SolrCloud
Affects Versions: 4.3
Reporter: Shalin Shekhar Mangar
Assignee: Shalin Shekhar Mangar
 Fix For: 4.3.1

 Attachments: SOLR-4797.patch


 The overseer collection processor always uses PlainIdRouter to partition the 
 hash range. However, some router implementations (most notably 
 CompositeIdRouter) can provide a different implementation to partition hash 
 ranges. So there can be a mismatch between the hash ranges which are 
 persisted in clusterstate.xml for sub shards and the actual index which is 
 split using the collection's configured router impl.
 This bug does not affect collections using PlainIdRouter.
 The overseer collection processor should always use the collection's 
 configured router to partition ranges.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4791) solr.xml sharedLib does not work in 4.3.0

2013-05-10 Thread Shalin Shekhar Mangar (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-4791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653608#comment-13653608
 ] 

Shalin Shekhar Mangar commented on SOLR-4791:
-

Is there anything left here? Can I backport the commits to 4.3.1?

 solr.xml sharedLib does not work in 4.3.0
 -

 Key: SOLR-4791
 URL: https://issues.apache.org/jira/browse/SOLR-4791
 Project: Solr
  Issue Type: Bug
  Components: multicore
Affects Versions: 4.3
Reporter: Jan Høydahl
Assignee: Erick Erickson
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: closeLoader.patch, SOLR-4791.patch, SOLR-4791.patch, 
 SOLR-4791.patch, SOLR-4791-test.patch


 The sharedLib attribute on {{solr}} tag in solr.xml stopped working in 4.3.
 Using old-style solr.xml with sharedLib defined on solr tag. Solr does not 
 load any of them. Simply swapping out solr.war with the 4.2.1 one, brings 
 sharedLib loading back.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Created] (LUCENE-4995) Remove the strong reference of CompressingStoredFieldsReader on the decompression buffer

2013-05-10 Thread Adrien Grand (JIRA)

Adrien Grand created LUCENE-4995:


 Summary: Remove the strong reference of 
CompressingStoredFieldsReader on the decompression buffer
 Key: LUCENE-4995
 URL: https://issues.apache.org/jira/browse/LUCENE-4995
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Adrien Grand
Assignee: Adrien Grand


CompressingStoredFieldsReader has a strong reference on the buffer it uses for 
decompression. Although it makes the reader able to reuse this buffer, this can 
trigger high memory usage in case some documents are very large. Creating this 
buffer on demand would help give memory back to the JVM.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (LUCENE-4995) Remove the strong reference of CompressingStoredFieldsReader on the decompression buffer

2013-05-10 Thread Adrien Grand (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-4995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adrien Grand updated LUCENE-4995:
-

Attachment: LUCENE-4995.patch

Patch. CompressingTermVectorsReader doesn't need the fix since it already 
creates the decompression buffer on demand.

 Remove the strong reference of CompressingStoredFieldsReader on the 
 decompression buffer
 

 Key: LUCENE-4995
 URL: https://issues.apache.org/jira/browse/LUCENE-4995
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Adrien Grand
Assignee: Adrien Grand
 Attachments: LUCENE-4995.patch


 CompressingStoredFieldsReader has a strong reference on the buffer it uses 
 for decompression. Although it makes the reader able to reuse this buffer, 
 this can trigger high memory usage in case some documents are very large. 
 Creating this buffer on demand would help give memory back to the JVM.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Compressed field reader and memory problems with atomic updates

2013-05-10 Thread Adrien Grand

Erick,

On Thu, May 9, 2013 at 10:28 PM, Erick Erickson erickerick...@gmail.com wrote:
 Yeah, this is getting warmer. Some of the
 CompressingStoredFieldsReader objects are 240M. The documents aren't
 nearly that large I don't think (but I'll verify).

Wow, 240M is a lot! Would you be able to know which member of these
instances is making them so large? Since you were mentioning
threading, I first thought that the culprit was the buffer used for
decompression but it could be the fields index as well (which is
stored in memory but should be shared across instances for the same
segment, so threading shouldn't make the situation worse).

 But still, over 700 of these objects live at once? I _think_ I'm
 seeing the number go up significantly when the number of indexing
 threads increases, but that's a bit of indirect evidence. My other
 question would be whether you'd expect the number of these objects to
 go up as the number of segments goes up, i.e. I assume they're
 per-segment

Indeed, these objects are managed per-segment, so increasing the
number of segments increases the number of stored fields reader
instances.

 So the pattern here is atomic updates on documents where some of the
 fields get quite large. So the underlying reader has to decompress
 these a lot. Do you have any suggestions how to mitigate this? Other
 than don't do thatG

As Savia suggests, lowering the number of indexing threads should
help. If the merge factor is highish, lowering it could help as well.
On my end, I'll fix the stored fields reader to not reuse the buffer
for decompression (https://issues.apache.org/jira/browse/LUCENE-4995,
even if we find out that it is not the problem here, it shouldn't
hurt).

-- 
Adrien

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

RE: Sorry for JIRA mails

2013-05-10 Thread Uwe Schindler

In the past we always pushed that version forward, which is what I did. Steven 
Rowe did this the last time in January (4.1-4.2), also spamming the mailing 
list - so I am not the first one.

JIRA 5.2 is just missing the suppress email, so you have to do the change in 
2 steps:
- Do a bulk change with suppressed emails
- set version to released

I updated the wiki with the warning about the risk of spamming the mailing list.

Uwe

-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de


 -Original Message-
 From: dawid.we...@gmail.com [mailto:dawid.we...@gmail.com] On Behalf
 Of Dawid Weiss
 Sent: Friday, May 10, 2013 8:06 AM
 To: dev@lucene.apache.org
 Subject: Re: Sorry for JIRA mails
 
  I found out that Lucene/Solr 4.3 was not marked as released version in
 JIRA.
 
 I e-mailed about it 2 days ago but wasn't sure what to do with it.
 
 http://goo.gl/xXhgF
 
 D.
 
 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional
 commands, e-mail: dev-h...@lucene.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4993) BeiderMorseFilter inserts tokens with positionIncrement=0, but ignores all custom attributes except OffsetAttribute


[ 
https://issues.apache.org/jira/browse/LUCENE-4993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653627#comment-13653627
 ] 

Commit Tag Bot commented on LUCENE-4993:


[trunk commit] uschindler
http://svn.apache.org/viewvc?view=revisionrevision=1480911

LUCENE-4993: Fix BeiderMorseFilter to preserve custom attributes when inserting 
tokens with position increment 0.

 BeiderMorseFilter inserts tokens with positionIncrement=0, but ignores all 
 custom attributes except OffsetAttribute
 ---

 Key: LUCENE-4993
 URL: https://issues.apache.org/jira/browse/LUCENE-4993
 Project: Lucene - Core
  Issue Type: Bug
  Components: modules/analysis
Affects Versions: 4.3
Reporter: Uwe Schindler
Assignee: Uwe Schindler
Priority: Trivial
 Fix For: 4.4

 Attachments: LUCENE-4993.patch


 BeiderMorseFilter inserts sometimes additional phonetic tokens for the same 
 source token. Currently it calls clearAttributes before doing this and sets 
 the new token's term, positionIncrement=0 and the original offset.
 This leads to problems if the TokenStream contains other attributes inserted 
 before (like KeywordAttribute, FlagsAttribute,...). Those are all reverted to 
 defaults for the inserted tokens.
 The TokenFilter should remove the special case done for preserving offsets 
 and instead to captureState() and restoreState().

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4993) BeiderMorseFilter inserts tokens with positionIncrement=0, but ignores all custom attributes except OffsetAttribute


[ 
https://issues.apache.org/jira/browse/LUCENE-4993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653628#comment-13653628
 ] 

Commit Tag Bot commented on LUCENE-4993:


[branch_4x commit] uschindler
http://svn.apache.org/viewvc?view=revisionrevision=1480912

Merged revision(s) 1480911 from lucene/dev/trunk:
LUCENE-4993: Fix BeiderMorseFilter to preserve custom attributes when inserting 
tokens with position increment 0.

 BeiderMorseFilter inserts tokens with positionIncrement=0, but ignores all 
 custom attributes except OffsetAttribute
 ---

 Key: LUCENE-4993
 URL: https://issues.apache.org/jira/browse/LUCENE-4993
 Project: Lucene - Core
  Issue Type: Bug
  Components: modules/analysis
Affects Versions: 4.3
Reporter: Uwe Schindler
Assignee: Uwe Schindler
Priority: Trivial
 Fix For: 4.4

 Attachments: LUCENE-4993.patch


 BeiderMorseFilter inserts sometimes additional phonetic tokens for the same 
 source token. Currently it calls clearAttributes before doing this and sets 
 the new token's term, positionIncrement=0 and the original offset.
 This leads to problems if the TokenStream contains other attributes inserted 
 before (like KeywordAttribute, FlagsAttribute,...). Those are all reverted to 
 defaults for the inserted tokens.
 The TokenFilter should remove the special case done for preserving offsets 
 and instead to captureState() and restoreState().

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (LUCENE-4993) BeiderMorseFilter inserts tokens with positionIncrement=0, but ignores all custom attributes except OffsetAttribute

2013-05-10 Thread Uwe Schindler (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-4993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler resolved LUCENE-4993.
---

   Resolution: Fixed
Fix Version/s: 5.0

 BeiderMorseFilter inserts tokens with positionIncrement=0, but ignores all 
 custom attributes except OffsetAttribute
 ---

 Key: LUCENE-4993
 URL: https://issues.apache.org/jira/browse/LUCENE-4993
 Project: Lucene - Core
  Issue Type: Bug
  Components: modules/analysis
Affects Versions: 4.3
Reporter: Uwe Schindler
Assignee: Uwe Schindler
Priority: Trivial
 Fix For: 5.0, 4.4

 Attachments: LUCENE-4993.patch


 BeiderMorseFilter inserts sometimes additional phonetic tokens for the same 
 source token. Currently it calls clearAttributes before doing this and sets 
 the new token's term, positionIncrement=0 and the original offset.
 This leads to problems if the TokenStream contains other attributes inserted 
 before (like KeywordAttribute, FlagsAttribute,...). Those are all reverted to 
 defaults for the inserted tokens.
 The TokenFilter should remove the special case done for preserving offsets 
 and instead to captureState() and restoreState().

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (LUCENE-4994) PatternKeywordMarkerFilter is final and has protected ctor and cannot be instantiated by non-Lucene code

2013-05-10 Thread Uwe Schindler (JIRA)


 [ 
https://issues.apache.org/jira/browse/LUCENE-4994?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Uwe Schindler updated LUCENE-4994:
--

Fix Version/s: 5.0

 PatternKeywordMarkerFilter is final and has protected ctor and cannot be 
 instantiated by non-Lucene code
 

 Key: LUCENE-4994
 URL: https://issues.apache.org/jira/browse/LUCENE-4994
 Project: Lucene - Core
  Issue Type: Bug
  Components: modules/analysis
Affects Versions: 4.3
Reporter: Uwe Schindler
Assignee: Uwe Schindler
 Fix For: 5.0, 4.4


 I tried to write a test for LUCENE-4993 but recognized that a copy'n'paste 
 error made the ctor of this filter protected.
 The sister SetKeywordMarkerFilter has public ctor.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.8.0-ea-b86) - Build # 5582 - Still Failing!

2013-05-10 Thread Policeman Jenkins Server

Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/5582/
Java: 64bit/jdk1.8.0-ea-b86 -XX:-UseCompressedOops -XX:+UseG1GC

1 tests failed.
FAILED:  
org.apache.lucene.spatial.prefix.SpatialOpRecursivePrefixTreeTest.testContains 
{#6 seed=[2301AA95F732FAE3:A9AE9291D68A1C44]}

Error Message:
Shouldn't match I #4:ShapePair(Rect(minX=15.0,maxX=28.0,minY=-107.0,maxY=97.0) 
, Rect(minX=146.0,maxX=165.0,minY=-100.0,maxY=56.0)) 
Q:Rect(minX=0.0,maxX=256.0,minY=-128.0,maxY=128.0)

Stack Trace:
java.lang.AssertionError: Shouldn't match I 
#4:ShapePair(Rect(minX=15.0,maxX=28.0,minY=-107.0,maxY=97.0) , 
Rect(minX=146.0,maxX=165.0,minY=-100.0,maxY=56.0)) 
Q:Rect(minX=0.0,maxX=256.0,minY=-128.0,maxY=128.0)
at 
__randomizedtesting.SeedInfo.seed([2301AA95F732FAE3:A9AE9291D68A1C44]:0)
at org.junit.Assert.fail(Assert.java:93)
at 
org.apache.lucene.spatial.prefix.SpatialOpRecursivePrefixTreeTest.fail(SpatialOpRecursivePrefixTreeTest.java:287)
at 
org.apache.lucene.spatial.prefix.SpatialOpRecursivePrefixTreeTest.doTest(SpatialOpRecursivePrefixTreeTest.java:273)
at 
org.apache.lucene.spatial.prefix.SpatialOpRecursivePrefixTreeTest.testContains(SpatialOpRecursivePrefixTreeTest.java:101)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:490)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1559)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.access$600(RandomizedRunner.java:79)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:737)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:773)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:787)
at 
org.apache.lucene.util.TestRuleSetupTeardownChained$1.evaluate(TestRuleSetupTeardownChained.java:50)
at 
org.apache.lucene.util.TestRuleFieldCacheSanity$1.evaluate(TestRuleFieldCacheSanity.java:51)
at 
org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
at 
com.carrotsearch.randomizedtesting.rules.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:55)
at 
org.apache.lucene.util.TestRuleThreadAndTestName$1.evaluate(TestRuleThreadAndTestName.java:49)
at 
org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:70)
at 
org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:358)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl.forkTimeoutingTask(ThreadLeakControl.java:782)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl$3.evaluate(ThreadLeakControl.java:442)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:746)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$3.evaluate(RandomizedRunner.java:648)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$4.evaluate(RandomizedRunner.java:682)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:693)
at 
org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
at 
org.apache.lucene.util.TestRuleStoreClassName$1.evaluate(TestRuleStoreClassName.java:42)
at 
com.carrotsearch.randomizedtesting.rules.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:55)
at 
com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
at 
com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
org.apache.lucene.util.TestRuleAssertionsRequired$1.evaluate(TestRuleAssertionsRequired.java:43)
at 
org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48)
at 
org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:70)
at 
org.apache.lucene.util.TestRuleIgnoreTestSuites$1.evaluate(TestRuleIgnoreTestSuites.java:55)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at

[jira] [Commented] (SOLR-3862) add remove as update option for atomically removing a value from a multivalued field

2013-05-10 Thread Andreas Kleiber (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-3862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653660#comment-13653660
 ] 

Andreas Kleiber commented on SOLR-3862:
---

I need this aswell. We have a message based system to generate / update our 
solr index. So an option to remove an entry of a multiValued field by value is 
very nice.
I would appreciate if this will be merged.

 add remove as update option for atomically removing a value from a 
 multivalued field
 --

 Key: SOLR-3862
 URL: https://issues.apache.org/jira/browse/SOLR-3862
 Project: Solr
  Issue Type: Improvement
  Components: SolrCloud
Affects Versions: 4.0-BETA
Reporter: Jim Musil
 Attachments: SOLR-3862.patch


 Currently you can atomically add a value to a multivalued field. It would 
 be useful to be able to remove a value from a multivalued field. 
 When you set a multivalued field to null, it destroys all values.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Make tests.failfast default to true

2013-05-10 Thread Dawid Weiss

I don't have any strong opinion, to be honest so if nobody vetoes this
change, go ahead.

Dawid

On Fri, May 10, 2013 at 8:27 AM, Shai Erera ser...@gmail.com wrote:
 I thought to make the change in LuceneTestCase.

 Shai


 On Fri, May 10, 2013 at 8:56 AM, Dawid Weiss dawid.we...@cs.put.poznan.pl
 wrote:

 I don't care much. If you do though, you can specify your own override
 in any of these locations?

   property file=${user.home}/lucene.build.properties/
   property file=${user.home}/build.properties/
   property file=${basedir}/build.properties/
   property file=${common.dir}/build.properties/

 Or is this something you'd like to promote to the top level?

 Dawid

 On Fri, May 10, 2013 at 7:20 AM, Shai Erera ser...@gmail.com wrote:
  Hi
 
  Can we default tests.failfast to true? I don't know how common it is to
  run
  with -Dtests.iters=N, not specify -Dtests.maxfailures AND wanting to see
  all
  N tests finish.
 
  At any rate, if someone wishes to, he can always specify
  -Dtests.failfast=false. It now depends what is the default behavior we
  want
  to have.
 
  Any thoughts?
 
  Shai

 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4975) Add Replication module to Lucene


[ 
https://issues.apache.org/jira/browse/LUCENE-4975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653896#comment-13653896
 ] 

Shai Erera commented on LUCENE-4975:


I ran both tests w/ tests.iters=1000 and they passed. This gives me more 
confidence about the robustness of these two handlers. Still, other machines 
can dig up special seeds :).

 Add Replication module to Lucene
 

 Key: LUCENE-4975
 URL: https://issues.apache.org/jira/browse/LUCENE-4975
 Project: Lucene - Core
  Issue Type: New Feature
Reporter: Shai Erera
Assignee: Shai Erera
 Attachments: LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, 
 LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch


 I wrote a replication module which I think will be useful to Lucene users who 
 want to replicate their indexes for e.g high-availability, taking hot backups 
 etc.
 I will upload a patch soon where I'll describe in general how it works.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Comment Edited] (LUCENE-4975) Add Replication module to Lucene

[
https://issues.apache.org/jira/browse/LUCENE-4975?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13653562#comment-13653562
]

Shai Erera edited comment on LUCENE-4975 at 5/10/13 10:34 AM:
--

Patch resolves all remaining noocmmits. I made both IndexReplicationHandler and
IndexAndTaxonomyReplicationHandler more resilient to errors but carefully
reverting any changes made to the target indexes on errors. Also ensure that
segments_N files are written last.

I also added code to write segments.gen. To do that I factored out
SegmentInfos.writeSegmentsGen which takes a Directory and generation. It is now
used by SIS.finishCommit and the handlers.

Would appreciate if someone can review the handlers, and whether they can be
written in a more resilient ways.

I want to beast IndexReplicationClientTest and
IndexAndTaxonomyReplictionClientTest checkConsistencyOnExceptions before
committing. If anyone's interested to help, just fire {{ant test
-Dtestcase=IndexReplicationClientTest
-Dtestsm.method=testConsistencyOnExceptions -Dtests.iters=1000}} (and same for
IndexAndTaxonomyReplicationClientTest) and let me know the seeds that failed.

was (Author: shaie):
Patch resolves all remaining noocmmits. I made both IndexReplicationHandler
and IndexAndTaxonomyReplicationHandler more resilient to errors but carefully
reverting any changes made to the target indexes on errors. Also ensure that
segments_N files are written last.

I also added code to write segments.gen. To do that I factored out
SegmentInfos.writeSegmentsGen which takes a Directory and generation. It is now
used by SIS.finishCommit and the handlers.

Would appreciate if someone can review the handlers, and whether they can be
written in a more resilient ways.

I want to beast IndexReplicationClientTest and
IndexAndTaxonomyReplictionClientTest checkConsistencyOnExceptions before
committing. If anyone's interested to help, just fire {{ant test
-Dtestcase=IndexReplicationHandlerTest
-Dtestsm.method=testConsistencyOnExceptions -Dtests.iters=1000}} (and same for
IndexAndTaxonomyReplicationClientTest) and let me know the seeds that failed.

Add Replication module to Lucene

Key: LUCENE-4975
URL: https://issues.apache.org/jira/browse/LUCENE-4975
Project: Lucene - Core
Issue Type: New Feature
Reporter: Shai Erera
Assignee: Shai Erera
Attachments: LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch,
LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch

I wrote a replication module which I think will be useful to Lucene users who
want to replicate their indexes for e.g high-availability, taking hot backups
etc.
I will upload a patch soon where I'll describe in general how it works.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4791) solr.xml sharedLib does not work in 4.3.0


[ 
https://issues.apache.org/jira/browse/SOLR-4791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654369#comment-13654369
 ] 

Erick Erickson commented on SOLR-4791:
--

Go for it. Be sure to apply both the SOLR-4791.patch and closeLoader.patch. 
They were originally applied in that order, but I doubt it matters.

 solr.xml sharedLib does not work in 4.3.0
 -

 Key: SOLR-4791
 URL: https://issues.apache.org/jira/browse/SOLR-4791
 Project: Solr
  Issue Type: Bug
  Components: multicore
Affects Versions: 4.3
Reporter: Jan Høydahl
Assignee: Erick Erickson
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: closeLoader.patch, SOLR-4791.patch, SOLR-4791.patch, 
 SOLR-4791.patch, SOLR-4791-test.patch


 The sharedLib attribute on {{solr}} tag in solr.xml stopped working in 4.3.
 Using old-style solr.xml with sharedLib defined on solr tag. Solr does not 
 load any of them. Simply swapping out solr.war with the 4.2.1 one, brings 
 sharedLib loading back.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (SOLR-4791) solr.xml sharedLib does not work in 4.3.0


 [ 
https://issues.apache.org/jira/browse/SOLR-4791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson resolved SOLR-4791.
--

Resolution: Fixed

Haven't seen any test failures since Robert added the patch so I'm marking this 
resolved.

 solr.xml sharedLib does not work in 4.3.0
 -

 Key: SOLR-4791
 URL: https://issues.apache.org/jira/browse/SOLR-4791
 Project: Solr
  Issue Type: Bug
  Components: multicore
Affects Versions: 4.3
Reporter: Jan Høydahl
Assignee: Erick Erickson
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: closeLoader.patch, SOLR-4791.patch, SOLR-4791.patch, 
 SOLR-4791.patch, SOLR-4791-test.patch


 The sharedLib attribute on {{solr}} tag in solr.xml stopped working in 4.3.
 Using old-style solr.xml with sharedLib defined on solr tag. Solr does not 
 load any of them. Simply swapping out solr.war with the 4.2.1 one, brings 
 sharedLib loading back.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Code checkin, vacation and reverting

2013-05-10 Thread Erick Erickson

I'll be checking in a pretty small patch today (SOLR-4790) (assuming
all tests pass) and then leaving for vacation. I hereby pre-authorize
anyone at all to revert it if it's a problem.

Erick

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654379#comment-13654379
 ] 

Jan Høydahl commented on SOLR-4760:
---

I get a nullpointer caused by this patch, when running tests for a patch.

{noformat}
[junit4:junit4]   2 NOTE: All tests run in this JVM: [TestMaxScoreQueryParser]
[junit4:junit4]   2 NOTE: reproduce with: ant test  
-Dtestcase=TestMaxScoreQueryParser -Dtests.seed=FFA2D62653D3F0F3 
-Dtests.slow=true -Dtests.locale=nl_BE -Dtests.timezone=America/Shiprock 
-Dtests.file.encoding=UTF-8
[junit4:junit4] ERROR   0.00s | TestMaxScoreQueryParser (suite) 
[junit4:junit4] Throwable #1: org.apache.solr.common.SolrException: Schema 
Parsing Failed: null
[junit4:junit4]at 
__randomizedtesting.SeedInfo.seed([FFA2D62653D3F0F3]:0)
[junit4:junit4]at 
org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:607)
[junit4:junit4]at 
org.apache.solr.schema.IndexSchema.init(IndexSchema.java:166)
[junit4:junit4]at 
org.apache.solr.schema.IndexSchemaFactory.create(IndexSchemaFactory.java:55)
[junit4:junit4]at 
org.apache.solr.schema.IndexSchemaFactory.buildIndexSchema(IndexSchemaFactory.java:69)
[junit4:junit4]at 
org.apache.solr.util.TestHarness.init(TestHarness.java:108)
[junit4:junit4]at 
org.apache.solr.SolrTestCaseJ4.createCore(SolrTestCaseJ4.java:393)
[junit4:junit4]at 
org.apache.solr.SolrTestCaseJ4.initCore(SolrTestCaseJ4.java:385)
[junit4:junit4]at 
org.apache.solr.SolrTestCaseJ4.initCore(SolrTestCaseJ4.java:197)
[junit4:junit4]at 
org.apache.solr.SolrTestCaseJ4.initCore(SolrTestCaseJ4.java:184)
[junit4:junit4]at 
org.apache.solr.search.TestMaxScoreQueryParser.beforeClass(TestMaxScoreQueryParser.java:36)
[junit4:junit4]at java.lang.Thread.run(Thread.java:722)
[junit4:junit4] Caused by: java.lang.NullPointerException
[junit4:junit4]at 
org.apache.solr.schema.IndexSchema.readSchema(IndexSchema.java:437)
[junit4:junit4]... 32 more
[junit4:junit4] Completed in 2.16s, 0 tests, 1 error  FAILURES!
{noformat}

The offending line is:
{code:java}
437:  sb.append(loader.getCoreProperties().getProperty(NAME));
{code}


 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Compressed field reader and memory problems with atomic updates

2013-05-10 Thread Erick Erickson

Thanks to you both. I've pinged the people who can reproduce this
problem, I suspect they'll try the patch out. I'm out next week, far
far away from internet connections so I'll be silent for a while. But
don't be surprised if new messages appear.

Thanks again
Erick

On Fri, May 10, 2013 at 3:46 AM, Adrien Grand jpou...@gmail.com wrote:
 Erick,

 On Thu, May 9, 2013 at 10:28 PM, Erick Erickson erickerick...@gmail.com 
 wrote:
 Yeah, this is getting warmer. Some of the
 CompressingStoredFieldsReader objects are 240M. The documents aren't
 nearly that large I don't think (but I'll verify).

 Wow, 240M is a lot! Would you be able to know which member of these
 instances is making them so large? Since you were mentioning
 threading, I first thought that the culprit was the buffer used for
 decompression but it could be the fields index as well (which is
 stored in memory but should be shared across instances for the same
 segment, so threading shouldn't make the situation worse).

 But still, over 700 of these objects live at once? I _think_ I'm
 seeing the number go up significantly when the number of indexing
 threads increases, but that's a bit of indirect evidence. My other
 question would be whether you'd expect the number of these objects to
 go up as the number of segments goes up, i.e. I assume they're
 per-segment

 Indeed, these objects are managed per-segment, so increasing the
 number of segments increases the number of stored fields reader
 instances.

 So the pattern here is atomic updates on documents where some of the
 fields get quite large. So the underlying reader has to decompress
 these a lot. Do you have any suggestions how to mitigate this? Other
 than don't do thatG

 As Savia suggests, lowering the number of indexing threads should
 help. If the merge factor is highish, lowering it could help as well.
 On my end, I'll fix the stored fields reader to not reuse the buffer
 for decompression (https://issues.apache.org/jira/browse/LUCENE-4995,
 even if we find out that it is not the problem here, it shouldn't
 hurt).

 --
 Adrien

 -
 To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Reopened] (SOLR-4760) Improve logging messages during startup to better identify core


 [ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl reopened SOLR-4760:
---


Reopening to examine the nullpointer

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4018) Dispatcher to optionally write QTime and Hits HTTP header


 [ 
https://issues.apache.org/jira/browse/SOLR-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma updated SOLR-4018:


Attachment: SOLR-4018-trunk-5.patch

Updated patch for trunk.

 Dispatcher to optionally write QTime and Hits HTTP header
 -

 Key: SOLR-4018
 URL: https://issues.apache.org/jira/browse/SOLR-4018
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.0
Reporter: Markus Jelsma
 Fix For: 4.4

 Attachments: SOLR-4018-trunk-1.patch, SOLR-4018-trunk-2.patch, 
 SOLR-4018-trunk-3.patch, SOLR-4018-trunk-4.patch, SOLR-4018-trunk-5.patch


 SolrDispatchFilter should be able to write QTime and Hits HTTP headers via 
 configuration.
 {code}
   requestDispatcher handleSelect=true 
 requestParsers enableRemoteStreaming=false 
 multipartUploadLimitInKB=2048000 /
 httpCaching never304=true /
 httpHeaders qtime=true hits=true/
   /requestDispatcher
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654393#comment-13654393
 ] 

Erick Erickson commented on SOLR-4760:
--

right, I'm seeing the same thing:
sb.append(loader.getCoreProperties().getProperty(NAME));
getCoreProperties returns null here. Seems like a simple if test here would 
suffice, are you taking care of it or should I? I've seen several situations 
where stuff is null (particularly CoreContainer.cfg) in the test environment 
that are _never_ null in non-test, this could be another one of those.

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core

[
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654397#comment-13654397
]

Jan Høydahl commented on SOLR-4760:
---

Yea, a test issue probably. Not sure why I bump into it only in the tests.

I tried to copy core.properties into collection1/ in test-files, but that did
not help either. Must be that test framework does not initialize same way as
the real thing.

Improve logging messages during startup to better identify core
---

Key: SOLR-4760
URL: https://issues.apache.org/jira/browse/SOLR-4760
Project: Solr
Issue Type: Wish
Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
Fix For: 5.0, 4.4, 4.3.1

Attachments: SOLR-4760.patch, SOLR-4760.patch

Some log messages could be more informative. For example:
{code}
680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema –
schema has no name!
{code}
Would be _very nice_ to know which core this is complaining about.
Later, once the core is loaded, the core name shows up in the logs,
but it would be nice to have it earlier without having to
triangulating it through 'Loading core' messages.

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (LUCENE-4975) Add Replication module to Lucene


 [ 
https://issues.apache.org/jira/browse/LUCENE-4975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shai Erera updated LUCENE-4975:
---

Attachment: LUCENE-4975.patch

Mike found a seed which tripped a test bug. Fixed it and on the way made the 
test less sensitive to time changes (i.e. it waited up to 20 seconds for the 
index to get updated, otherwise failed) and added some other improvements (to 
the test only).

Let's search for more seeds :).

 Add Replication module to Lucene
 

 Key: LUCENE-4975
 URL: https://issues.apache.org/jira/browse/LUCENE-4975
 Project: Lucene - Core
  Issue Type: New Feature
Reporter: Shai Erera
Assignee: Shai Erera
 Attachments: LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, 
 LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, 
 LUCENE-4975.patch


 I wrote a replication module which I think will be useful to Lucene users who 
 want to replicate their indexes for e.g high-availability, taking hot backups 
 etc.
 I will upload a patch soon where I'll describe in general how it works.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654401#comment-13654401
 ] 

Jan Høydahl commented on SOLR-4760:
---

This patch did the trick for me
{code}
437:  sb.append(loader.getCoreProperties()!=null ? 
loader.getCoreProperties().getProperty(NAME) : (N/A));
{code}

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654402#comment-13654402
 ] 

Erick Erickson commented on SOLR-4760:
--

yep, that's what I've found viz. some of the stuff in CoreContainer, you'll see 
some if tests in there (which I hate but haven't had a chance to track down).

I'll take care of it. I think this is useful information in the usual case, 
simple if test. Look for a patch soon.

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (LUCENE-4975) Add Replication module to Lucene


 [ 
https://issues.apache.org/jira/browse/LUCENE-4975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shai Erera updated LUCENE-4975:
---

Attachment: LUCENE-4975.patch

Forgot to include one change -- handleUpdateEx should fail if the exception it 
hits is not IOE. Previous patch called super.handle(), which only logged it. 
But I think it's fair to say that the test shouldn't hit any exception, and 
terminate the client if it does.

 Add Replication module to Lucene
 

 Key: LUCENE-4975
 URL: https://issues.apache.org/jira/browse/LUCENE-4975
 Project: Lucene - Core
  Issue Type: New Feature
Reporter: Shai Erera
Assignee: Shai Erera
 Attachments: LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, 
 LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, LUCENE-4975.patch, 
 LUCENE-4975.patch, LUCENE-4975.patch


 I wrote a replication module which I think will be useful to Lucene users who 
 want to replicate their indexes for e.g high-availability, taking hot backups 
 etc.
 I will upload a patch soon where I'll describe in general how it works.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654405#comment-13654405
 ] 

Commit Tag Bot commented on SOLR-4751:
--

[trunk commit] koji
http://svn.apache.org/viewvc?view=revisionrevision=1480988

SOLR-4751: fix replication problem of files in sub directory of conf

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4995) Remove the strong reference of CompressingStoredFieldsReader on the decompression buffer


[ 
https://issues.apache.org/jira/browse/LUCENE-4995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654414#comment-13654414
 ] 

Robert Muir commented on LUCENE-4995:
-

Are we sure this is the right thing to do? People have misconfigured 
threadpools (especially tomcat users) and complain all the time about e.g. the 
analyzer buffers and so on.

I just want to make sure these misconfigured users arent causing 
correctly-configured users massive amounts of garbage etc.

 Remove the strong reference of CompressingStoredFieldsReader on the 
 decompression buffer
 

 Key: LUCENE-4995
 URL: https://issues.apache.org/jira/browse/LUCENE-4995
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Adrien Grand
Assignee: Adrien Grand
 Attachments: LUCENE-4995.patch


 CompressingStoredFieldsReader has a strong reference on the buffer it uses 
 for decompression. Although it makes the reader able to reuse this buffer, 
 this can trigger high memory usage in case some documents are very large. 
 Creating this buffer on demand would help give memory back to the JVM.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Make tests.failfast default to true

why change it?

On Fri, May 10, 2013 at 1:20 AM, Shai Erera ser...@gmail.com wrote:

 Hi

 Can we default tests.failfast to true? I don't know how common it is to
 run with -Dtests.iters=N, not specify -Dtests.maxfailures AND wanting to
 see all N tests finish.

 At any rate, if someone wishes to, he can always specify
 -Dtests.failfast=false. It now depends what is the default behavior we want
 to have.

 Any thoughts?

 Shai

[jira] [Commented] (SOLR-4563) RSS DIH-example not working


[ 
https://issues.apache.org/jira/browse/SOLR-4563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654421#comment-13654421
 ] 

Commit Tag Bot commented on SOLR-4563:
--

[trunk commit] janhoy
http://svn.apache.org/viewvc?view=revisionrevision=1480993

SOLR-4563: RSS DIH-example not working

 RSS DIH-example not working
 ---

 Key: SOLR-4563
 URL: https://issues.apache.org/jira/browse/SOLR-4563
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.2
Reporter: Jan Høydahl
 Fix For: 4.4

 Attachments: SOLR-4563.patch


 The xpath paths of /rss/item do not match the real world RSS feed which uses 
 /rss/channel/item

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4791) solr.xml sharedLib does not work in 4.3.0


[ 
https://issues.apache.org/jira/browse/SOLR-4791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654423#comment-13654423
 ] 

Robert Muir commented on SOLR-4791:
---

shouldn't this have a changes entry?

 solr.xml sharedLib does not work in 4.3.0
 -

 Key: SOLR-4791
 URL: https://issues.apache.org/jira/browse/SOLR-4791
 Project: Solr
  Issue Type: Bug
  Components: multicore
Affects Versions: 4.3
Reporter: Jan Høydahl
Assignee: Erick Erickson
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: closeLoader.patch, SOLR-4791.patch, SOLR-4791.patch, 
 SOLR-4791.patch, SOLR-4791-test.patch


 The sharedLib attribute on {{solr}} tag in solr.xml stopped working in 4.3.
 Using old-style solr.xml with sharedLib defined on solr tag. Solr does not 
 load any of them. Simply swapping out solr.war with the 4.2.1 one, brings 
 sharedLib loading back.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4563) RSS DIH-example not working


[ 
https://issues.apache.org/jira/browse/SOLR-4563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654424#comment-13654424
 ] 

Commit Tag Bot commented on SOLR-4563:
--

[branch_4x commit] janhoy
http://svn.apache.org/viewvc?view=revisionrevision=1480994

SOLR-4563: RSS DIH-example not working (backport)

 RSS DIH-example not working
 ---

 Key: SOLR-4563
 URL: https://issues.apache.org/jira/browse/SOLR-4563
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.2
Reporter: Jan Høydahl
 Fix For: 4.4

 Attachments: SOLR-4563.patch


 The xpath paths of /rss/item do not match the real world RSS feed which uses 
 /rss/channel/item

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4563) RSS DIH-example not working


 [ 
https://issues.apache.org/jira/browse/SOLR-4563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl updated SOLR-4563:
--

Fix Version/s: 5.0

 RSS DIH-example not working
 ---

 Key: SOLR-4563
 URL: https://issues.apache.org/jira/browse/SOLR-4563
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.2
Reporter: Jan Høydahl
 Fix For: 5.0, 4.4

 Attachments: SOLR-4563.patch


 The xpath paths of /rss/item do not match the real world RSS feed which uses 
 /rss/channel/item

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4563) RSS DIH-example not working


 [ 
https://issues.apache.org/jira/browse/SOLR-4563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl updated SOLR-4563:
--

Component/s: contrib - DataImportHandler

 RSS DIH-example not working
 ---

 Key: SOLR-4563
 URL: https://issues.apache.org/jira/browse/SOLR-4563
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.2
Reporter: Jan Høydahl
 Fix For: 5.0, 4.4

 Attachments: SOLR-4563.patch


 The xpath paths of /rss/item do not match the real world RSS feed which uses 
 /rss/channel/item

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Assigned] (SOLR-4563) RSS DIH-example not working


 [ 
https://issues.apache.org/jira/browse/SOLR-4563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl reassigned SOLR-4563:
-

Assignee: Jan Høydahl

 RSS DIH-example not working
 ---

 Key: SOLR-4563
 URL: https://issues.apache.org/jira/browse/SOLR-4563
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.2
Reporter: Jan Høydahl
Assignee: Jan Høydahl
 Fix For: 5.0, 4.4

 Attachments: SOLR-4563.patch


 The xpath paths of /rss/item do not match the real world RSS feed which uses 
 /rss/channel/item

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (SOLR-4563) RSS DIH-example not working


 [ 
https://issues.apache.org/jira/browse/SOLR-4563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jan Høydahl resolved SOLR-4563.
---

Resolution: Fixed

 RSS DIH-example not working
 ---

 Key: SOLR-4563
 URL: https://issues.apache.org/jira/browse/SOLR-4563
 Project: Solr
  Issue Type: Bug
  Components: contrib - DataImportHandler
Affects Versions: 4.2
Reporter: Jan Høydahl
Assignee: Jan Høydahl
 Fix For: 5.0, 4.4

 Attachments: SOLR-4563.patch


 The xpath paths of /rss/item do not match the real world RSS feed which uses 
 /rss/channel/item

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

disable solr jenkins tasks?

I've noticed that nearly all solr tests (225 tests failing) have been
failing for hours in jenkins.

I have serious doubts whether any solr committers are even looking at test
failures. Can we disable solr jobs in jenkins so that lucene gets more test
coverage?

Thanks,
Robert

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654429#comment-13654429
 ] 

Commit Tag Bot commented on SOLR-4760:
--

[trunk commit] erick
http://svn.apache.org/viewvc?view=revisionrevision=1480998

Test fix for SOLR-4760

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654435#comment-13654435
 ] 

Commit Tag Bot commented on SOLR-4760:
--

[branch_4x commit] erick
http://svn.apache.org/viewvc?view=revisionrevision=1481003

Test fix for SOLR-4760

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654436#comment-13654436
 ] 

Erick Erickson commented on SOLR-4760:
--

trunk: r - 1480998
4x:r - 1481003

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654455#comment-13654455
 ] 

Commit Tag Bot commented on SOLR-4751:
--

[branch_4x commit] koji
http://svn.apache.org/viewvc?view=revisionrevision=1481004

SOLR-4751: fix replication problem of files in sub directory of conf

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4760) Improve logging messages during startup to better identify core


 [ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson updated SOLR-4760:
-

Attachment: SOLR-4760-testfix.patch

Patch for making the tests run, apply on top of original patch.

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch, SOLR-4760-testfix.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (SOLR-4760) Improve logging messages during startup to better identify core


 [ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson resolved SOLR-4760.
--

Resolution: Fixed

Added null test so this doesn't break the tests.

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch, SOLR-4760-testfix.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Make tests.failfast default to true

Because I think it makes more sense as a default? Usually when I run with
tests.iters, I just search for a seed that fails. When I find it, I go fix
the bug and run again.

But that's just me. That's why I asked if others think it makes sense as a
default too. There is no functionality change, just different default.

Shai
On May 10, 2013 3:33 PM, Robert Muir rcm...@gmail.com wrote:

 why change it?

 On Fri, May 10, 2013 at 1:20 AM, Shai Erera ser...@gmail.com wrote:

 Hi

 Can we default tests.failfast to true? I don't know how common it is to
 run with -Dtests.iters=N, not specify -Dtests.maxfailures AND wanting to
 see all N tests finish.

 At any rate, if someone wishes to, he can always specify
 -Dtests.failfast=false. It now depends what is the default behavior we want
 to have.

 Any thoughts?

 Shai

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654460#comment-13654460
 ] 

Commit Tag Bot commented on SOLR-4751:
--

[lucene_solr_4_3 commit] koji
http://svn.apache.org/viewvc?view=revisionrevision=1481008

SOLR-4751: fix replication problem of files in sub directory of conf

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654461#comment-13654461
 ] 

Koji Sekiguchi commented on SOLR-4751:
--

I committed the patch on trunk, branch_4x and 4.3.

Osuka-san, can you check one of them and see that your fix can solve the 
problem?

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4751) The replication problem of the file in a subdirectory.


 [ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated SOLR-4751:
-

Fix Version/s: 4.4
   5.0

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Created] (LUCENE-4996) DocInverterPerField to log which field throws exceptions

Markus Jelsma created LUCENE-4996:
-

 Summary: DocInverterPerField to log which field throws exceptions
 Key: LUCENE-4996
 URL: https://issues.apache.org/jira/browse/LUCENE-4996
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Reporter: Markus Jelsma
Priority: Minor
 Fix For: 5.0, 4.4


One of ours fields seems to have a problem that didn't result in an exception 
before. It seems one of my filters doesn't deal with posIncAttr properly but 
Lucene did not log which of my numerous fields was the problem.

This patch includes the field name in the exception.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654466#comment-13654466
 ] 

Erick Erickson commented on SOLR-4760:
--

Gaaah. Removed the trailing ']' in the message by mistake. I'm not going to 
re-spin the test-fix patch, but I'll fix it in another patch I'll put up 
shortly.

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch, SOLR-4760-testfix.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (LUCENE-4996) DocInverterPerField to log which field throws exceptions


 [ 
https://issues.apache.org/jira/browse/LUCENE-4996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma updated LUCENE-4996:
--

Attachment: LUCENE-4996-trunk.patch

 DocInverterPerField to log which field throws exceptions
 

 Key: LUCENE-4996
 URL: https://issues.apache.org/jira/browse/LUCENE-4996
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Reporter: Markus Jelsma
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-4996-trunk.patch


 One of ours fields seems to have a problem that didn't result in an exception 
 before. It seems one of my filters doesn't deal with posIncAttr properly but 
 Lucene did not log which of my numerous fields was the problem.
 This patch includes the field name in the exception.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654473#comment-13654473
 ] 

Shawn Heisey commented on SOLR-4760:


[~janhoy] and [~erickerickson], many thanks for finding and fixing my mistake 
while I slept.  I did run tests and precommit back when I created this patch, 
but I did not do so before the actual commit, which was sloppy.


 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch, SOLR-4760-testfix.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-1913) QParserPlugin plugin for Search Results Filtering Based on Bitwise Operations on Integer Fields

2013-05-10 Thread Deepthi Sigireddi (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-1913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Deepthi Sigireddi updated SOLR-1913:


Attachment: SOLR-1913-src.tar.gz

Hi Israel,
I took the src you attached to the JIRA and migrated it to Solr 4.2.1. It 
compiles now, and works as expected.
I haven't tried it with Solr 4.3, but presumably it will work fine with that 
too.
I'm attaching a tar ball (SOLR-1913-src.tar.gz) with the modified source.
Ankur, Christopher, please feel free to try this out and give your feedback.

 QParserPlugin plugin for Search Results Filtering Based on Bitwise Operations 
 on Integer Fields
 ---

 Key: SOLR-1913
 URL: https://issues.apache.org/jira/browse/SOLR-1913
 Project: Solr
  Issue Type: New Feature
  Components: search
Reporter: Israel Ekpo
 Fix For: 4.4

 Attachments: bitwise_filter_plugin.jar, SOLR-1913.bitwise.tar.gz, 
 SOLR-1913-src.tar.gz

   Original Estimate: 1h
  Remaining Estimate: 1h

 BitwiseQueryParserPlugin is a org.apache.solr.search.QParserPlugin that 
 allows 
 users to filter the documents returned from a query
 by performing bitwise operations between a particular integer field in the 
 index
 and the specified value.
 This Solr plugin is based on the BitwiseFilter in LUCENE-2460
 See https://issues.apache.org/jira/browse/LUCENE-2460 for more details
 This is the syntax for searching in Solr:
 http://localhost:8983/path/to/solr/select/?q={!bitwise field=fieldname 
 op=OPERATION_NAME source=sourcevalue negate=boolean}remainder of query
 Example :
 http://localhost:8983/solr/bitwise/select/?q={!bitwise field=user_permissions 
 op=AND source=3 negate=true}state:FL
 The negate parameter is optional
 The field parameter is the name of the integer field
 The op parameter is the name of the operation; one of {AND, OR, XOR}
 The source parameter is the specified integer value
 The negate parameter is a boolean indicating whether or not to negate the 
 results of the bitwise operation
 To test out this plugin, simply copy the jar file containing the plugin 
 classes into your $SOLR_HOME/lib directory and then
 add the following to your solrconfig.xml file after the dismax request 
 handler:
 queryParser name=bitwise 
 class=org.apache.solr.bitwise.BitwiseQueryParserPlugin basedOn=dismax /
 Restart your servlet container.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4018) Dispatcher to optionally write QTime and Hits HTTP header


 [ 
https://issues.apache.org/jira/browse/SOLR-4018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Markus Jelsma updated SOLR-4018:


Fix Version/s: 5.0

 Dispatcher to optionally write QTime and Hits HTTP header
 -

 Key: SOLR-4018
 URL: https://issues.apache.org/jira/browse/SOLR-4018
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.0
Reporter: Markus Jelsma
 Fix For: 5.0, 4.4

 Attachments: SOLR-4018-trunk-1.patch, SOLR-4018-trunk-2.patch, 
 SOLR-4018-trunk-3.patch, SOLR-4018-trunk-4.patch, SOLR-4018-trunk-5.patch


 SolrDispatchFilter should be able to write QTime and Hits HTTP headers via 
 configuration.
 {code}
   requestDispatcher handleSelect=true 
 requestParsers enableRemoteStreaming=false 
 multipartUploadLimitInKB=2048000 /
 httpCaching never304=true /
 httpHeaders qtime=true hits=true/
   /requestDispatcher
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654506#comment-13654506
 ] 

Commit Tag Bot commented on SOLR-4760:
--

[lucene_solr_4_3 commit] erick
http://svn.apache.org/viewvc?view=revisionrevision=1481022

backporting test fix for SOLR-4760 to 4.3.1

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch, SOLR-4760-testfix.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4760) Improve logging messages during startup to better identify core


[ 
https://issues.apache.org/jira/browse/SOLR-4760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654507#comment-13654507
 ] 

Erick Erickson commented on SOLR-4760:
--

backported to 4.3.1, r: 1481022

 Improve logging messages during startup to better identify core
 ---

 Key: SOLR-4760
 URL: https://issues.apache.org/jira/browse/SOLR-4760
 Project: Solr
  Issue Type: Wish
  Components: Schema and Analysis
Affects Versions: 4.3
Reporter: Alexandre Rafalovitch
Priority: Minor
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: SOLR-4760.patch, SOLR-4760.patch, SOLR-4760-testfix.patch


 Some log messages could be more informative. For example:
 {code}
 680 [coreLoadExecutor-3-thread-3] WARN org.apache.solr.schema.IndexSchema  – 
 schema has no name!
 {code}
 Would be _very nice_ to know which core this is complaining about.
 Later, once the core is loaded, the core name shows up in the logs,
 but it would be nice to have it earlier without having to
 triangulating it through 'Loading core' messages.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654515#comment-13654515
 ] 

Commit Tag Bot commented on SOLR-4751:
--

[trunk commit] markrmiller
http://svn.apache.org/viewvc?view=revisionrevision=1481030

SOLR-4751: revert for now

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654516#comment-13654516
 ] 

Commit Tag Bot commented on SOLR-4751:
--

[branch_4x commit] markrmiller
http://svn.apache.org/viewvc?view=revisionrevision=1481031

SOLR-4751: revert for now

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.

2013-05-10 Thread Mark Miller (JIRA)


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654517#comment-13654517
 ] 

Mark Miller commented on SOLR-4751:
---

Seems a test fail is exposing an issue with this. I've reverted this for the 
moment.

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: disable solr jenkins tasks?

2013-05-10 Thread Shawn Heisey

 I've noticed that nearly all solr tests (225 tests failing) have been
 failing for hours in jenkins.

 I have serious doubts whether any solr committers are even looking at test
 failures. Can we disable solr jobs in jenkins so that lucene gets more
 test
 coverage?

I will take a look when my commute ends and do my best to make some
progress. Some of those failures were my fault on SOLR-4760.

If the tests continue to fail, +1 to disabling.

Thanks,
Shawn



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4791) solr.xml sharedLib does not work in 4.3.0


[ 
https://issues.apache.org/jira/browse/SOLR-4791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654526#comment-13654526
 ] 

Commit Tag Bot commented on SOLR-4791:
--

[trunk commit] erick
http://svn.apache.org/viewvc?view=revisionrevision=1481038

added to CHANGES.txt for SOLR-4791

 solr.xml sharedLib does not work in 4.3.0
 -

 Key: SOLR-4791
 URL: https://issues.apache.org/jira/browse/SOLR-4791
 Project: Solr
  Issue Type: Bug
  Components: multicore
Affects Versions: 4.3
Reporter: Jan Høydahl
Assignee: Erick Erickson
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: closeLoader.patch, SOLR-4791.patch, SOLR-4791.patch, 
 SOLR-4791.patch, SOLR-4791-test.patch


 The sharedLib attribute on {{solr}} tag in solr.xml stopped working in 4.3.
 Using old-style solr.xml with sharedLib defined on solr tag. Solr does not 
 load any of them. Simply swapping out solr.war with the 4.2.1 one, brings 
 sharedLib loading back.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Make tests.failfast default to true

I dont have any strong opinion really, i just want to mention on the side
you can also change your 'personal defaults' by editing build.properties in
your home directory too.

On Fri, May 10, 2013 at 9:27 AM, Shai Erera ser...@gmail.com wrote:

 Because I think it makes more sense as a default? Usually when I run with
 tests.iters, I just search for a seed that fails. When I find it, I go fix
 the bug and run again.

 But that's just me. That's why I asked if others think it makes sense as a
 default too. There is no functionality change, just different default.

 Shai
 On May 10, 2013 3:33 PM, Robert Muir rcm...@gmail.com wrote:

 why change it?

 On Fri, May 10, 2013 at 1:20 AM, Shai Erera ser...@gmail.com wrote:

 Hi

 Can we default tests.failfast to true? I don't know how common it is to
 run with -Dtests.iters=N, not specify -Dtests.maxfailures AND wanting to
 see all N tests finish.

 At any rate, if someone wishes to, he can always specify
 -Dtests.failfast=false. It now depends what is the default behavior we want
 to have.

 Any thoughts?

 Shai

Re: Make tests.failfast default to true

Thanks for the tip. If it's just me, I won't make the change. I just find
it annoying to have to type it everytime, and think that this is probably
what you intend to achieve when running w/ tests.iters -- search for a seed
that trips the tests.

If there are no objections, I'll make the trivial change later.

Shai


On Fri, May 10, 2013 at 5:55 PM, Robert Muir rcm...@gmail.com wrote:

 I dont have any strong opinion really, i just want to mention on the side
 you can also change your 'personal defaults' by editing build.properties in
 your home directory too.


 On Fri, May 10, 2013 at 9:27 AM, Shai Erera ser...@gmail.com wrote:

 Because I think it makes more sense as a default? Usually when I run with
 tests.iters, I just search for a seed that fails. When I find it, I go fix
 the bug and run again.

 But that's just me. That's why I asked if others think it makes sense as
 a default too. There is no functionality change, just different default.

 Shai
 On May 10, 2013 3:33 PM, Robert Muir rcm...@gmail.com wrote:

 why change it?

 On Fri, May 10, 2013 at 1:20 AM, Shai Erera ser...@gmail.com wrote:

 Hi

 Can we default tests.failfast to true? I don't know how common it is to
 run with -Dtests.iters=N, not specify -Dtests.maxfailures AND wanting to
 see all N tests finish.

 At any rate, if someone wishes to, he can always specify
 -Dtests.failfast=false. It now depends what is the default behavior we want
 to have.

 Any thoughts?

 Shai

[jira] [Commented] (SOLR-4791) solr.xml sharedLib does not work in 4.3.0


[ 
https://issues.apache.org/jira/browse/SOLR-4791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654533#comment-13654533
 ] 

Commit Tag Bot commented on SOLR-4791:
--

[lucene_solr_4_3 commit] erick
http://svn.apache.org/viewvc?view=revisionrevision=1481043

added to CHANGES.txt for SOLR-4791

 solr.xml sharedLib does not work in 4.3.0
 -

 Key: SOLR-4791
 URL: https://issues.apache.org/jira/browse/SOLR-4791
 Project: Solr
  Issue Type: Bug
  Components: multicore
Affects Versions: 4.3
Reporter: Jan Høydahl
Assignee: Erick Erickson
 Fix For: 5.0, 4.4, 4.3.1

 Attachments: closeLoader.patch, SOLR-4791.patch, SOLR-4791.patch, 
 SOLR-4791.patch, SOLR-4791-test.patch


 The sharedLib attribute on {{solr}} tag in solr.xml stopped working in 4.3.
 Using old-style solr.xml with sharedLib defined on solr tag. Solr does not 
 load any of them. Simply swapping out solr.war with the 4.2.1 one, brings 
 sharedLib loading back.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Make tests.failfast default to true

I'm still hoping for the elusive tests.beast or whatever that gives each
run a whole different seed and acts like the shellscripts that run 'ant
test -Dtestcase' over and over :)

On Fri, May 10, 2013 at 10:59 AM, Shai Erera ser...@gmail.com wrote:

 Thanks for the tip. If it's just me, I won't make the change. I just find
 it annoying to have to type it everytime, and think that this is probably
 what you intend to achieve when running w/ tests.iters -- search for a seed
 that trips the tests.

 If there are no objections, I'll make the trivial change later.

 Shai


 On Fri, May 10, 2013 at 5:55 PM, Robert Muir rcm...@gmail.com wrote:

 I dont have any strong opinion really, i just want to mention on the side
 you can also change your 'personal defaults' by editing build.properties in
 your home directory too.


 On Fri, May 10, 2013 at 9:27 AM, Shai Erera ser...@gmail.com wrote:

 Because I think it makes more sense as a default? Usually when I run
 with tests.iters, I just search for a seed that fails. When I find it, I go
 fix the bug and run again.

 But that's just me. That's why I asked if others think it makes sense as
 a default too. There is no functionality change, just different default.

 Shai
 On May 10, 2013 3:33 PM, Robert Muir rcm...@gmail.com wrote:

 why change it?

 On Fri, May 10, 2013 at 1:20 AM, Shai Erera ser...@gmail.com wrote:

 Hi

 Can we default tests.failfast to true? I don't know how common it is
 to run with -Dtests.iters=N, not specify -Dtests.maxfailures AND wanting 
 to
 see all N tests finish.

 At any rate, if someone wishes to, he can always specify
 -Dtests.failfast=false. It now depends what is the default behavior we 
 want
 to have.

 Any thoughts?

 Shai

Re: Make tests.failfast default to true

Me too! :)

I was hoping tests.dups would do it. Dawid promised me he will one day find
the time to do it! :)

In the meanwhile, Mike pointed me to repeatLuceneTest.py under luceneutil.
I've yet been successful to run it, but I think it's just a matter of
configuration.

Maybe if you have a shellscript that's generic enough, it's worth putting
under dev-tools?

Shai


On Fri, May 10, 2013 at 6:06 PM, Robert Muir rcm...@gmail.com wrote:

 I'm still hoping for the elusive tests.beast or whatever that gives each
 run a whole different seed and acts like the shellscripts that run 'ant
 test -Dtestcase' over and over :)


 On Fri, May 10, 2013 at 10:59 AM, Shai Erera ser...@gmail.com wrote:

 Thanks for the tip. If it's just me, I won't make the change. I just find
 it annoying to have to type it everytime, and think that this is probably
 what you intend to achieve when running w/ tests.iters -- search for a seed
 that trips the tests.

 If there are no objections, I'll make the trivial change later.

 Shai


 On Fri, May 10, 2013 at 5:55 PM, Robert Muir rcm...@gmail.com wrote:

 I dont have any strong opinion really, i just want to mention on the
 side you can also change your 'personal defaults' by editing
 build.properties in your home directory too.


 On Fri, May 10, 2013 at 9:27 AM, Shai Erera ser...@gmail.com wrote:

 Because I think it makes more sense as a default? Usually when I run
 with tests.iters, I just search for a seed that fails. When I find it, I go
 fix the bug and run again.

 But that's just me. That's why I asked if others think it makes sense
 as a default too. There is no functionality change, just different default.

 Shai
 On May 10, 2013 3:33 PM, Robert Muir rcm...@gmail.com wrote:

 why change it?

 On Fri, May 10, 2013 at 1:20 AM, Shai Erera ser...@gmail.com wrote:

 Hi

 Can we default tests.failfast to true? I don't know how common it is
 to run with -Dtests.iters=N, not specify -Dtests.maxfailures AND wanting 
 to
 see all N tests finish.

 At any rate, if someone wishes to, he can always specify
 -Dtests.failfast=false. It now depends what is the default behavior we 
 want
 to have.

 Any thoughts?

 Shai

[jira] [Commented] (LUCENE-4956) the korean analyzer that has a korean morphological analyzer and dictionaries

2013-05-10 Thread Steve Rowe (JIRA)

[
https://issues.apache.org/jira/browse/LUCENE-4956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654554#comment-13654554
]

Steve Rowe commented on LUCENE-4956:

bq. Yesterday I called a vote for this contribution on
gene...@incubator.apache.org:
[http://mail-archives.apache.org/mod_mbox/incubator-general/201305.mbox/%3c7ad4d4e3-530b-41e3-8323-da3d66a40...@apache.org%3e]

This vote has passed, so we're now free to incorporate this contribution into
the code base when and as we see fit.

the korean analyzer that has a korean morphological analyzer and dictionaries
-

Key: LUCENE-4956
URL: https://issues.apache.org/jira/browse/LUCENE-4956
Project: Lucene - Core
Issue Type: New Feature
Components: modules/analysis
Affects Versions: 4.2
Reporter: SooMyung Lee
Assignee: Christian Moen
Labels: newbie
Attachments: kr.analyzer.4x.tar

Korean language has specific characteristic. When developing search service
with lucene solr in korean, there are some problems in searching and
indexing. The korean analyer solved the problems with a korean morphological
anlyzer. It consists of a korean morphological analyzer, dictionaries, a
korean tokenizer and a korean filter. The korean anlyzer is made for lucene
and solr. If you develop a search service with lucene in korean, It is the
best idea to choose the korean analyzer.

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4751) The replication problem of the file in a subdirectory.


 [ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated SOLR-4751:
-

Fix Version/s: (was: 4.4)
   (was: 5.0)

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654556#comment-13654556
 ] 

Koji Sekiguchi commented on SOLR-4751:
--

Sorry for the inconvenience.

I'll contact the reporter and see if we can fix the tests.

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4996) DocInverterPerField to log which field throws exceptions


[ 
https://issues.apache.org/jira/browse/LUCENE-4996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654557#comment-13654557
 ] 

Robert Muir commented on LUCENE-4996:
-

+1. Thanks Markus. I'll take this one.

 DocInverterPerField to log which field throws exceptions
 

 Key: LUCENE-4996
 URL: https://issues.apache.org/jira/browse/LUCENE-4996
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Reporter: Markus Jelsma
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-4996-trunk.patch


 One of ours fields seems to have a problem that didn't result in an exception 
 before. It seems one of my filters doesn't deal with posIncAttr properly but 
 Lucene did not log which of my numerous fields was the problem.
 This patch includes the field name in the exception.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4790) When defining a core with the same name (discovery mode or not), CoreContainer should throw an error


[ 
https://issues.apache.org/jira/browse/SOLR-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654577#comment-13654577
 ] 

Commit Tag Bot commented on SOLR-4790:
--

[trunk commit] erick
http://svn.apache.org/viewvc?view=revisionrevision=1481079

SOLR-4790, Defining a core with the same name should throw an error

 When defining a core with the same name (discovery mode or not), 
 CoreContainer should throw an error
 

 Key: SOLR-4790
 URL: https://issues.apache.org/jira/browse/SOLR-4790
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.3, 5.0
Reporter: Erick Erickson
Assignee: Erick Erickson
 Attachments: SOLR-4790.patch, SOLR-4790.patch


 When you define a core with the same name as another core (discovery mode 
 definitely, old-style xml probably), last one wins. Which means it's very 
 hard to track down what caused the problem.
 What's worse, the last-encountered core replaces the first one, leading to 
 cores that change an unexpected index.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4996) DocInverterPerField to log which field throws exceptions


[ 
https://issues.apache.org/jira/browse/LUCENE-4996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654579#comment-13654579
 ] 

Commit Tag Bot commented on LUCENE-4996:


[trunk commit] rmuir
http://svn.apache.org/viewvc?view=revisionrevision=1481080

LUCENE-4996: Include field name in all DocInverter exceptions

 DocInverterPerField to log which field throws exceptions
 

 Key: LUCENE-4996
 URL: https://issues.apache.org/jira/browse/LUCENE-4996
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Reporter: Markus Jelsma
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-4996-trunk.patch


 One of ours fields seems to have a problem that didn't result in an exception 
 before. It seems one of my filters doesn't deal with posIncAttr properly but 
 Lucene did not log which of my numerous fields was the problem.
 This patch includes the field name in the exception.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4996) DocInverterPerField to log which field throws exceptions


[ 
https://issues.apache.org/jira/browse/LUCENE-4996?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654583#comment-13654583
 ] 

Commit Tag Bot commented on LUCENE-4996:


[branch_4x commit] rmuir
http://svn.apache.org/viewvc?view=revisionrevision=1481081

LUCENE-4996: Include field name in all DocInverter exceptions

 DocInverterPerField to log which field throws exceptions
 

 Key: LUCENE-4996
 URL: https://issues.apache.org/jira/browse/LUCENE-4996
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Reporter: Markus Jelsma
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-4996-trunk.patch


 One of ours fields seems to have a problem that didn't result in an exception 
 before. It seems one of my filters doesn't deal with posIncAttr properly but 
 Lucene did not log which of my numerous fields was the problem.
 This patch includes the field name in the exception.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (LUCENE-4996) DocInverterPerField to log which field throws exceptions


 [ 
https://issues.apache.org/jira/browse/LUCENE-4996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir resolved LUCENE-4996.
-

Resolution: Fixed

Thanks again!

 DocInverterPerField to log which field throws exceptions
 

 Key: LUCENE-4996
 URL: https://issues.apache.org/jira/browse/LUCENE-4996
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Reporter: Markus Jelsma
Priority: Minor
 Fix For: 5.0, 4.4

 Attachments: LUCENE-4996-trunk.patch


 One of ours fields seems to have a problem that didn't result in an exception 
 before. It seems one of my filters doesn't deal with posIncAttr properly but 
 Lucene did not log which of my numerous fields was the problem.
 This patch includes the field name in the exception.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4790) When defining a core with the same name (discovery mode or not), CoreContainer should throw an error


 [ 
https://issues.apache.org/jira/browse/SOLR-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson updated SOLR-4790:
-

Attachment: SOLR-4790.patch

Final patch

 When defining a core with the same name (discovery mode or not), 
 CoreContainer should throw an error
 

 Key: SOLR-4790
 URL: https://issues.apache.org/jira/browse/SOLR-4790
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.3, 5.0
Reporter: Erick Erickson
Assignee: Erick Erickson
 Attachments: SOLR-4790.patch, SOLR-4790.patch, SOLR-4790.patch


 When you define a core with the same name as another core (discovery mode 
 definitely, old-style xml probably), last one wins. Which means it's very 
 hard to track down what caused the problem.
 What's worse, the last-encountered core replaces the first one, leading to 
 cores that change an unexpected index.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (SOLR-4751) The replication problem of the file in a subdirectory.


[ 
https://issues.apache.org/jira/browse/SOLR-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654597#comment-13654597
 ] 

Commit Tag Bot commented on SOLR-4751:
--

[lucene_solr_4_3 commit] koji
http://svn.apache.org/viewvc?view=revisionrevision=1481094

revert SOLR-4751 change

 The replication problem of the file in a subdirectory.
 --

 Key: SOLR-4751
 URL: https://issues.apache.org/jira/browse/SOLR-4751
 Project: Solr
  Issue Type: Bug
  Components: replication (java)
Affects Versions: 4.2.1
Reporter: Minoru Osuka
Assignee: Koji Sekiguchi
Priority: Minor
 Attachments: SOLR-4751.patch


 When set lang/stopwords_ja.txt to confFiles in replication settings,
 {code:xml}
   requestHandler name=/replication class=solr.ReplicationHandler 
lst name=master
  str name=replicateAftercommit/str
  str name=replicateAfterstartup/str
  str 
 name=confFilesschema.xml,stopwords.txt,lang/stopwords_ja.txt/str
/lst
   /requestHandler
 {code}
 Only stopwords_ja.txt exists in solr/collection1/conf/lang directory on slave 
 node.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4991) QueryParser doesnt handle synonyms correctly for chinese


[ 
https://issues.apache.org/jira/browse/LUCENE-4991?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654604#comment-13654604
 ] 

Commit Tag Bot commented on LUCENE-4991:


[trunk commit] rmuir
http://svn.apache.org/viewvc?view=revisionrevision=1481100

LUCENE-4991: QueryParser doesnt handle synonyms correctly for chinese

 QueryParser doesnt handle synonyms correctly for chinese
 

 Key: LUCENE-4991
 URL: https://issues.apache.org/jira/browse/LUCENE-4991
 Project: Lucene - Core
  Issue Type: Bug
  Components: modules/queryparser
Reporter: Robert Muir
 Attachments: LUCENE-4991.patch


 As reported multiple times on the user list:
 http://find.searchhub.org/document/eaf0e88a6a0d4d1f
 http://find.searchhub.org/document/abf28043c52b6efc
 http://find.searchhub.org/document/1313794632c90826
 The logic here is not forming the right query structures and ignoring 
 positionIncrementAttribute from the tokenStream.
 * when default operator is AND, you can see it more clearly, as synonyms are 
 wrongly inserted as additional MUST terms:
 expected:+field:中 +(field:国 field:國) 
 but was:+field:中 +field:国 +field:國
 * even when default operator is OR, its still wrong, because we ignore posInc 
 and this means coord computation is not correct (so scoring is wrong)
 This also screws up scoring and queries for decompounding too (because they 
 go thru this exact situation if they add the original compound as a synonym).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Updated] (SOLR-4787) Join Contrib

2013-05-10 Thread Joel Bernstein (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-4787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joel Bernstein updated SOLR-4787:
-

Attachment: SOLR-4787.patch

Added vjoin2 and broke out several inner classes into there own class.

 Join Contrib
 

 Key: SOLR-4787
 URL: https://issues.apache.org/jira/browse/SOLR-4787
 Project: Solr
  Issue Type: New Feature
  Components: search
Affects Versions: 4.2.1
Reporter: Joel Bernstein
Priority: Minor
 Fix For: 4.2.1

 Attachments: SOLR-4787.patch, SOLR-4787.patch, SOLR-4787.patch, 
 SOLR-4787.patch, SOLR-4787.patch, SOLR-4787.patch


 This contrib provides a place where different join implementations can be 
 contributed to Solr. This contrib currently includes 2 join implementations. 
 The initial patch was generated from the Solr 4.2.1 tag. Because of changes 
 in the FieldCache API this patch will only build with Solr 4.2 or above.
 *PostFilterJoinQParserPlugin aka pjoin*
 The pjoin provides a join implementation that filters results in one core 
 based on the results of a search in another core. This is similar in 
 functionality to the JoinQParserPlugin but the implementation differs in a 
 couple of important ways.
 The first way is that the pjoin is designed to work with integer join keys 
 only. So, in order to use pjoin, integer join keys must be included in both 
 the to and from core.
 The second difference is that the pjoin builds memory structures that are 
 used to quickly connect the join keys. It also uses a custom SolrCache named 
 join to hold intermediate DocSets which are needed to build the join memory 
 structures. So, the pjoin will need more memory then the JoinQParserPlugin to 
 perform the join.
 The main advantage of the pjoin is that it can scale to join millions of keys 
 between cores.
 Because it's a PostFilter, it only needs to join records that match the main 
 query.
 The syntax of the pjoin is the same as the JoinQParserPlugin except that the 
 plugin is referenced by the string pjoin rather then join.
 fq=\{!pjoin fromCore=collection2 from=id_i to=id_i\}user:customer1
 The example filter query above will search the fromCore (collection2) for 
 user:customer1. This query will generate a list of values from the from 
 field that will be used to filter the main query. Only records from the main 
 query, where the to field is present in the from list will be included in 
 the results.
 The solrconfig.xml in the main query core must contain the reference to the 
 pjoin.
 queryParser name=pjoin 
 class=org.apache.solr.joins.PostFilterJoinQParserPlugin/
 And the join contrib jars must be registed in the solrconfig.xml.
 lib dir=../../../dist/ regex=solr-joins-\d.*\.jar /
 The solrconfig.xml in the fromcore must have the join SolrCache configured.
  cache name=join
   class=solr.LRUCache
   size=4096
   initialSize=1024
   /
 *JoinValueSourceParserPlugin aka vjoin*
 The second implementation is the JoinValueSourceParserPlugin aka vjoin. 
 This implements a ValueSource function query that can return values from a 
 second core based on join keys. This allows relevance data to be stored in a 
 separate core and then joined in the main query.
 The vjoin is called using the vjoin function query. For example:
 bf=vjoin(fromCore, fromKey, fromVal, toKey)
 This example shows vjoin being called by the edismax boost function 
 parameter. This example will return the fromVal from the fromCore. The 
 fromKey and toKey are used to link the records from the main query to the 
 records in the fromCore.
 As with the pjoin, both the fromKey and toKey must be integers. Also like 
 the pjoin, the join SolrCache is used to hold the join memory structures.
 To configure the vjoin you must register the ValueSource plugin in the 
 solrconfig.xml as follows:
  valueSourceParser name=vjoin 
 class=org.apache.solr.joins.JoinValueSourceParserPlugin /

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [Discussion] Discontinue the ROLLBACK command in Solr?

2013-05-10 Thread Varun Thacker

Hi,

Rollback is a good feature for people who are using Solr in non SolrCloud
mode.

I like Jan's idea of exposing IW.commit(MapString,String commitUserData)
to Solr. A quick look at this shows that this doesn't exist yet (
http://wiki.apache.org/solr/UpdateXmlMessages#A.22commit.22_and_.22optimize.22
)

Then we could probably let the user pass the commitUserData which he tagged
the commit with to rollback, and perform a rollback. Basically rollbacks
can only be preformed to a commit which has been explicitly tagged.

I think this should solve a lot of problems regarding rollback. If this is
solvable using this method I'll create the necessary JIRA's to fix it.

This method should also fix this issue pointed out I think.
https://issues.apache.org/jira/browse/SOLR-4733

Mark, This idea or any other approach still wouldn't support rollbacks in
SolrCloud right?



On Fri, May 10, 2013 at 5:21 AM, Jan Høydahl jan@cominvent.com wrote:

 Hi,

 Thanks for clarifying, I have been thinking that full RAMbuffer triggered
 a commit, but I now see the difference between a flush of a segment and
 further down the road a commit which then links all these intermediate
 segments with that particular commit.

 So ROLLBACK closes the writer but instead of persisting docs with a
 commit, it cleans up all uncommitted (flushed or not) doc and files. This
 is actually not as bad as I thought.

 So the main catch with ROLLBACK then is if COMMITs are done by other
 clients or if autoCommit or commitWithin is being used. However, after
 reading http://blog.mikemccandless.com/2012/03/transactional-lucene.htmlI see 
 that it is possible to tag COMMITs with userData, so we could make
 the rollback actually roll back to the last explicit commit (still assuming
 you're the only client around) if, say AutoCommits and CommitWithins are
 tagged as such.

 So I'm OK if you guys rather want to start improve the ROLLBACK API
 instead of deprecating it. A client should be certain that ALL docs since
 last commit gets rolled back, else the rollback command must fail.

  --
 Jan Høydahl, search solution architect
 Cominvent AS - www.cominvent.com

 9. mai 2013 kl. 20:49 skrev Mikhail Khludnev mkhlud...@griddynamics.com:

 Mark,
 thanks for confirming my feeling.

 Hence, I ask for leaving it for old-school deployments. However, it's a
 good thing to clearly indicate to the user that it's not implemented for
 tlog. What about log WARN/ERROR and/or throw an exception with explanation?



 On Thu, May 9, 2013 at 10:28 AM, Jack Krupansky 
 j...@basetechnology.comwrote:

 This does highlight that rollback is perfectly reasonable for embedded
 Solr or other dedicated, single-node, well-controlled apps, even if it
 doesn't work for cloud.

 -- Jack Krupansky

 -Original Message- From: Mark Miller
 Sent: Thursday, May 09, 2013 1:09 PM

 To: dev@lucene.apache.org
 Subject: Re: [Discussion] Discontinue the ROLLBACK command in Solr?

 RAMbuffer doesn't affect this stuff - if you rollback, it rolls back to
 the last commit point. Flushed segments are fine.

 - Mark

 On May 9, 2013, at 12:54 PM, Mikhail Khludnev mkhlud...@griddynamics.com
 wrote:

  Jan,
 Could you please clarify current rollback functionality to me? Let's I
 have autoCommit disabled, but my RAMbuffer is not huge, hence I have few
 flushes after previous commit. if I invoke rollback will it forget about
 flushed segments?



 On Thu, May 2, 2013 at 12:38 AM, Jan Høydahl jan@cominvent.com
 wrote:
 Hi,

 Many are confused about the rollback feature in Solr, since it cannot
 guarantee a rollback of updates from that client since last commit.

 In my opinion it is pretty useless to have a rollback feature you cannot
 rely upon - Unless, that is, you are the only client for sure, having no
 autoCommit, and a huge RAMbuffer.

 So why don't we simply deprecate the feature in 4.x and remove it from
 5.0?

 --
 Jan Høydahl, search solution architect
 Cominvent AS - www.cominvent.com
 Solr Training - www.solrtraining.com


 --**--**
 -
 To unsubscribe, e-mail: 
 dev-unsubscribe@lucene.apache.**orgdev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org




 --
 Sincerely yours
 Mikhail Khludnev
 Principal Engineer,
 Grid Dynamics




 --**--**-
 To unsubscribe, e-mail: 
 dev-unsubscribe@lucene.apache.**orgdev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org

 --**--**-
 To unsubscribe, e-mail: 
 dev-unsubscribe@lucene.apache.**orgdev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org




 --
 Sincerely yours
 Mikhail Khludnev
 Principal Engineer,
 Grid Dynamics

 http://www.griddynamics.com/
  mkhlud...@griddynamics.com





-- 


Regards,
Varun Thacker
http://www.vthacker.in/

[jira] [Updated] (SOLR-4787) Join Contrib

2013-05-10 Thread Joel Bernstein (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-4787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joel Bernstein updated SOLR-4787:
-

Description: 
This contrib provides a place where different join implementations can be 
contributed to Solr. This contrib currently includes 3 join implementations. 
The initial patch was generated from the Solr 4.2.1 tag. Because of changes in 
the FieldCache API this patch will only build with Solr 4.2 or above.

*PostFilterJoinQParserPlugin aka pjoin*

The pjoin provides a join implementation that filters results in one core based 
on the results of a search in another core. This is similar in functionality to 
the JoinQParserPlugin but the implementation differs in a couple of important 
ways.

The first way is that the pjoin is designed to work with integer join keys 
only. So, in order to use pjoin, integer join keys must be included in both the 
to and from core.

The second difference is that the pjoin builds memory structures that are used 
to quickly connect the join keys. It also uses a custom SolrCache named join 
to hold intermediate DocSets which are needed to build the join memory 
structures. So, the pjoin will need more memory then the JoinQParserPlugin to 
perform the join.

The main advantage of the pjoin is that it can scale to join millions of keys 
between cores.

Because it's a PostFilter, it only needs to join records that match the main 
query.

The syntax of the pjoin is the same as the JoinQParserPlugin except that the 
plugin is referenced by the string pjoin rather then join.

fq=\{!pjoin fromCore=collection2 from=id_i to=id_i\}user:customer1

The example filter query above will search the fromCore (collection2) for 
user:customer1. This query will generate a list of values from the from 
field that will be used to filter the main query. Only records from the main 
query, where the to field is present in the from list will be included in 
the results.

The solrconfig.xml in the main query core must contain the reference to the 
pjoin.

queryParser name=pjoin 
class=org.apache.solr.joins.PostFilterJoinQParserPlugin/

And the join contrib jars must be registed in the solrconfig.xml.

lib dir=../../../dist/ regex=solr-joins-\d.*\.jar /

The solrconfig.xml in the fromcore must have the join SolrCache configured.

 cache name=join
  class=solr.LRUCache
  size=4096
  initialSize=1024
  /


*JoinValueSourceParserPlugin aka vjoin*

The second implementation is the JoinValueSourceParserPlugin aka vjoin. This 
implements a ValueSource function query that can return values from a second 
core based on join keys. This allows relevance data to be stored in a separate 
core and then joined in the main query.

The vjoin is called using the vjoin function query. For example:

bf=vjoin(fromCore, fromKey, fromVal, toKey)

This example shows vjoin being called by the edismax boost function 
parameter. This example will return the fromVal from the fromCore. The 
fromKey and toKey are used to link the records from the main query to the 
records in the fromCore.

As with the pjoin, both the fromKey and toKey must be integers. Also like the 
pjoin, the join SolrCache is used to hold the join memory structures.

To configure the vjoin you must register the ValueSource plugin in the 
solrconfig.xml as follows:

valueSourceParser name=vjoin 
class=org.apache.solr.joins.JoinValueSourceParserPlugin /

*JoinValueSourceParserPlugin2 aka vjoin2*

vjoin2 supports personalized ValueSource joins. The syntax is similar to 
vjoin but adds an extra parameter so a query can be specified to join a 
specific record set from the fromCore. This is designed to allow customer 
specific relevance information to be added to a separate core and then joined 
at query time.


Syntax:

bf=vjoin2(fromCore,fromKey,fromVal,toKey,query)







  was:
This contrib provides a place where different join implementations can be 
contributed to Solr. This contrib currently includes 2 join implementations. 
The initial patch was generated from the Solr 4.2.1 tag. Because of changes in 
the FieldCache API this patch will only build with Solr 4.2 or above.

*PostFilterJoinQParserPlugin aka pjoin*

The pjoin provides a join implementation that filters results in one core based 
on the results of a search in another core. This is similar in functionality to 
the JoinQParserPlugin but the implementation differs in a couple of important 
ways.

The first way is that the pjoin is designed to work with integer join keys 
only. So, in order to use pjoin, integer join keys must be included in both the 
to and from core.

The second difference is that the pjoin builds memory structures that are used 
to quickly connect the join keys. It also uses a custom SolrCache named join 
to hold intermediate DocSets which are needed to build the join memory 
structures. So, the pjoin will need more memory then the

[jira] [Updated] (SOLR-4787) Join Contrib

2013-05-10 Thread Joel Bernstein (JIRA)


 [ 
https://issues.apache.org/jira/browse/SOLR-4787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joel Bernstein updated SOLR-4787:
-

Description: 
This contrib provides a place where different join implementations can be 
contributed to Solr. This contrib currently includes 3 join implementations. 
The initial patch was generated from the Solr 4.2.1 tag. Because of changes in 
the FieldCache API this patch will only build with Solr 4.2 or above.

*PostFilterJoinQParserPlugin aka pjoin*

The pjoin provides a join implementation that filters results in one core based 
on the results of a search in another core. This is similar in functionality to 
the JoinQParserPlugin but the implementation differs in a couple of important 
ways.

The first way is that the pjoin is designed to work with integer join keys 
only. So, in order to use pjoin, integer join keys must be included in both the 
to and from core.

The second difference is that the pjoin builds memory structures that are used 
to quickly connect the join keys. It also uses a custom SolrCache named join 
to hold intermediate DocSets which are needed to build the join memory 
structures. So, the pjoin will need more memory then the JoinQParserPlugin to 
perform the join.

The main advantage of the pjoin is that it can scale to join millions of keys 
between cores.

Because it's a PostFilter, it only needs to join records that match the main 
query.

The syntax of the pjoin is the same as the JoinQParserPlugin except that the 
plugin is referenced by the string pjoin rather then join.

fq=\{!pjoin fromCore=collection2 from=id_i to=id_i\}user:customer1

The example filter query above will search the fromCore (collection2) for 
user:customer1. This query will generate a list of values from the from 
field that will be used to filter the main query. Only records from the main 
query, where the to field is present in the from list will be included in 
the results.

The solrconfig.xml in the main query core must contain the reference to the 
pjoin.

queryParser name=pjoin 
class=org.apache.solr.joins.PostFilterJoinQParserPlugin/

And the join contrib jars must be registed in the solrconfig.xml.

lib dir=../../../dist/ regex=solr-joins-\d.*\.jar /

The solrconfig.xml in the fromcore must have the join SolrCache configured.

 cache name=join
  class=solr.LRUCache
  size=4096
  initialSize=1024
  /


*JoinValueSourceParserPlugin aka vjoin*

The second implementation is the JoinValueSourceParserPlugin aka vjoin. This 
implements a ValueSource function query that can return values from a second 
core based on join keys. This allows relevance data to be stored in a separate 
core and then joined in the main query.

The vjoin is called using the vjoin function query. For example:

bf=vjoin(fromCore, fromKey, fromVal, toKey)

This example shows vjoin being called by the edismax boost function 
parameter. This example will return the fromVal from the fromCore. The 
fromKey and toKey are used to link the records from the main query to the 
records in the fromCore.

As with the pjoin, both the fromKey and toKey must be integers. Also like the 
pjoin, the join SolrCache is used to hold the join memory structures.

To configure the vjoin you must register the ValueSource plugin in the 
solrconfig.xml as follows:

valueSourceParser name=vjoin 
class=org.apache.solr.joins.JoinValueSourceParserPlugin /

*JoinValueSourceParserPlugin2 aka vjoin2*

vjoin2 supports personalized ValueSource joins. The syntax is similar to 
vjoin but adds an extra parameter so a query can be specified to join a 
specific record set from the fromCore. This is designed to allow customer 
specific relevance information to be added to fromCore and then joined at query 
time.


Syntax:

bf=vjoin2(fromCore,fromKey,fromVal,toKey,query)







  was:
This contrib provides a place where different join implementations can be 
contributed to Solr. This contrib currently includes 3 join implementations. 
The initial patch was generated from the Solr 4.2.1 tag. Because of changes in 
the FieldCache API this patch will only build with Solr 4.2 or above.

*PostFilterJoinQParserPlugin aka pjoin*

The pjoin provides a join implementation that filters results in one core based 
on the results of a search in another core. This is similar in functionality to 
the JoinQParserPlugin but the implementation differs in a couple of important 
ways.

The first way is that the pjoin is designed to work with integer join keys 
only. So, in order to use pjoin, integer join keys must be included in both the 
to and from core.

The second difference is that the pjoin builds memory structures that are used 
to quickly connect the join keys. It also uses a custom SolrCache named join 
to hold intermediate DocSets which are needed to build the join memory 
structures. So, the pjoin will need more memory then the

[jira] [Commented] (SOLR-4790) When defining a core with the same name (discovery mode or not), CoreContainer should throw an error


[ 
https://issues.apache.org/jira/browse/SOLR-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13654618#comment-13654618
 ] 

Commit Tag Bot commented on SOLR-4790:
--

[branch_4x commit] erick
http://svn.apache.org/viewvc?view=revisionrevision=148

SOLR-4790, Defining a core with the same name should throw an error

 When defining a core with the same name (discovery mode or not), 
 CoreContainer should throw an error
 

 Key: SOLR-4790
 URL: https://issues.apache.org/jira/browse/SOLR-4790
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.3, 5.0
Reporter: Erick Erickson
Assignee: Erick Erickson
 Attachments: SOLR-4790.patch, SOLR-4790.patch, SOLR-4790.patch


 When you define a core with the same name as another core (discovery mode 
 definitely, old-style xml probably), last one wins. Which means it's very 
 hard to track down what caused the problem.
 What's worse, the last-encountered core replaces the first one, leading to 
 cores that change an unexpected index.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Resolved] (SOLR-4790) When defining a core with the same name (discovery mode or not), CoreContainer should throw an error


 [ 
https://issues.apache.org/jira/browse/SOLR-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Erick Erickson resolved SOLR-4790.
--

   Resolution: Fixed
Fix Version/s: 4.4
   5.0

 When defining a core with the same name (discovery mode or not), 
 CoreContainer should throw an error
 

 Key: SOLR-4790
 URL: https://issues.apache.org/jira/browse/SOLR-4790
 Project: Solr
  Issue Type: Improvement
Affects Versions: 4.3, 5.0
Reporter: Erick Erickson
Assignee: Erick Erickson
 Fix For: 5.0, 4.4

 Attachments: SOLR-4790.patch, SOLR-4790.patch, SOLR-4790.patch


 When you define a core with the same name as another core (discovery mode 
 definitely, old-style xml probably), last one wins. Which means it's very 
 hard to track down what caused the problem.
 What's worse, the last-encountered core replaces the first one, leading to 
 cores that change an unexpected index.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: [Discussion] Discontinue the ROLLBACK command in Solr?

2013-05-10 Thread Shalin Shekhar Mangar

There's an issue open: https://issues.apache.org/jira/browse/SOLR-2701


On Fri, May 10, 2013 at 10:14 AM, Varun Thacker
varunthacker1...@gmail.comwrote:

 Hi,

 Rollback is a good feature for people who are using Solr in non SolrCloud
 mode.

 I like Jan's idea of exposing IW.commit(MapString,String commitUserData)
 to Solr. A quick look at this shows that this doesn't exist yet (
 http://wiki.apache.org/solr/UpdateXmlMessages#A.22commit.22_and_.22optimize.22
 )

 Then we could probably let the user pass the commitUserData which he
 tagged the commit with to rollback, and perform a rollback. Basically
 rollbacks can only be preformed to a commit which has been explicitly
 tagged.

 I think this should solve a lot of problems regarding rollback. If this
 is solvable using this method I'll create the necessary JIRA's to fix it.

 This method should also fix this issue pointed out I think.
 https://issues.apache.org/jira/browse/SOLR-4733

 Mark, This idea or any other approach still wouldn't support rollbacks in
 SolrCloud right?



 On Fri, May 10, 2013 at 5:21 AM, Jan Høydahl jan@cominvent.comwrote:

 Hi,

 Thanks for clarifying, I have been thinking that full RAMbuffer triggered
 a commit, but I now see the difference between a flush of a segment and
 further down the road a commit which then links all these intermediate
 segments with that particular commit.

 So ROLLBACK closes the writer but instead of persisting docs with a
 commit, it cleans up all uncommitted (flushed or not) doc and files. This
 is actually not as bad as I thought.

  So the main catch with ROLLBACK then is if COMMITs are done by other
 clients or if autoCommit or commitWithin is being used. However, after
 reading http://blog.mikemccandless.com/2012/03/transactional-lucene.htmlI 
 see that it is possible to tag COMMITs with userData, so we could make
 the rollback actually roll back to the last explicit commit (still assuming
 you're the only client around) if, say AutoCommits and CommitWithins are
 tagged as such.

 So I'm OK if you guys rather want to start improve the ROLLBACK API
 instead of deprecating it. A client should be certain that ALL docs since
 last commit gets rolled back, else the rollback command must fail.

  --
 Jan Høydahl, search solution architect
 Cominvent AS - www.cominvent.com

 9. mai 2013 kl. 20:49 skrev Mikhail Khludnev mkhlud...@griddynamics.com
 :

 Mark,
 thanks for confirming my feeling.

 Hence, I ask for leaving it for old-school deployments. However, it's a
 good thing to clearly indicate to the user that it's not implemented for
 tlog. What about log WARN/ERROR and/or throw an exception with explanation?



 On Thu, May 9, 2013 at 10:28 AM, Jack Krupansky 
 j...@basetechnology.comwrote:

 This does highlight that rollback is perfectly reasonable for embedded
 Solr or other dedicated, single-node, well-controlled apps, even if it
 doesn't work for cloud.

 -- Jack Krupansky

 -Original Message- From: Mark Miller
 Sent: Thursday, May 09, 2013 1:09 PM

 To: dev@lucene.apache.org
 Subject: Re: [Discussion] Discontinue the ROLLBACK command in Solr?

 RAMbuffer doesn't affect this stuff - if you rollback, it rolls back to
 the last commit point. Flushed segments are fine.

 - Mark

 On May 9, 2013, at 12:54 PM, Mikhail Khludnev 
 mkhlud...@griddynamics.com wrote:

  Jan,
 Could you please clarify current rollback functionality to me? Let's I
 have autoCommit disabled, but my RAMbuffer is not huge, hence I have few
 flushes after previous commit. if I invoke rollback will it forget about
 flushed segments?



 On Thu, May 2, 2013 at 12:38 AM, Jan Høydahl jan@cominvent.com
 wrote:
 Hi,

 Many are confused about the rollback feature in Solr, since it cannot
 guarantee a rollback of updates from that client since last commit.

 In my opinion it is pretty useless to have a rollback feature you
 cannot rely upon - Unless, that is, you are the only client for sure,
 having no autoCommit, and a huge RAMbuffer.

 So why don't we simply deprecate the feature in 4.x and remove it from
 5.0?

 --
 Jan Høydahl, search solution architect
 Cominvent AS - www.cominvent.com
 Solr Training - www.solrtraining.com


 --**--**
 -
 To unsubscribe, e-mail: 
 dev-unsubscribe@lucene.apache.**orgdev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org




 --
 Sincerely yours
 Mikhail Khludnev
 Principal Engineer,
 Grid Dynamics




 --**--**
 -
 To unsubscribe, e-mail: 
 dev-unsubscribe@lucene.apache.**orgdev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org

 --**--**
 -
 To unsubscribe, e-mail: 
 dev-unsubscribe@lucene.apache.**orgdev-unsubscr...@lucene.apache.org
 For additional commands, e-mail: dev-h...@lucene.apache.org




 --

[jira] [Created] (SOLR-4808) Persist and use replication factor at Collection and Shard level

2013-05-10 Thread Anshum Gupta (JIRA)

Anshum Gupta created SOLR-4808:
--

 Summary: Persist and use replication factor at Collection and 
Shard level
 Key: SOLR-4808
 URL: https://issues.apache.org/jira/browse/SOLR-4808
 Project: Solr
  Issue Type: New Feature
  Components: SolrCloud
Reporter: Anshum Gupta


The replication factor for a collection as of now is not persisted and used 
while adding replicas.
We should save the replication factor at collection factor as well as shard 
level.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-4991) QueryParser doesnt handle synonyms correctly for chinese