[jira] [Resolved] (LUCENE-5407) Deadlock? while indexing in cascaded threads

2014-01-22 Thread Simon Willnauer (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Simon Willnauer resolved LUCENE-5407.
-

Resolution: Not A Problem
  Assignee: Simon Willnauer

closing... this is not what the IW was designed for.

 Deadlock? while indexing in cascaded threads
 

 Key: LUCENE-5407
 URL: https://issues.apache.org/jira/browse/LUCENE-5407
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: Windows 7 64 bits, JRE 1.7.0_25 64 bits
Reporter: Luis Filipe Nassif
Assignee: Simon Willnauer
 Attachments: Test.java, thread_dump.txt


 Apparently I found a deadlock problem with IndexWriter in a cascaded thread 
 design to add documents (I am working on an application integrating Tika, 
 which has the capability to add embedded documents to the index as 
 independent documents as they are found). The attached code illustrates the 
 problem. Sometimes it stops processing, at least one of the threads remains 
 in WAITING state. It must be executed no more than 5 times in my environment 
 to trigger the problem.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5407) Deadlock? while indexing in cascaded threads

2014-01-22 Thread Simon Willnauer (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878397#comment-13878397
 ] 

Simon Willnauer commented on LUCENE-5407:
-

There is no way this would work here. Your first thread enters the index writer 
and gets a threadstate and then once it pulls the fields reader you enter the 
indexwriter again to add a sub-document. This sub-document IW#addDocument 
call tries to get a thread-state as well but since there are only 8 
ThreadStates by default you run into a deadlock since for the parent-doc 
thread to make progress the sub-documents thread must return from 
IW#addDocument but it can't since it waits for another thread to release one of 
the ThreadStates. So yes that is a deadlock situation but I really don't see 
any way this could work. I guess further this approach is not what I recommend 
you do anyways. If you want to index those documents in a stream then I guess a 
producer - queue - consumer approach might be best suited.  

 Deadlock? while indexing in cascaded threads
 

 Key: LUCENE-5407
 URL: https://issues.apache.org/jira/browse/LUCENE-5407
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: Windows 7 64 bits, JRE 1.7.0_25 64 bits
Reporter: Luis Filipe Nassif
 Attachments: Test.java, thread_dump.txt


 Apparently I found a deadlock problem with IndexWriter in a cascaded thread 
 design to add documents (I am working on an application integrating Tika, 
 which has the capability to add embedded documents to the index as 
 independent documents as they are found). The attached code illustrates the 
 problem. Sometimes it stops processing, at least one of the threads remains 
 in WAITING state. It must be executed no more than 5 times in my environment 
 to trigger the problem.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5476) Overseer Role for nodes

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878403#comment-13878403
 ] 

ASF subversion and git services commented on SOLR-5476:
---

Commit 1560279 from [~noble.paul] in branch 'dev/trunk'
[ https://svn.apache.org/r1560279 ]

SOLR-5476 hardened a bit

 Overseer Role for nodes
 ---

 Key: SOLR-5476
 URL: https://issues.apache.org/jira/browse/SOLR-5476
 Project: Solr
  Issue Type: Sub-task
  Components: SolrCloud
Reporter: Noble Paul
Assignee: Noble Paul
 Fix For: 5.0, 4.7

 Attachments: SOLR-5476.patch, SOLR-5476.patch, SOLR-5476.patch, 
 SOLR-5476.patch, SOLR-5476.patch


 In a very large cluster the Overseer is likely to be overloaded.If the same 
 node is a serving a few other shards it can lead to OverSeer getting slowed 
 down due to GC pauses , or simply too much of work  . If the cluster is 
 really large , it is possible to dedicate high end h/w for OverSeers
 It works as a new collection admin command
 command=addrolerole=overseernode=192.168.1.5:8983_solr
 This results in the creation of a entry in the /roles.json in ZK which would 
 look like the following
 {code:javascript}
 {
 overseer : [192.168.1.5:8983_solr]
 }
 {code}
 If a node is designated for overseer it gets preference over others when 
 overseer election takes place. If no designated servers are available another 
 random node would become the Overseer.
 Later on, if one of the designated nodes are brought up ,it would take over 
 the Overseer role from the current Overseer to become the Overseer of the 
 system



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5476) Overseer Role for nodes

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878405#comment-13878405
 ] 

ASF subversion and git services commented on SOLR-5476:
---

Commit 1560280 from [~noble.paul] in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1560280 ]

SOLR-5476 hardened a bit

 Overseer Role for nodes
 ---

 Key: SOLR-5476
 URL: https://issues.apache.org/jira/browse/SOLR-5476
 Project: Solr
  Issue Type: Sub-task
  Components: SolrCloud
Reporter: Noble Paul
Assignee: Noble Paul
 Fix For: 5.0, 4.7

 Attachments: SOLR-5476.patch, SOLR-5476.patch, SOLR-5476.patch, 
 SOLR-5476.patch, SOLR-5476.patch


 In a very large cluster the Overseer is likely to be overloaded.If the same 
 node is a serving a few other shards it can lead to OverSeer getting slowed 
 down due to GC pauses , or simply too much of work  . If the cluster is 
 really large , it is possible to dedicate high end h/w for OverSeers
 It works as a new collection admin command
 command=addrolerole=overseernode=192.168.1.5:8983_solr
 This results in the creation of a entry in the /roles.json in ZK which would 
 look like the following
 {code:javascript}
 {
 overseer : [192.168.1.5:8983_solr]
 }
 {code}
 If a node is designated for overseer it gets preference over others when 
 overseer election takes place. If no designated servers are available another 
 random node would become the Overseer.
 Later on, if one of the designated nodes are brought up ,it would take over 
 the Overseer role from the current Overseer to become the Overseer of the 
 system



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5476) Overseer Role for nodes

2014-01-22 Thread Noble Paul (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878407#comment-13878407
 ] 

Noble Paul commented on SOLR-5476:
--

 [~markrmil...@gmail.com] What is the sure shot way for 
Overseer/OverseerCollectionProcessor to quit and somebody else take its place? 
The failures are due to the leader not giving up

 Overseer Role for nodes
 ---

 Key: SOLR-5476
 URL: https://issues.apache.org/jira/browse/SOLR-5476
 Project: Solr
  Issue Type: Sub-task
  Components: SolrCloud
Reporter: Noble Paul
Assignee: Noble Paul
 Fix For: 5.0, 4.7

 Attachments: SOLR-5476.patch, SOLR-5476.patch, SOLR-5476.patch, 
 SOLR-5476.patch, SOLR-5476.patch


 In a very large cluster the Overseer is likely to be overloaded.If the same 
 node is a serving a few other shards it can lead to OverSeer getting slowed 
 down due to GC pauses , or simply too much of work  . If the cluster is 
 really large , it is possible to dedicate high end h/w for OverSeers
 It works as a new collection admin command
 command=addrolerole=overseernode=192.168.1.5:8983_solr
 This results in the creation of a entry in the /roles.json in ZK which would 
 look like the following
 {code:javascript}
 {
 overseer : [192.168.1.5:8983_solr]
 }
 {code}
 If a node is designated for overseer it gets preference over others when 
 overseer election takes place. If no designated servers are available another 
 random node would become the Overseer.
 Later on, if one of the designated nodes are brought up ,it would take over 
 the Overseer role from the current Overseer to become the Overseer of the 
 system



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5476) Overseer Role for nodes

2014-01-22 Thread Shalin Shekhar Mangar (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878453#comment-13878453
 ] 

Shalin Shekhar Mangar commented on SOLR-5476:
-

Perhaps you can create your own instance of overseer and force election just 
like OverseerTest does with OverseerRestarter?

 Overseer Role for nodes
 ---

 Key: SOLR-5476
 URL: https://issues.apache.org/jira/browse/SOLR-5476
 Project: Solr
  Issue Type: Sub-task
  Components: SolrCloud
Reporter: Noble Paul
Assignee: Noble Paul
 Fix For: 5.0, 4.7

 Attachments: SOLR-5476.patch, SOLR-5476.patch, SOLR-5476.patch, 
 SOLR-5476.patch, SOLR-5476.patch


 In a very large cluster the Overseer is likely to be overloaded.If the same 
 node is a serving a few other shards it can lead to OverSeer getting slowed 
 down due to GC pauses , or simply too much of work  . If the cluster is 
 really large , it is possible to dedicate high end h/w for OverSeers
 It works as a new collection admin command
 command=addrolerole=overseernode=192.168.1.5:8983_solr
 This results in the creation of a entry in the /roles.json in ZK which would 
 look like the following
 {code:javascript}
 {
 overseer : [192.168.1.5:8983_solr]
 }
 {code}
 If a node is designated for overseer it gets preference over others when 
 overseer election takes place. If no designated servers are available another 
 random node would become the Overseer.
 Later on, if one of the designated nodes are brought up ,it would take over 
 the Overseer role from the current Overseer to become the Overseer of the 
 system



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



JDK 8 Build 124 JDK 7 Update 60 build 03 are available on java.net

2014-01-22 Thread Rory O'Donnell Oracle, Dublin Ireland

Hi Uwe/Dawid,

JDK 8 Build b124 Early Access Build is now available for download 
http://jdk8.java.net/download.html  test.
JDK 7u60 b03 Early Access Build is also available for download 
https://jdk7.java.net/download.html test.


See you at FOSDEM next week.

Thanks for your support, Rory

--
Rgds,Rory O'Donnell
Quality Engineering Manager
Oracle EMEA , Dublin, Ireland



[jira] [Commented] (SOLR-5476) Overseer Role for nodes

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878534#comment-13878534
 ] 

ASF subversion and git services commented on SOLR-5476:
---

Commit 1560314 from [~noble.paul] in branch 'dev/trunk'
[ https://svn.apache.org/r1560314 ]

SOLR-5476 ignored till there is a reliable way to restart Overseer

 Overseer Role for nodes
 ---

 Key: SOLR-5476
 URL: https://issues.apache.org/jira/browse/SOLR-5476
 Project: Solr
  Issue Type: Sub-task
  Components: SolrCloud
Reporter: Noble Paul
Assignee: Noble Paul
 Fix For: 5.0, 4.7

 Attachments: SOLR-5476.patch, SOLR-5476.patch, SOLR-5476.patch, 
 SOLR-5476.patch, SOLR-5476.patch


 In a very large cluster the Overseer is likely to be overloaded.If the same 
 node is a serving a few other shards it can lead to OverSeer getting slowed 
 down due to GC pauses , or simply too much of work  . If the cluster is 
 really large , it is possible to dedicate high end h/w for OverSeers
 It works as a new collection admin command
 command=addrolerole=overseernode=192.168.1.5:8983_solr
 This results in the creation of a entry in the /roles.json in ZK which would 
 look like the following
 {code:javascript}
 {
 overseer : [192.168.1.5:8983_solr]
 }
 {code}
 If a node is designated for overseer it gets preference over others when 
 overseer election takes place. If no designated servers are available another 
 random node would become the Overseer.
 Later on, if one of the designated nodes are brought up ,it would take over 
 the Overseer role from the current Overseer to become the Overseer of the 
 system



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5476) Overseer Role for nodes

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5476?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878537#comment-13878537
 ] 

ASF subversion and git services commented on SOLR-5476:
---

Commit 1560315 from [~noble.paul] in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1560315 ]

SOLR-5476 needs a reliable way to restart Overseer

 Overseer Role for nodes
 ---

 Key: SOLR-5476
 URL: https://issues.apache.org/jira/browse/SOLR-5476
 Project: Solr
  Issue Type: Sub-task
  Components: SolrCloud
Reporter: Noble Paul
Assignee: Noble Paul
 Fix For: 5.0, 4.7

 Attachments: SOLR-5476.patch, SOLR-5476.patch, SOLR-5476.patch, 
 SOLR-5476.patch, SOLR-5476.patch


 In a very large cluster the Overseer is likely to be overloaded.If the same 
 node is a serving a few other shards it can lead to OverSeer getting slowed 
 down due to GC pauses , or simply too much of work  . If the cluster is 
 really large , it is possible to dedicate high end h/w for OverSeers
 It works as a new collection admin command
 command=addrolerole=overseernode=192.168.1.5:8983_solr
 This results in the creation of a entry in the /roles.json in ZK which would 
 look like the following
 {code:javascript}
 {
 overseer : [192.168.1.5:8983_solr]
 }
 {code}
 If a node is designated for overseer it gets preference over others when 
 overseer election takes place. If no designated servers are available another 
 random node would become the Overseer.
 Later on, if one of the designated nodes are brought up ,it would take over 
 the Overseer role from the current Overseer to become the Overseer of the 
 system



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-5651) Make labels clickable and the output wrapped in UI

2014-01-22 Thread Stefan Matheis (steffkes) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-5651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stefan Matheis (steffkes) updated SOLR-5651:


Fix Version/s: (was: 4.6.1)
   4.7
   5.0

 Make labels clickable and the output wrapped in UI
 --

 Key: SOLR-5651
 URL: https://issues.apache.org/jira/browse/SOLR-5651
 Project: Solr
  Issue Type: Bug
  Components: web gui
Affects Versions: 4.6
Reporter: Artem Lukanin
 Fix For: 5.0, 4.7

 Attachments: SOLR-5651_clickable_labels.patch, 
 SOLR-5651_clickable_labels.patch


 Now it is practically impossible to tick off check boxes in the UI, because 
 the label text is wrapped in a fancy A tag without HREF.
 Also it is very hard to read the output on the Query menu when 
 debugQuery=true. It is impossible to scroll the window to the right. It would 
 be better if the text in the PRE tag was wrapped.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-5651) Make labels clickable and the output wrapped in UI

2014-01-22 Thread Stefan Matheis (steffkes) (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-5651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Stefan Matheis (steffkes) updated SOLR-5651:


Component/s: web gui

 Make labels clickable and the output wrapped in UI
 --

 Key: SOLR-5651
 URL: https://issues.apache.org/jira/browse/SOLR-5651
 Project: Solr
  Issue Type: Bug
  Components: web gui
Affects Versions: 4.6
Reporter: Artem Lukanin
 Fix For: 5.0, 4.7

 Attachments: SOLR-5651_clickable_labels.patch, 
 SOLR-5651_clickable_labels.patch


 Now it is practically impossible to tick off check boxes in the UI, because 
 the label text is wrapped in a fancy A tag without HREF.
 Also it is very hard to read the output on the Query menu when 
 debugQuery=true. It is impossible to scroll the window to the right. It would 
 be better if the text in the PRE tag was wrapped.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: [VOTE] Release Lucene/Solr 4.6.1 RC2

2014-01-22 Thread Jan Høydahl
+1 

SUCCESS! [0:29:25.421656] -- Ran from a local cached copy of the files :)

--
Jan Høydahl, search solution architect
Cominvent AS - www.cominvent.com

20. jan. 2014 kl. 03:29 skrev Mark Miller markrmil...@gmail.com:

 Please vote to release the following artifacts:
 
 http://people.apache.org/~markrmiller/lucene_solr_4_6_1r1559610/
 
 Here is my +1.
 
 SUCCESS! [0:51:51.964657]
 
 --
 - Mark
 



[jira] [Commented] (LUCENE-5377) Lucene 4.6 cause mixed version segments info file(.si) wrong

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878572#comment-13878572
 ] 

Michael McCandless commented on LUCENE-5377:


Indeed, something is wrong here.

If I fix DirectoryTaxonomyWriter to use IW.addIndexes(IR[]) instead of 
IW.addIndexes(Directory[]) then the test passes.  The bug is due to this code 
in IndexWriter.addIndces(Dir[]):

{noformat}
TrackingDirectoryWrapper trackingDir = new 
TrackingDirectoryWrapper(directory);
try {
  
newInfo.getCodec().segmentInfoFormat().getSegmentInfoWriter().write(trackingDir,
 newInfo, fis, context);
} catch (UnsupportedOperationException uoe) {
  // OK: 3x codec cannot write a new SI file;
  // SegmentInfos will write this on commit
}
{noformat}

The problem is ... this logic is wrong: the segment was written with 
Lucene40SegmentInfoFormat, which works in 4.5.x but in 4.6.x we upgraded this 
to Lucene46SIFormat...

 Lucene 4.6 cause mixed version segments info file(.si) wrong
 

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index, modules/facet
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5407) Deadlock? while indexing in cascaded threads

2014-01-22 Thread Luis Filipe Nassif (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5407?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878593#comment-13878593
 ] 

Luis Filipe Nassif commented on LUCENE-5407:


Thank you for comment.  Although there are less than 8 threads in example, I 
agree the general case will not work.

 Deadlock? while indexing in cascaded threads
 

 Key: LUCENE-5407
 URL: https://issues.apache.org/jira/browse/LUCENE-5407
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: Windows 7 64 bits, JRE 1.7.0_25 64 bits
Reporter: Luis Filipe Nassif
Assignee: Simon Willnauer
 Attachments: Test.java, thread_dump.txt


 Apparently I found a deadlock problem with IndexWriter in a cascaded thread 
 design to add documents (I am working on an application integrating Tika, 
 which has the capability to add embedded documents to the index as 
 independent documents as they are found). The attached code illustrates the 
 problem. Sometimes it stops processing, at least one of the threads remains 
 in WAITING state. It must be executed no more than 5 times in my environment 
 to trigger the problem.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) Lucene 4.6 cause mixed version segments info file(.si) wrong

2014-01-22 Thread Shai Erera (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878604#comment-13878604
 ] 

Shai Erera commented on LUCENE-5377:


Isn't it then a problem of anyone that will try to addIndexes a pre-4.6 index 
to a 4.6+?

I looked at TestBackCompat.addOldIndexes - it doesn't do anything with the 
added index to verify it's OK. I added 
{{DirectoryReader.open(targetDir).close();}} to the test after {{w.close()}}, 
so at least we verify the index can be opened, but the test doesn't fail, not 
sure why. The newest oldNames is 4.2 - shouldn't a 4.5 have been created when 
we released 4.6, since we changed the index format? But then a 4.0/4.2 index 
should equally fail to be added to a 4.6, right? So maybe this doesn't explain 
the failure to reproduce this error...

 Lucene 4.6 cause mixed version segments info file(.si) wrong
 

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index, modules/facet
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3565) StandardQueryParser generates incorrect query for groups containing one term if using default operator AND

2014-01-22 Thread Trejkaz (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878608#comment-13878608
 ] 

Trejkaz commented on LUCENE-3565:
-

The expectation in the test case aside, what Lucene was actually returning was 
+text:a +text:b, ignoring the OR entirely.

This seems to have been fixed somewhere between 3.6.0 and 3.6.2, though.


 StandardQueryParser generates incorrect query for groups containing one term 
 if using default operator AND
 --

 Key: LUCENE-3565
 URL: https://issues.apache.org/jira/browse/LUCENE-3565
 Project: Lucene - Core
  Issue Type: Bug
  Components: modules/queryparser
Affects Versions: 3.4
Reporter: Trejkaz
  Labels: queryparser

 The following test demonstrates a bug in StandardQueryParser when 
 DEFAULT_OPERATOR is set to AND:
 {code}
 @Test
 public void testDefaultOperatorSingleGrouped() throws Exception
 {
 StandardQueryParser parser = new StandardQueryParser();
 
 parser.getQueryConfigHandler().set(StandardQueryConfigHandler.ConfigurationKeys.DEFAULT_OPERATOR,
  StandardQueryConfigHandler.Operator.AND);
 BooleanQuery expectedQuery = new BooleanQuery();
 BooleanQuery innerQuery1 = new BooleanQuery();
 innerQuery1.add(new TermQuery(new Term(text, a)), 
 BooleanClause.Occur.MUST);
 expectedQuery.add(innerQuery1, BooleanClause.Occur.SHOULD);
 BooleanQuery innerQuery2 = new BooleanQuery();
 innerQuery2.add(new TermQuery(new Term(text, b)), 
 BooleanClause.Occur.MUST);
 expectedQuery.add(innerQuery2, BooleanClause.Occur.SHOULD);
 Query actualQuery = parser.parse((a) OR (b), text);
 assertEquals(Wrong query was generated, expectedQuery, actualQuery);
 }
 {code}
 BooleanSingleChildOptimizationQueryNodeProcessor appears to be responsible 
 because if I remove it, the correct query is generated (however, doing so 
 breaks a number of other tests of our own.)



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3565) StandardQueryParser generates incorrect query for groups containing one term if using default operator AND

2014-01-22 Thread Trejkaz (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878615#comment-13878615
 ] 

Trejkaz commented on LUCENE-3565:
-

And actually, if you read my original comment from 2011, I accepted that both 
being SHOULD would have been a valid result too. But that is definitely not 
what it was generating. :)


 StandardQueryParser generates incorrect query for groups containing one term 
 if using default operator AND
 --

 Key: LUCENE-3565
 URL: https://issues.apache.org/jira/browse/LUCENE-3565
 Project: Lucene - Core
  Issue Type: Bug
  Components: modules/queryparser
Affects Versions: 3.4
Reporter: Trejkaz
  Labels: queryparser

 The following test demonstrates a bug in StandardQueryParser when 
 DEFAULT_OPERATOR is set to AND:
 {code}
 @Test
 public void testDefaultOperatorSingleGrouped() throws Exception
 {
 StandardQueryParser parser = new StandardQueryParser();
 
 parser.getQueryConfigHandler().set(StandardQueryConfigHandler.ConfigurationKeys.DEFAULT_OPERATOR,
  StandardQueryConfigHandler.Operator.AND);
 BooleanQuery expectedQuery = new BooleanQuery();
 BooleanQuery innerQuery1 = new BooleanQuery();
 innerQuery1.add(new TermQuery(new Term(text, a)), 
 BooleanClause.Occur.MUST);
 expectedQuery.add(innerQuery1, BooleanClause.Occur.SHOULD);
 BooleanQuery innerQuery2 = new BooleanQuery();
 innerQuery2.add(new TermQuery(new Term(text, b)), 
 BooleanClause.Occur.MUST);
 expectedQuery.add(innerQuery2, BooleanClause.Occur.SHOULD);
 Query actualQuery = parser.parse((a) OR (b), text);
 assertEquals(Wrong query was generated, expectedQuery, actualQuery);
 }
 {code}
 BooleanSingleChildOptimizationQueryNodeProcessor appears to be responsible 
 because if I remove it, the correct query is generated (however, doing so 
 breaks a number of other tests of our own.)



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) Lucene 4.6 cause mixed version segments info file(.si) wrong

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878686#comment-13878686
 ] 

Robert Muir commented on LUCENE-5377:
-

First of all, some test should fail.

IndexWriter.addIndexes(Dir) is so dangerous...

To fix the bug, i think we have to support write logic in the old .si writer? 
And only allow the UOE if its really 3.x codec...

Long term this is bogus. We should either remove IndexWriter.addIndexes(Dir) or 
change things so it doesnt need to write .si at all here (e.g. put partial 
filenames in the .si that don't include segment prefix)

 Lucene 4.6 cause mixed version segments info file(.si) wrong
 

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index, modules/facet
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3069) Lucene should have an entirely memory resident term dictionary

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878687#comment-13878687
 ] 

Michael McCandless commented on LUCENE-3069:


Note that all the commit messages at the end of this issue (generated by Jira's 
FishEye plugin I think) incorrectly state that Han Lee committed changes here.

This is due to an issue in FishEye with username collision ... Han Jiang's (who 
really committed here) apache username is han, but in Jira that user name 
belongs to Han Lee, which leads to this mis-labeling.

Here's the INFRA issue: https://issues.apache.org/jira/browse/INFRA-3243 but 
it's currently WONTFIX unfortunately ...

 Lucene should have an entirely memory resident term dictionary
 --

 Key: LUCENE-3069
 URL: https://issues.apache.org/jira/browse/LUCENE-3069
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index, core/search
Affects Versions: 4.0-ALPHA
Reporter: Simon Willnauer
Assignee: Han Jiang
  Labels: gsoc2013
 Fix For: 4.7

 Attachments: LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, df-ttf-estimate.txt, 
 example.png


 FST based TermDictionary has been a great improvement yet it still uses a 
 delta codec file for scanning to terms. Some environments have enough memory 
 available to keep the entire FST based term dict in memory. We should add a 
 TermDictionary implementation that encodes all needed information for each 
 term into the FST (custom fst.Output) and builds a FST from the entire term 
 not just the delta.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-5598) LanguageIdentifierUpdateProcessor ignores all but the first value of multiValued string fields

2014-01-22 Thread Vitaliy Zhovtyuk (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-5598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Zhovtyuk updated SOLR-5598:
---

Attachment: SOLR-5598.patch

Added content logging in case of non-string value
Added test for multivalue field with 1st value as empty string
Added test for multivalue field with values in 2languages (most en). resolved 
as en.

 LanguageIdentifierUpdateProcessor ignores all but the first value of 
 multiValued string fields
 --

 Key: SOLR-5598
 URL: https://issues.apache.org/jira/browse/SOLR-5598
 Project: Solr
  Issue Type: Bug
  Components: contrib - LangId
Affects Versions: 4.5.1
Reporter: Andreas Hubold
 Fix For: 4.7

 Attachments: SOLR-5598.patch, SOLR-5598.patch


 The LanguageIdentifierUpdateProcessor just uses the first value of the 
 multiValued field to detect the language. 
 Method {{concatFields}} calls {{doc.getFieldValue(fieldName)}} but should 
 instead iterate over {{doc.getFieldValues(fieldName)}}.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) Lucene 4.6 cause mixed version segments info file(.si) wrong

2014-01-22 Thread Shai Erera (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878701#comment-13878701
 ] 

Shai Erera commented on LUCENE-5377:


bq. First of all, some test should fail.

I modified TestBackCompatibility.testAddIndexes to do this:

{code}
boolean orig = OLD_FORMAT_IMPERSONATION_IS_ACTIVE;
OLD_FORMAT_IMPERSONATION_IS_ACTIVE = false;
// test code goes here
OLD_FORMAT_IMPERSONATION_IS_ACTIVE = orig;
{code}

And now it fails, because writing with old Codecs is not supported. Maybe it 
cannot be done like that for 4x because we need to support 3x indexes, but I 
haven't tried (because in 4x we catch and ignore the UOE, so I don't know yet 
how it will affect the test).

bq. or change things so it doesnt need to write .si at all here (e.g. put 
partial filenames in the .si that don't include segment prefix)

That's a good idea - if we strip the segment name from the files .si records 
(which is redundant anyway!), we won't need to rewrite .si. I believe though 
that we should fix this in 4.6.1?

 Lucene 4.6 cause mixed version segments info file(.si) wrong
 

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index, modules/facet
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5377) Lucene 4.6 cause mixed version segments info file(.si) wrong

2014-01-22 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-5377:


Component/s: (was: modules/facet)

 Lucene 4.6 cause mixed version segments info file(.si) wrong
 

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Michael McCandless (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless updated LUCENE-5377:
---

Summary: IW.addIndexes(Dir[]) causes silent index corruption  (was: Lucene 
4.6 cause mixed version segments info file(.si) wrong)

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: [VOTE] Release Lucene/Solr 4.6.1 RC2

2014-01-22 Thread Michael McCandless
I think we should respin after we fix
https://issues.apache.org/jira/browse/LUCENE-5377 ...

Mike McCandless

http://blog.mikemccandless.com


On Wed, Jan 22, 2014 at 6:42 AM, Jan Høydahl jan@cominvent.com wrote:
 +1

 SUCCESS! [0:29:25.421656] -- Ran from a local cached copy of the files
 :)

 --
 Jan Høydahl, search solution architect
 Cominvent AS - www.cominvent.com

 20. jan. 2014 kl. 03:29 skrev Mark Miller markrmil...@gmail.com:

 Please vote to release the following artifacts:

 http://people.apache.org/~markrmiller/lucene_solr_4_6_1r1559610/

 Here is my +1.

 SUCCESS! [0:51:51.964657]

 --
 - Mark



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878709#comment-13878709
 ] 

Robert Muir commented on LUCENE-5377:
-

{quote}
And now it fails, because writing with old Codecs is not supported. Maybe it 
cannot be done like that for 4x because we need to support 3x indexes, but I 
haven't tried (because in 4x we catch and ignore the UOE, so I don't know yet 
how it will affect the test).
{quote}

Oh, thanks for investigating. Now i understand how it got thru the tests. 
Impersonation masked the bug

{quote}
That's a good idea - if we strip the segment name from the files .si records 
(which is redundant anyway!), we won't need to rewrite .si. I believe though 
that we should fix this in 4.6.1?
{quote}

But thats the long-term fix. For the short term i think we should just allow 
the old .si format to be written. Very simple, very clean and conservative 
solution, just not elegant at all.

Yes, this is a corruption issue. I really think we should respin 4.6.1 for this.

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Shai Erera (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878719#comment-13878719
 ] 

Shai Erera commented on LUCENE-5377:


bq. For the short term i think we should just allow the old .si format to be 
written.

You mean to temporarily restore Lucene40SIWriter? That will also fix it :)

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878720#comment-13878720
 ] 

Robert Muir commented on LUCENE-5377:
-

Exactly. I think this (and fixing the test bugs here), is all we should do for 
4.6.1 here, lets do the segment-stripping separately?

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Shai Erera (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878726#comment-13878726
 ] 

Shai Erera commented on LUCENE-5377:


OK let's fix it separately. A new Codec ... just the thought ...

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878725#comment-13878725
 ] 

Michael McCandless commented on LUCENE-5377:


bq.  I think this (and fixing the test bugs here), is all we should do for 
4.6.1 here, lets do the segment-stripping separately?

+1

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878732#comment-13878732
 ] 

Robert Muir commented on LUCENE-5377:
-

the test infra is really screwed up here! There might be more hidden bugs!

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Assigned] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir reassigned LUCENE-5377:
---

Assignee: Robert Muir

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Attachments: TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Assigned] (LUCENE-5409) ToParentBlockJoinCollector.getTopGroups returns empty Groups

2014-01-22 Thread Michael McCandless (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless reassigned LUCENE-5409:
--

Assignee: Michael McCandless

 ToParentBlockJoinCollector.getTopGroups returns empty Groups
 

 Key: LUCENE-5409
 URL: https://issues.apache.org/jira/browse/LUCENE-5409
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/search
Affects Versions: 4.6
 Environment: Ubuntu 12.04
Reporter: Peng Cheng
Assignee: Michael McCandless
Priority: Critical
 Fix For: 4.7

   Original Estimate: 168h
  Remaining Estimate: 168h

 A bug is observed to cause unstable results returned by the getTopGroups 
 function of class ToParentBlockJoinCollector.
 In the scorer generation stage, the ToParentBlockJoinCollector will 
 automatically rewrite all the associated ToParentBlockJoinQuery (and their 
 subqueries), and save them into its in-memory Look-up table, namely 
 joinQueryID (see enroll() method for detail). Unfortunately, in the 
 getTopGroups method, the new ToParentBlockJoinQuery parameter is not 
 rewritten (at least users are not expected to do so). When the new one is 
 searched in the old lookup table (considering the impact of rewrite() on 
 hashCode()), the lookup will largely fail and eventually end up with a 
 topGroup collection consisting of only empty groups (their hitCounts are 
 guaranteed to be zero).
 An easy fix would be to rewrite the original BlockJoinQuery before invoking 
 getTopGroups method. However, the computational cost of this is not optimal. 
 A better but slightly more complex solution would be to save unrewrited 
 Queries into the lookup table.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5594) Enable using extended field types with prefix queries for non-default encoded strings

2014-01-22 Thread Anshum Gupta (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878743#comment-13878743
 ] 

Anshum Gupta commented on SOLR-5594:


Thanks for that info Steve. I was able to set the svn property (just that I was 
trying to accomplish that through my IDE, which didn't work).
Also, I'll use those ANT_OPTS to have stuff working for now.

 Enable using extended field types with prefix queries for non-default encoded 
 strings
 -

 Key: SOLR-5594
 URL: https://issues.apache.org/jira/browse/SOLR-5594
 Project: Solr
  Issue Type: Improvement
  Components: query parsers, Schema and Analysis
Affects Versions: 4.6
Reporter: Anshum Gupta
Assignee: Anshum Gupta
Priority: Minor
 Attachments: SOLR-5594-branch_4x.patch, SOLR-5594.patch, 
 SOLR-5594.patch, SOLR-5594.patch, SOLR-5594.patch, SOLR-5594.patch, 
 SOLR-5594.patch


 Enable users to be able to use prefix query with custom field types with 
 non-default encoding/decoding for queries more easily. e.g. having a custom 
 field work with base64 encoded query strings.
 Currently, the workaround for it is to have the override at getRewriteMethod 
 level. Perhaps having the prefixQuery also use the calling FieldType's 
 readableToIndexed method would work better.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-5377:


Attachment: LUCENE-5377.patch

Here is a patch.

Somehow, this impersonation flag dangerously got turned on by default.
Thats the root cause of the test issue.

Some tests that explicitly create old codecs for writing will fail with the 
patch. We have to fix them to use the boolean.

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Attachments: LUCENE-5377.patch, TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5409) ToParentBlockJoinCollector.getTopGroups returns empty Groups

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878749#comment-13878749
 ] 

Michael McCandless commented on LUCENE-5409:


Hi Peng, this is a good catch!  Probably we could key on the .origChildQuery 
since we already carry that one along.

Do you have a test case showing the issue?

 ToParentBlockJoinCollector.getTopGroups returns empty Groups
 

 Key: LUCENE-5409
 URL: https://issues.apache.org/jira/browse/LUCENE-5409
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/search
Affects Versions: 4.6
 Environment: Ubuntu 12.04
Reporter: Peng Cheng
Assignee: Michael McCandless
Priority: Critical
 Fix For: 4.7

   Original Estimate: 168h
  Remaining Estimate: 168h

 A bug is observed to cause unstable results returned by the getTopGroups 
 function of class ToParentBlockJoinCollector.
 In the scorer generation stage, the ToParentBlockJoinCollector will 
 automatically rewrite all the associated ToParentBlockJoinQuery (and their 
 subqueries), and save them into its in-memory Look-up table, namely 
 joinQueryID (see enroll() method for detail). Unfortunately, in the 
 getTopGroups method, the new ToParentBlockJoinQuery parameter is not 
 rewritten (at least users are not expected to do so). When the new one is 
 searched in the old lookup table (considering the impact of rewrite() on 
 hashCode()), the lookup will largely fail and eventually end up with a 
 topGroup collection consisting of only empty groups (their hitCounts are 
 guaranteed to be zero).
 An easy fix would be to rewrite the original BlockJoinQuery before invoking 
 getTopGroups method. However, the computational cost of this is not optimal. 
 A better but slightly more complex solution would be to save unrewrited 
 Queries into the lookup table.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Shai Erera (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878764#comment-13878764
 ] 

Shai Erera commented on LUCENE-5377:


I guess I messed this up when working on LUCENE-5196, turning impersonation to 
true by default. Sorry about that, hope it won't impact a lot of tests.

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Attachments: LUCENE-5377.patch, TestCore_45.java, TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3069) Lucene should have an entirely memory resident term dictionary

2014-01-22 Thread Mark Miller (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878765#comment-13878765
 ] 

Mark Miller commented on LUCENE-3069:
-

I think it's actually two different things - our commit messages are generated 
by a script that subscribes to svnpubsub, and it does some look ups to figure 
out the right user name. The fisheye stuff is what you see if you look at the 
source tab I think. So it might be easier to fix, since I think it's in our 
control (INFRA's anyway).

https://svn.apache.org/repos/infra/infrastructure/trunk/projects/svngit2jira/

 Lucene should have an entirely memory resident term dictionary
 --

 Key: LUCENE-3069
 URL: https://issues.apache.org/jira/browse/LUCENE-3069
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index, core/search
Affects Versions: 4.0-ALPHA
Reporter: Simon Willnauer
Assignee: Han Jiang
  Labels: gsoc2013
 Fix For: 4.7

 Attachments: LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, df-ttf-estimate.txt, 
 example.png


 FST based TermDictionary has been a great improvement yet it still uses a 
 delta codec file for scanning to terms. Some environments have enough memory 
 available to keep the entire FST based term dict in memory. We should add a 
 TermDictionary implementation that encodes all needed information for each 
 term into the FST (custom fst.Output) and builds a FST from the entire term 
 not just the delta.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-5377:


Attachment: LUCENE-5377.patch

I fixed all these tests here.

So this patch fixes the disease (in the test infra), and fixes 4.0 SIFormat to 
be able to write .si just for this purpose.

The bug does not really impact trunk, as it didnt have the dangerous 'catch 
UOE', on trunk the bug is the user would get an actual UOE. But this is fixed, 
as now the SIFormat can write.

I think this patch is ready: for the backport we have to ensure we only hide 
that UOE if its really Lucene3x, so an instanceof is really needed there.

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878773#comment-13878773
 ] 

Michael McCandless commented on LUCENE-5377:


+1, looks good.

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Mark Miller (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mark Miller updated LUCENE-5377:


Fix Version/s: 4.6.1
   4.7
   5.0

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Shai Erera (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878778#comment-13878778
 ] 

Shai Erera commented on LUCENE-5377:


+1, looks good!

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878790#comment-13878790
 ] 

Robert Muir commented on LUCENE-5377:
-

I opened a followup issue (LUCENE-5412) for the correct fix. But this one is 
too dangerous IMO for a bugfix release.

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (LUCENE-5412) SI.files should strip segment names

2014-01-22 Thread Robert Muir (JIRA)
Robert Muir created LUCENE-5412:
---

 Summary: SI.files should strip segment names
 Key: LUCENE-5412
 URL: https://issues.apache.org/jira/browse/LUCENE-5412
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir


Followup from LUCENE-5377:

the root cause of all the hair there is because SI.files (set of files used by 
the segment), must be rewritten in addIndexes because its invalidated by the 
change of segment name.

alternatively we could just not store this stuff, and strip the segment names. 
This is how CFS deals with this issue already.

I think we can do this in a backwards compatible way too: for old codecs, the 
readers could be changed to simply strip internally (e.g. ignore the extra 
information they wrote) ?



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5412) SI.files should strip segment names

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878798#comment-13878798
 ] 

Robert Muir commented on LUCENE-5412:
-

Exactly, something like that: I dont think we should change our high-level API?
SI.files should return the set of actual filenames as it does now.

This would just be a change to the codec API, to e.g. strip the stuff we pass 
to the SI Writer or whatever.
And the old codecs would strip what they read (as they wrote the unnecessary 
additional stuff).
And we remove the call to SIWriter in addIndexes :)

 SI.files should strip segment names
 ---

 Key: LUCENE-5412
 URL: https://issues.apache.org/jira/browse/LUCENE-5412
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir

 Followup from LUCENE-5377:
 the root cause of all the hair there is because SI.files (set of files used 
 by the segment), must be rewritten in addIndexes because its invalidated by 
 the change of segment name.
 alternatively we could just not store this stuff, and strip the segment 
 names. This is how CFS deals with this issue already.
 I think we can do this in a backwards compatible way too: for old codecs, the 
 readers could be changed to simply strip internally (e.g. ignore the extra 
 information they wrote) ?



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5412) SI.files should strip segment names

2014-01-22 Thread Shai Erera (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878796#comment-13878796
 ] 

Shai Erera commented on LUCENE-5412:


Could we change SegmentInfo.setFiles() to strip the segment name maybe? Just 
thinking of fewer places in the code to touch.

 SI.files should strip segment names
 ---

 Key: LUCENE-5412
 URL: https://issues.apache.org/jira/browse/LUCENE-5412
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir

 Followup from LUCENE-5377:
 the root cause of all the hair there is because SI.files (set of files used 
 by the segment), must be rewritten in addIndexes because its invalidated by 
 the change of segment name.
 alternatively we could just not store this stuff, and strip the segment 
 names. This is how CFS deals with this issue already.
 I think we can do this in a backwards compatible way too: for old codecs, the 
 readers could be changed to simply strip internally (e.g. ignore the extra 
 information they wrote) ?



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878800#comment-13878800
 ] 

ASF subversion and git services commented on LUCENE-5377:
-

Commit 1560401 from [~rcmuir] in branch 'dev/trunk'
[ https://svn.apache.org/r1560401 ]

LUCENE-5377: IW.addIndexes(Dir[]) causes silent index corruption

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



RE: JDK 8 Build 124 JDK 7 Update 60 build 03 are available on java.net

2014-01-22 Thread Uwe Schindler
Hi Rory,

 

I installed both versions.

 

Uwe

 

-

Uwe Schindler

H.-H.-Meier-Allee 63, D-28213 Bremen

 http://www.thetaphi.de/ http://www.thetaphi.de

eMail: u...@thetaphi.de

 

From: Rory O'Donnell Oracle, Dublin Ireland [mailto:rory.odonn...@oracle.com] 
Sent: Wednesday, January 22, 2014 12:08 PM
To: Uwe Schindler; Dawid Weiss
Cc: dev@lucene.apache.org; Dalibor Topic; Cecilia Borg; Balchandra Vaidya
Subject: JDK 8 Build 124  JDK 7 Update 60 build 03 are available on java.net

 

Hi Uwe/Dawid,

JDK 8 Build b124 Early Access Build is now available for download 
http://jdk8.java.net/download.html   test.
JDK 7u60 b03 Early Access Build is also available for download  
https://jdk7.java.net/download.html  test. 

See you at FOSDEM next week.

Thanks for your support, Rory 

-- 
Rgds,Rory O'Donnell
Quality Engineering Manager
Oracle EMEA , Dublin, Ireland


[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878805#comment-13878805
 ] 

ASF subversion and git services commented on LUCENE-5377:
-

Commit 1560405 from [~rcmuir] in branch 'dev/trunk'
[ https://svn.apache.org/r1560405 ]

LUCENE-5377: remove dead code

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878808#comment-13878808
 ] 

Robert Muir commented on LUCENE-5377:
-

Give me some time with the backport, i dont want to introduce another bug here. 
Will get this all resolved in a few hours.

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5409) ToParentBlockJoinCollector.getTopGroups returns empty Groups

2014-01-22 Thread Peng Cheng (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878809#comment-13878809
 ] 

Peng Cheng commented on LUCENE-5409:


Hi Michael, I'll be working on the test case this evening.
Thanks a lot for the confirmation. I've read your book.

 ToParentBlockJoinCollector.getTopGroups returns empty Groups
 

 Key: LUCENE-5409
 URL: https://issues.apache.org/jira/browse/LUCENE-5409
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/search
Affects Versions: 4.6
 Environment: Ubuntu 12.04
Reporter: Peng Cheng
Assignee: Michael McCandless
Priority: Critical
 Fix For: 4.7

   Original Estimate: 168h
  Remaining Estimate: 168h

 A bug is observed to cause unstable results returned by the getTopGroups 
 function of class ToParentBlockJoinCollector.
 In the scorer generation stage, the ToParentBlockJoinCollector will 
 automatically rewrite all the associated ToParentBlockJoinQuery (and their 
 subqueries), and save them into its in-memory Look-up table, namely 
 joinQueryID (see enroll() method for detail). Unfortunately, in the 
 getTopGroups method, the new ToParentBlockJoinQuery parameter is not 
 rewritten (at least users are not expected to do so). When the new one is 
 searched in the old lookup table (considering the impact of rewrite() on 
 hashCode()), the lookup will largely fail and eventually end up with a 
 topGroup collection consisting of only empty groups (their hitCounts are 
 guaranteed to be zero).
 An easy fix would be to rewrite the original BlockJoinQuery before invoking 
 getTopGroups method. However, the computational cost of this is not optimal. 
 A better but slightly more complex solution would be to save unrewrited 
 Queries into the lookup table.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: (Lucene-core) Is Query's rewrite method mandated to preserver original Query's hashcode?

2014-01-22 Thread Peng Cheng
opened as https://issues.apache.org/jira/browse/LUCENE-5409


On Tue, Jan 14, 2014 at 5:42 PM, Uwe Schindler u...@thetaphi.de wrote:

 Yes, open an issue!



 Uwe



 -

 Uwe Schindler

 H.-H.-Meier-Allee 63, D-28213 Bremen

 http://www.thetaphi.de

 eMail: u...@thetaphi.de



 *From:* Peng Cheng [mailto:p...@sciencescape.net]
 *Sent:* Tuesday, January 14, 2014 10:41 PM
 *To:* dev@lucene.apache.org
 *Subject:* Re: (Lucene-core) Is Query's rewrite method mandated to
 preserver original Query's hashcode?



 Do you suggest me to open a jira ticket about it? I think its a bug
 considering common interface standard (rewrite should not be exposed to the
 end user), documentation and running efficiency (as you said, rewrite is
 slow).



 On Tue, Jan 14, 2014 at 4:38 AM, Peng Cheng p...@sciencescape.net wrote:

 I see, perhaps the best solution is to put the un-rewritten
 blockJoinQuries into the joinQueryID? The result will be the same. Right
 now the code have very strange behavior if no rewrite is called beforehand,
 it gives empty groups or correct results at random.



 Its a great pleasure to read your reply, never expect someone to respond
 that fast.



 Yours Peng





 On Tue, Jan 14, 2014 at 2:33 AM, Uwe Schindler u...@thetaphi.de wrote:

 Hi Peng,



 rewrite() returns a different query that will definitely not preserve the
 hashCode() or be equals() to the original one or any other rewritten one.
 The reason for this is: A rewritten query is a new query that contains
 information about the index it will be executed on (e.g., it references
 terms from that index), so it **cannot** be equal to the original one. If
 it cannot be equal, also the hashCode should be different. If you execute
 the query on a later stage you have to rewrite the original query again,
 because the index may have changed. And take care: This rewrite may produce
 a completely different query (with a new hashCode again) if the index
 changed in the meantime.



 As there is a workaround (to me it looks, that the code is missing
 documentation), so you can manually rewrite the query before invoking
 getTopGroups() using Searcher#rewrite(query). Why is a hotfix needed?



 Also rewriting the query on every call of getTopGroups is a major overhead
 (most query’s rewrites are very expensice and take as long as the execution
 of the query, e.g. MultiTermQueries), so it should only be done once, not
 on every call. Maybe that’s the reason why it was left out, but it was not
 documented.



 Uwe



 -

 Uwe Schindler

 H.-H.-Meier-Allee 63, D-28213 Bremen

 http://www.thetaphi.de

 eMail: u...@thetaphi.de



 *From:* Peng Cheng [mailto:p...@sciencescape.net]
 *Sent:* Tuesday, January 14, 2014 3:59 AM
 *To:* dev@lucene.apache.org; d...@sciencescape.org


 *Subject:* (Lucene-core) Is Query's rewrite method mandated to preserver
 original Query's hashcode?



 Hi developers,



 I've recently found a few bugs in advanced features of Lucene-core 4.6
 (which is perfectly normal as those features are less likely to be used and
 tested), the most serious one has rendered my ToParentBlockJoinCollector
 close to useless:



 In the scorer generation stage, the ToParentBlockJoinCollector will
 automatically rewrite all the associated ToParentBlockJoinQuery (and their
 subqueries), and save them into its in-memory Look-up table, namely
 joinQueryID (see enroll() method for detail). Unfortunately, in the
 getTopGroups method, the new ToParentBlockJoinQuery parameter is not
 rewritten (at least users are not expected to do so). When the new one is
 searched in the old lookup table (considering the impact of rewrite() on
 hashCode()), the result (namely _slot) will always fail and eventually end
 up with a topGroup collection consisting of only empty groups (their
 hitCounts are guaranteed to be zero).



 I'm not positive about whether rewrite() should preserver Query's
 hashcode, as I've found many counterexamples already. If this is not true,
 then this problem can be solved by rewriting the origianl BlockJoinQuery
 before invoking getTopGroups method. Nevertheless users are not expected to
 do so, therefore I would suggest submitting a hotfix that add the described
 rewrite step.



 If rewrite() must preserver the hashcode, then this is a problem of the
 various rewrite() implementations and fix should be much harder.



 This bug has caused widespread panic in my company and I would like to see
 it fixed ASAP. Please give me some suggestion so I know which hotfix I
 should be working on.



 All the best,



 Yours Peng







RE: (Lucene-core) Is Query's rewrite method mandated to preserver original Query's hashcode?

2014-01-22 Thread Uwe Schindler
Thanks!

 

Uwe

 

-

Uwe Schindler

H.-H.-Meier-Allee 63, D-28213 Bremen

 http://www.thetaphi.de/ http://www.thetaphi.de

eMail: u...@thetaphi.de

 

From: Peng Cheng [mailto:p...@sciencescape.net] 
Sent: Wednesday, January 22, 2014 5:23 PM
To: dev@lucene.apache.org
Subject: Re: (Lucene-core) Is Query's rewrite method mandated to preserver 
original Query's hashcode?

 

opened as https://issues.apache.org/jira/browse/LUCENE-5409

 

On Tue, Jan 14, 2014 at 5:42 PM, Uwe Schindler u...@thetaphi.de wrote:

Yes, open an issue!

 

Uwe

 

-

Uwe Schindler

H.-H.-Meier-Allee 63, D-28213 Bremen

 http://www.thetaphi.de/ http://www.thetaphi.de

eMail: u...@thetaphi.de

 

From: Peng Cheng [mailto:p...@sciencescape.net] 
Sent: Tuesday, January 14, 2014 10:41 PM
To: dev@lucene.apache.org
Subject: Re: (Lucene-core) Is Query's rewrite method mandated to preserver 
original Query's hashcode?

 

Do you suggest me to open a jira ticket about it? I think its a bug considering 
common interface standard (rewrite should not be exposed to the end user), 
documentation and running efficiency (as you said, rewrite is slow).

 

On Tue, Jan 14, 2014 at 4:38 AM, Peng Cheng p...@sciencescape.net wrote:

I see, perhaps the best solution is to put the un-rewritten blockJoinQuries 
into the joinQueryID? The result will be the same. Right now the code have very 
strange behavior if no rewrite is called beforehand, it gives empty groups or 
correct results at random.

 

Its a great pleasure to read your reply, never expect someone to respond that 
fast.

 

Yours Peng

 

 

On Tue, Jan 14, 2014 at 2:33 AM, Uwe Schindler u...@thetaphi.de wrote:

Hi Peng,

 

rewrite() returns a different query that will definitely not preserve the 
hashCode() or be equals() to the original one or any other rewritten one. The 
reason for this is: A rewritten query is a new query that contains information 
about the index it will be executed on (e.g., it references terms from that 
index), so it *cannot* be equal to the original one. If it cannot be equal, 
also the hashCode should be different. If you execute the query on a later 
stage you have to rewrite the original query again, because the index may have 
changed. And take care: This rewrite may produce a completely different query 
(with a new hashCode again) if the index changed in the meantime.

 

As there is a workaround (to me it looks, that the code is missing 
documentation), so you can manually rewrite the query before invoking 
getTopGroups() using Searcher#rewrite(query). Why is a hotfix needed?

 

Also rewriting the query on every call of getTopGroups is a major overhead 
(most query’s rewrites are very expensice and take as long as the execution of 
the query, e.g. MultiTermQueries), so it should only be done once, not on every 
call. Maybe that’s the reason why it was left out, but it was not documented.

 

Uwe

 

-

Uwe Schindler

H.-H.-Meier-Allee 63, D-28213 Bremen

http://www.thetaphi.de http://www.thetaphi.de/ 

eMail: u...@thetaphi.de

 

From: Peng Cheng [mailto:p...@sciencescape.net] 
Sent: Tuesday, January 14, 2014 3:59 AM
To: dev@lucene.apache.org; d...@sciencescape.org


Subject: (Lucene-core) Is Query's rewrite method mandated to preserver original 
Query's hashcode?

 

Hi developers,

 

I've recently found a few bugs in advanced features of Lucene-core 4.6 (which 
is perfectly normal as those features are less likely to be used and tested), 
the most serious one has rendered my ToParentBlockJoinCollector close to 
useless:

 

In the scorer generation stage, the ToParentBlockJoinCollector will 
automatically rewrite all the associated ToParentBlockJoinQuery (and their 
subqueries), and save them into its in-memory Look-up table, namely joinQueryID 
(see enroll() method for detail). Unfortunately, in the getTopGroups method, 
the new ToParentBlockJoinQuery parameter is not rewritten (at least users are 
not expected to do so). When the new one is searched in the old lookup table 
(considering the impact of rewrite() on hashCode()), the result (namely _slot) 
will always fail and eventually end up with a topGroup collection consisting of 
only empty groups (their hitCounts are guaranteed to be zero).

 

I'm not positive about whether rewrite() should preserver Query's hashcode, as 
I've found many counterexamples already. If this is not true, then this problem 
can be solved by rewriting the origianl BlockJoinQuery before invoking 
getTopGroups method. Nevertheless users are not expected to do so, therefore I 
would suggest submitting a hotfix that add the described rewrite step.

 

If rewrite() must preserver the hashcode, then this is a problem of the various 
rewrite() implementations and fix should be much harder.

 

This bug has caused widespread panic in my company and I would like to see it 
fixed ASAP. Please give me some suggestion so I know which hotfix I should be 
working on.

 

All the best,

 

Yours Peng

 

 

 



[jira] [Created] (LUCENE-5413) remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE

2014-01-22 Thread Robert Muir (JIRA)
Robert Muir created LUCENE-5413:
---

 Summary: remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE
 Key: LUCENE-5413
 URL: https://issues.apache.org/jira/browse/LUCENE-5413
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
 Fix For: 4.7


Somehow we have two impersonation booleans in 4.x. I feel we should only have 
one. perhaps some logic in a test somewhere needs OLD_FORMAT  codec == 
Lucene3x instead, but having too booleans is not good.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5594) Enable using extended field types with prefix queries for non-default encoded strings

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878825#comment-13878825
 ] 

ASF subversion and git services commented on SOLR-5594:
---

Commit 1560412 from hoss...@apache.org in branch 'dev/trunk'
[ https://svn.apache.org/r1560412 ]

SOLR-5594: Allow FieldTypes to specify custom PrefixQuery behavior

 Enable using extended field types with prefix queries for non-default encoded 
 strings
 -

 Key: SOLR-5594
 URL: https://issues.apache.org/jira/browse/SOLR-5594
 Project: Solr
  Issue Type: Improvement
  Components: query parsers, Schema and Analysis
Affects Versions: 4.6
Reporter: Anshum Gupta
Assignee: Anshum Gupta
Priority: Minor
 Attachments: SOLR-5594-branch_4x.patch, SOLR-5594.patch, 
 SOLR-5594.patch, SOLR-5594.patch, SOLR-5594.patch, SOLR-5594.patch, 
 SOLR-5594.patch


 Enable users to be able to use prefix query with custom field types with 
 non-default encoding/decoding for queries more easily. e.g. having a custom 
 field work with base64 encoded query strings.
 Currently, the workaround for it is to have the override at getRewriteMethod 
 level. Perhaps having the prefixQuery also use the calling FieldType's 
 readableToIndexed method would work better.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



Re: JDK 8 Build 124 JDK 7 Update 60 build 03 are available on java.net

2014-01-22 Thread Rory O'Donnell Oracle, Dublin Ireland

Thanks Uwe!
On 22/01/2014 16:14, Uwe Schindler wrote:


Hi Rory,

I installed both versions.

Uwe

-

Uwe Schindler

H.-H.-Meier-Allee 63, D-28213 Bremen

http://www.thetaphi.de http://www.thetaphi.de/

eMail: u...@thetaphi.de

*From:*Rory O'Donnell Oracle, Dublin Ireland 
[mailto:rory.odonn...@oracle.com]

*Sent:* Wednesday, January 22, 2014 12:08 PM
*To:* Uwe Schindler; Dawid Weiss
*Cc:* dev@lucene.apache.org; Dalibor Topic; Cecilia Borg; Balchandra 
Vaidya
*Subject:* JDK 8 Build 124  JDK 7 Update 60 build 03 are available on 
java.net


Hi Uwe/Dawid,

JDK 8 Build b124 Early Access Build is now available for download 
http://jdk8.java.net/download.html  test.
JDK 7u60 b03 Early Access Build is also available for download 
https://jdk7.java.net/download.html test.


See you at FOSDEM next week.

Thanks for your support, Rory

--
Rgds,Rory O'Donnell
Quality Engineering Manager
Oracle EMEA , Dublin, Ireland


--
Rgds,Rory O'Donnell
Quality Engineering Manager
Oracle EMEA , Dublin, Ireland



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878849#comment-13878849
 ] 

ASF subversion and git services commented on LUCENE-5377:
-

Commit 1560421 from [~rcmuir] in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1560421 ]

LUCENE-5377: IW.addIndexes(Dir[]) causes silent index corruption

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878850#comment-13878850
 ] 

Robert Muir commented on LUCENE-5377:
-

I'll let Jenkins play with the 4.x backport for just a bit before merging back 
to the release branch.

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5594) Enable using extended field types with prefix queries for non-default encoded strings

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878866#comment-13878866
 ] 

ASF subversion and git services commented on SOLR-5594:
---

Commit 1560432 from hoss...@apache.org in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1560432 ]

SOLR-5594: Allow FieldTypes to specify custom PrefixQuery behavior (merge 
r1560412)

 Enable using extended field types with prefix queries for non-default encoded 
 strings
 -

 Key: SOLR-5594
 URL: https://issues.apache.org/jira/browse/SOLR-5594
 Project: Solr
  Issue Type: Improvement
  Components: query parsers, Schema and Analysis
Affects Versions: 4.6
Reporter: Anshum Gupta
Assignee: Anshum Gupta
Priority: Minor
 Fix For: 5.0, 4.7

 Attachments: SOLR-5594-branch_4x.patch, SOLR-5594.patch, 
 SOLR-5594.patch, SOLR-5594.patch, SOLR-5594.patch, SOLR-5594.patch, 
 SOLR-5594.patch


 Enable users to be able to use prefix query with custom field types with 
 non-default encoding/decoding for queries more easily. e.g. having a custom 
 field work with base64 encoded query strings.
 Currently, the workaround for it is to have the override at getRewriteMethod 
 level. Perhaps having the prefixQuery also use the calling FieldType's 
 readableToIndexed method would work better.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Resolved] (SOLR-5594) Enable using extended field types with prefix queries for non-default encoded strings

2014-01-22 Thread Hoss Man (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-5594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Hoss Man resolved SOLR-5594.


   Resolution: Fixed
Fix Version/s: 4.7
   5.0

Thanks Anshum!

Committed revision 1560412.
Committed revision 1560432.


 Enable using extended field types with prefix queries for non-default encoded 
 strings
 -

 Key: SOLR-5594
 URL: https://issues.apache.org/jira/browse/SOLR-5594
 Project: Solr
  Issue Type: Improvement
  Components: query parsers, Schema and Analysis
Affects Versions: 4.6
Reporter: Anshum Gupta
Assignee: Anshum Gupta
Priority: Minor
 Fix For: 5.0, 4.7

 Attachments: SOLR-5594-branch_4x.patch, SOLR-5594.patch, 
 SOLR-5594.patch, SOLR-5594.patch, SOLR-5594.patch, SOLR-5594.patch, 
 SOLR-5594.patch


 Enable users to be able to use prefix query with custom field types with 
 non-default encoding/decoding for queries more easily. e.g. having a custom 
 field work with base64 encoded query strings.
 Currently, the workaround for it is to have the override at getRewriteMethod 
 level. Perhaps having the prefixQuery also use the calling FieldType's 
 readableToIndexed method would work better.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878885#comment-13878885
 ] 

Robert Muir commented on LUCENE-5377:
-

Here is the 4.x backport for review: http://svn.apache.org/r1560421

I'm gonna run the tests a few times more and merge to 4.6 branch

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-5653) Create a RESTManager to provide REST API endpoints for reconfigurable plugins

2014-01-22 Thread Steve Rowe (JIRA)
Steve Rowe created SOLR-5653:


 Summary: Create a RESTManager to provide REST API endpoints for 
reconfigurable plugins
 Key: SOLR-5653
 URL: https://issues.apache.org/jira/browse/SOLR-5653
 Project: Solr
  Issue Type: Sub-task
Reporter: Steve Rowe


It should be possible to reconfigure of Solr plugins' resources and init params 
without directly editing the serialized schema or {{solrconfig.xml}} (see 
Hoss's arguments about this in the context of the schema, which also apply to 
{{solrconfig.xml}}, in the description of SOLR-4658)

The RESTManager should allow plugins declared in either the schema or in 
{{solrconfig.xml}} to register one or more REST endpoints, one endpoint per 
reconfigurable resource, including init params.  To allow for multiple plugin 
instances, registering plugins will need to provide a handle of some form to 
distinguish the instances.

This RESTManager should also be able to create new instances of plugins that it 
has been configured to allow.  The RESTManager will need its own serialized 
configuration to remember these plugin declarations.

Example endpoints:

* SynonymFilterFactory
** init params: {{/solr/collection1/config/syns/myinstance/options}}
** synonyms resource: {{/solr/collection1/config/syns/myinstance/synonyms-list}}
* /select request handler
** init params: {{/solr/collection1/config/requestHandlers/select/options}}

We should aim for full CRUD over init params and structured resources.  The 
plugins will bear responsibility for handling resource modification requests, 
though we should provide utility methods to make this easy.

However, since we won't be directly modifying the serialized schema and 
{{solrconfig.xml}}, anything configured in those two places can't be 
invalidated by configuration serialized elsewhere.  As a result, it won't be 
possible to remove plugins declared in the serialized schema or 
{{solrconfig.xml}}.  Similarly, any init params declared in either place won't 
be modifiable.  Instead, there should be some form of init param that declares 
that the plugin is reconfigurable, maybe using something like managed - note 
that request handlers already provide a handle - the request handler name - 
and so don't need that to be separately specified:

{code:xml}
requestHandler name=/select class=solr.SearchHandler
   managed/
/requestHandler
{code}

and in the serialized schema - a handle needs to be specified here:

{code:xml}
fieldType name=text_general class=solr.TextField 
positionIncrementGap=100
...
  analyzer type=query
tokenizer class=solr.StandardTokenizerFactory/
filter class=solr.SynonymFilterFactory managed=english-synonyms/
...
{code}

All of the above examples use the existing plugin factory class names, but 
we'll have to create new RESTManager-aware classes to handle registration with 
RESTManager.

Core/collection reloading should not be performed automatically when a REST API 
call is made to one of these RESTManager-mediated REST endpoints, since for 
batched config modifications, that could take way too long.  But maybe 
reloading could be a query parameter to these REST API calls. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-5653) Create a RESTManager to provide REST API endpoints for reconfigurable plugins

2014-01-22 Thread Steve Rowe (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-5653?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Steve Rowe updated SOLR-5653:
-

Description: 
It should be possible to reconfigure Solr plugins' resources and init params 
without directly editing the serialized schema or {{solrconfig.xml}} (see 
Hoss's arguments about this in the context of the schema, which also apply to 
{{solrconfig.xml}}, in the description of SOLR-4658)

The RESTManager should allow plugins declared in either the schema or in 
{{solrconfig.xml}} to register one or more REST endpoints, one endpoint per 
reconfigurable resource, including init params.  To allow for multiple plugin 
instances, registering plugins will need to provide a handle of some form to 
distinguish the instances.

This RESTManager should also be able to create new instances of plugins that it 
has been configured to allow.  The RESTManager will need its own serialized 
configuration to remember these plugin declarations.

Example endpoints:

* SynonymFilterFactory
** init params: {{/solr/collection1/config/syns/myinstance/options}}
** synonyms resource: {{/solr/collection1/config/syns/myinstance/synonyms-list}}
* /select request handler
** init params: {{/solr/collection1/config/requestHandlers/select/options}}

We should aim for full CRUD over init params and structured resources.  The 
plugins will bear responsibility for handling resource modification requests, 
though we should provide utility methods to make this easy.

However, since we won't be directly modifying the serialized schema and 
{{solrconfig.xml}}, anything configured in those two places can't be 
invalidated by configuration serialized elsewhere.  As a result, it won't be 
possible to remove plugins declared in the serialized schema or 
{{solrconfig.xml}}.  Similarly, any init params declared in either place won't 
be modifiable.  Instead, there should be some form of init param that declares 
that the plugin is reconfigurable, maybe using something like managed - note 
that request handlers already provide a handle - the request handler name - 
and so don't need that to be separately specified:

{code:xml}
requestHandler name=/select class=solr.SearchHandler
   managed/
/requestHandler
{code}

and in the serialized schema - a handle needs to be specified here:

{code:xml}
fieldType name=text_general class=solr.TextField 
positionIncrementGap=100
...
  analyzer type=query
tokenizer class=solr.StandardTokenizerFactory/
filter class=solr.SynonymFilterFactory managed=english-synonyms/
...
{code}

All of the above examples use the existing plugin factory class names, but 
we'll have to create new RESTManager-aware classes to handle registration with 
RESTManager.

Core/collection reloading should not be performed automatically when a REST API 
call is made to one of these RESTManager-mediated REST endpoints, since for 
batched config modifications, that could take way too long.  But maybe 
reloading could be a query parameter to these REST API calls. 

  was:
It should be possible to reconfigure of Solr plugins' resources and init params 
without directly editing the serialized schema or {{solrconfig.xml}} (see 
Hoss's arguments about this in the context of the schema, which also apply to 
{{solrconfig.xml}}, in the description of SOLR-4658)

The RESTManager should allow plugins declared in either the schema or in 
{{solrconfig.xml}} to register one or more REST endpoints, one endpoint per 
reconfigurable resource, including init params.  To allow for multiple plugin 
instances, registering plugins will need to provide a handle of some form to 
distinguish the instances.

This RESTManager should also be able to create new instances of plugins that it 
has been configured to allow.  The RESTManager will need its own serialized 
configuration to remember these plugin declarations.

Example endpoints:

* SynonymFilterFactory
** init params: {{/solr/collection1/config/syns/myinstance/options}}
** synonyms resource: {{/solr/collection1/config/syns/myinstance/synonyms-list}}
* /select request handler
** init params: {{/solr/collection1/config/requestHandlers/select/options}}

We should aim for full CRUD over init params and structured resources.  The 
plugins will bear responsibility for handling resource modification requests, 
though we should provide utility methods to make this easy.

However, since we won't be directly modifying the serialized schema and 
{{solrconfig.xml}}, anything configured in those two places can't be 
invalidated by configuration serialized elsewhere.  As a result, it won't be 
possible to remove plugins declared in the serialized schema or 
{{solrconfig.xml}}.  Similarly, any init params declared in either place won't 
be modifiable.  Instead, there should be some form of init param that declares 
that the plugin is reconfigurable, maybe using something like managed - note 
that request handlers already 

[jira] [Commented] (SOLR-5641) REST API to modify request handlers

2014-01-22 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878897#comment-13878897
 ] 

Steve Rowe commented on SOLR-5641:
--

The patch on this issue assumes that {{solrconfig.xml}} is directly modifiable 
via REST API, but it shouldn't be - we'll need to make this work with the 
RESTManager described in SOLR-5653.

 REST API to modify request handlers
 ---

 Key: SOLR-5641
 URL: https://issues.apache.org/jira/browse/SOLR-5641
 Project: Solr
  Issue Type: Sub-task
Reporter: Willy Solaligue
 Attachments: SOLR-5641.patch


 There should be a REST API to allow modify request handlers.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-5654) Create a synonym filter factory that is (re)configurable, and capable of reporting its configuration, via REST API

2014-01-22 Thread Steve Rowe (JIRA)
Steve Rowe created SOLR-5654:


 Summary: Create a synonym filter factory that is (re)configurable, 
and capable of reporting its configuration, via REST API
 Key: SOLR-5654
 URL: https://issues.apache.org/jira/browse/SOLR-5654
 Project: Solr
  Issue Type: Sub-task
  Components: Schema and Analysis
Reporter: Steve Rowe


A synonym filter factory could be (re)configurable via REST API by registering 
with the RESTManager described in SOLR-5653, and then responding to REST API 
calls to modify its init params and its synonyms resource file.

Read-only (GET) REST API calls should also be provided, both for init params 
and the synonyms resource file.

It should be possible to add/remove/modify one or more entries in the synonyms 
resource file.

We should probably use JSON for the REST request body, as is done in the Schema 
REST API methods.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3069) Lucene should have an entirely memory resident term dictionary

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878917#comment-13878917
 ] 

Michael McCandless commented on LUCENE-3069:


Thanks Mark.

Yes, you can see FishEye's comments under Source and and also the All tab.

Our (svnpubsub) commit messages are correct here (they say Han Jiang), but 
the FishEye comments are incorrect (they say Han Lee).

 Lucene should have an entirely memory resident term dictionary
 --

 Key: LUCENE-3069
 URL: https://issues.apache.org/jira/browse/LUCENE-3069
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index, core/search
Affects Versions: 4.0-ALPHA
Reporter: Simon Willnauer
Assignee: Han Jiang
  Labels: gsoc2013
 Fix For: 4.7

 Attachments: LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, df-ttf-estimate.txt, 
 example.png


 FST based TermDictionary has been a great improvement yet it still uses a 
 delta codec file for scanning to terms. Some environments have enough memory 
 available to keep the entire FST based term dict in memory. We should add a 
 TermDictionary implementation that encodes all needed information for each 
 term into the FST (custom fst.Output) and builds a FST from the entire term 
 not just the delta.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-3069) Lucene should have an entirely memory resident term dictionary

2014-01-22 Thread Mark Miller (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-3069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878932#comment-13878932
 ] 

Mark Miller commented on LUCENE-3069:
-

Ah, sorry. I'm usually in Comments view and I took Note that all the commit 
messages at the end of this issue  as referring to the ASF subversion and git 
services commit tags. Given past experience, I don't trust the fisheye or the 
like integrations. We might wake up one day and they will just be gone along 
with their history...

 Lucene should have an entirely memory resident term dictionary
 --

 Key: LUCENE-3069
 URL: https://issues.apache.org/jira/browse/LUCENE-3069
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/index, core/search
Affects Versions: 4.0-ALPHA
Reporter: Simon Willnauer
Assignee: Han Jiang
  Labels: gsoc2013
 Fix For: 4.7

 Attachments: LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, 
 LUCENE-3069.patch, LUCENE-3069.patch, LUCENE-3069.patch, df-ttf-estimate.txt, 
 example.png


 FST based TermDictionary has been a great improvement yet it still uses a 
 delta codec file for scanning to terms. Some environments have enough memory 
 available to keep the entire FST based term dict in memory. We should add a 
 TermDictionary implementation that encodes all needed information for each 
 term into the FST (custom fst.Output) and builds a FST from the entire term 
 not just the delta.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-5656) When using HDFS, the Overseer should have the ability to reassign the cores from failed nodes to running nodes.

2014-01-22 Thread Mark Miller (JIRA)
Mark Miller created SOLR-5656:
-

 Summary: When using HDFS, the Overseer should have the ability to 
reassign the cores from failed nodes to running nodes.
 Key: SOLR-5656
 URL: https://issues.apache.org/jira/browse/SOLR-5656
 Project: Solr
  Issue Type: New Feature
Reporter: Mark Miller
Assignee: Mark Miller


Given that the index and transaction logs are in hdfs, it's simple for 
surviving hardware to take over serving cores for failed hardware.

There are some tricky issues around having the Overseer handle this for you, 
but seems a simple first pass is not too difficult.

This will add another alternative to replicating both with hdfs and solr.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-5526) Query parser extends standard cause NPE on Solr startup

2014-01-22 Thread Vitaliy Zhovtyuk (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-5526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vitaliy Zhovtyuk updated SOLR-5526:
---

Attachment: SOLR-5526.patch

minor fixes:
{quote} QParserPlugin.standardPlugins's javadoc needs to point out the 
importance of these names being static  final so people aren't surprised by 
these  tests when new parsers are added in the future.{quote}
Added relevant javadocs.
{quote}
TestStandardQParsers is doing something sufficiently odd that it really needs 
some javadocs explaining why it exists (ie: mention the class loading problems 
associated if there is a standardPlugin that has a non-static, non-final name, 
with an {{@see}} this issue, {{@see QParserPlugin.standardPlugins}}, 
etc...){quote}
Added javadocs
{quote} we should probably make TestStandardQParsers assert that the static  
final name it finds in each class matches the name associated in 
QParserPlugin.standardPlugins.{quote}
Thats actually what TestStandardQParsers does. Unit test takes classes 
registered in QParserPlugin.standardPlugins and ensure that each class has 
final and static NAME field.
Added relevant javadocs to TestStandardQParsers.
{quote} solrconfig-query-parser-init.xml has a cut  paste comment referring to 
an unrelated test.{quote}
Fixed, added relevant comments.

{quote} TestInitQParser should have a javadoc comment explaining what the point 
of the test is{quote}
Fixed, added relevant comments.

{quote}TestInitQParser should actaully do a query using the fail parser 
registered in the config, to help future-proof us against someone unwittingly 
changing the test config in a way that defeats the point of the test.{quote}
This test alerady does query using defType=fail, so i expect this registered 
QParser used and return result.

 Query parser extends standard cause NPE on Solr startup
 ---

 Key: SOLR-5526
 URL: https://issues.apache.org/jira/browse/SOLR-5526
 Project: Solr
  Issue Type: Bug
  Components: query parsers
Affects Versions: 4.5.1, 4.6, 5.0
 Environment: Linux, Java 1.7.0_45
Reporter: Nikolay Khitrin
Priority: Blocker
 Attachments: NPE_load_trace, SOLR-5526-final-names.patch, 
 SOLR-5526-final-names.patch, SOLR-5526-tests.patch, SOLR-5526.patch, 
 SOLR-5526.patch, SOLR-5526.patch


 Adding any custom query parser extending standard one with non-final field 
 {{NAME}} lead to messy {{NullPointerException}} during Solr startup.
 Definition of standard parsers is located in  
 {{QParserPlugin.standardPlugins}} static array. The array contains names from 
 static {{NAME}} fields and classes for each plugin.   
 But all of listed parsers are derived from {{QParserPlugin}}, so we have 
 circular dependency of static fields.
 Normally, class loader start initializing {{QParserPlugin}} before all listed 
 plugins in {{SolrCore.initQParsers}}, and array elements referenced to 
 {{NAME}} plugins' fields are filled properly.
 Custom parsers are instantiated before standard parsers. And when we subclass 
 plugin with non-final {{NAME}} field and add it to Solr via 
 {{solrconfig.xml}}, class loader start loading our class before 
 {{QParserPlugin}}. Because {{QParserPlugin}} is a superclass for plugin, it 
 must be initialized before subclasses, and static dereferencing cause null 
 elements in {{standardPlugins}} array because it filled before {{NAME}} field 
 of loading plugin's superclass.
 How to reproduce:
 # Checkout Solr (trunk or stable)
 # Add the following line to solr/example/solr/collection1/conf/solrconfig.xml
   {{queryParser name=fail class=solr.search.LuceneQParserPlugin/}}
 # Call ant run-example in solr folder
 Possible workarounds:
 * dev-workaround: add {{int workaround = 
 QParserPlugin.standardPlugins.length;}} as a first line to
   {{SolrCore.initQParsers}}
 * user-workaround: add plugin with final {{NAME}} field (edismax) to 
 solrconfig.xml  before subclasses of standard plugins. 
   {{queryParser name=workaround 
 class=solr.search.ExtendedDismaxQParserPlugin/}}
   
 Possible fix:
 Move {{standardPlugins}} to new final class to break circular dependency.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-5655) Create a stopword filter factory that is (re)configurable, and capable of reporting its configuration, via REST API

2014-01-22 Thread Steve Rowe (JIRA)
Steve Rowe created SOLR-5655:


 Summary: Create a stopword filter factory that is 
(re)configurable, and capable of reporting its configuration, via REST API
 Key: SOLR-5655
 URL: https://issues.apache.org/jira/browse/SOLR-5655
 Project: Solr
  Issue Type: Sub-task
  Components: Schema and Analysis
Reporter: Steve Rowe


A stopword filter factory could be (re)configurable via REST API by registering 
with the RESTManager described in SOLR-5653, and then responding to REST API 
calls to modify its init params and its stopwords resource file.

Read-only (GET) REST API calls should also be provided, both for init params 
and the stopwords resource file.

It should be possible to add/remove one or more entries in the stopwords 
resource file.

We should probably use JSON for the REST request body, as is done in the Schema 
REST API methods.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Created] (SOLR-5657) When a SolrCore starts on HDFS, it should gracefully handle HDFS being in safe mode.

2014-01-22 Thread Mark Miller (JIRA)
Mark Miller created SOLR-5657:
-

 Summary: When a SolrCore starts on HDFS, it should gracefully 
handle HDFS being in safe mode.
 Key: SOLR-5657
 URL: https://issues.apache.org/jira/browse/SOLR-5657
 Project: Solr
  Issue Type: Bug
Reporter: Mark Miller
Assignee: Mark Miller


Currently, things just fail. Ideally, the SolrCore would wait until safe mode 
was exited and resume loading.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5654) Create a synonym filter factory that is (re)configurable, and capable of reporting its configuration, via REST API

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878957#comment-13878957
 ] 

Robert Muir commented on SOLR-5654:
---

Why do we need new factories for synonyms and stopwords? I dont understand this 
design at all, this seems like duplication of all the analysis factories!

Instead, just pass a different resourceloader or something to the existing ones!

 Create a synonym filter factory that is (re)configurable, and capable of 
 reporting its configuration, via REST API
 --

 Key: SOLR-5654
 URL: https://issues.apache.org/jira/browse/SOLR-5654
 Project: Solr
  Issue Type: Sub-task
  Components: Schema and Analysis
Reporter: Steve Rowe

 A synonym filter factory could be (re)configurable via REST API by 
 registering with the RESTManager described in SOLR-5653, and then responding 
 to REST API calls to modify its init params and its synonyms resource file.
 Read-only (GET) REST API calls should also be provided, both for init params 
 and the synonyms resource file.
 It should be possible to add/remove/modify one or more entries in the 
 synonyms resource file.
 We should probably use JSON for the REST request body, as is done in the 
 Schema REST API methods.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5654) Create a synonym filter factory that is (re)configurable, and capable of reporting its configuration, via REST API

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13878969#comment-13878969
 ] 

Michael McCandless commented on SOLR-5654:
--

In Lucene server (LUCENE-5376) I just created a custom ResourceLoader to pull 
stopwords that were specified (inlined) in the JSON when the field is 
registered.

But separately I think it's ... dangerous to allow changing stopwords / syns on 
an already created field / running index?  Ie, such changes won't fully take 
effect until you re-index all content ... I know it's convenient to be able to 
make such changes, but it's also trappy.

 Create a synonym filter factory that is (re)configurable, and capable of 
 reporting its configuration, via REST API
 --

 Key: SOLR-5654
 URL: https://issues.apache.org/jira/browse/SOLR-5654
 Project: Solr
  Issue Type: Sub-task
  Components: Schema and Analysis
Reporter: Steve Rowe

 A synonym filter factory could be (re)configurable via REST API by 
 registering with the RESTManager described in SOLR-5653, and then responding 
 to REST API calls to modify its init params and its synonyms resource file.
 Read-only (GET) REST API calls should also be provided, both for init params 
 and the synonyms resource file.
 It should be possible to add/remove/modify one or more entries in the 
 synonyms resource file.
 We should probably use JSON for the REST request body, as is done in the 
 Schema REST API methods.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5654) Create a synonym filter factory that is (re)configurable, and capable of reporting its configuration, via REST API

2014-01-22 Thread Steve Rowe (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879015#comment-13879015
 ] 

Steve Rowe commented on SOLR-5654:
--

{quote}
Why do we need new factories for synonyms and stopwords? I dont understand this 
design at all, this seems like duplication of all the analysis factories!

Instead, just pass a different resourceloader or something to the existing ones!
{quote}

The point of this issue is to provide REST API methods to interrogate and 
modify/persist synonym config and mappings.  A different resourceloader would 
only allow for this info to be pulled from an alternate persistence store - it 
wouldn't do anything for the REST API and persistence part.

{quote}
But separately I think it's ... dangerous to allow changing stopwords / syns on 
an already created field / running index? Ie, such changes won't fully take 
effect until you re-index all content ... I know it's convenient to be able to 
make such changes, but it's also trappy.
{quote}

That's already true today for people who manually modify config and 
restart/reload.  I guess your point is that we shouldn't be making this easier. 
 I disagree: the point of the issue is to allow people more fine-grained 
control over an already-existing freedom.  I think documentation warning people 
about the danger of modifying config with an existing index is sufficient to 
help people who want this capability avoid creating indexes with mixed analysis 
config.


 Create a synonym filter factory that is (re)configurable, and capable of 
 reporting its configuration, via REST API
 --

 Key: SOLR-5654
 URL: https://issues.apache.org/jira/browse/SOLR-5654
 Project: Solr
  Issue Type: Sub-task
  Components: Schema and Analysis
Reporter: Steve Rowe

 A synonym filter factory could be (re)configurable via REST API by 
 registering with the RESTManager described in SOLR-5653, and then responding 
 to REST API calls to modify its init params and its synonyms resource file.
 Read-only (GET) REST API calls should also be provided, both for init params 
 and the synonyms resource file.
 It should be possible to add/remove/modify one or more entries in the 
 synonyms resource file.
 We should probably use JSON for the REST request body, as is done in the 
 Schema REST API methods.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Resolved] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir resolved LUCENE-5377.
-

Resolution: Fixed

Thank you very much Littlestar. What a terrible bug!

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5377) IW.addIndexes(Dir[]) causes silent index corruption

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879024#comment-13879024
 ] 

ASF subversion and git services commented on LUCENE-5377:
-

Commit 1560477 from [~rcmuir] in branch 'dev/branches/lucene_solr_4_6'
[ https://svn.apache.org/r1560477 ]

LUCENE-5377: IW.addIndexes(Dir[]) causes silent index corruption

 IW.addIndexes(Dir[]) causes silent index corruption
 ---

 Key: LUCENE-5377
 URL: https://issues.apache.org/jira/browse/LUCENE-5377
 Project: Lucene - Core
  Issue Type: Bug
  Components: core/index
Affects Versions: 4.6
 Environment: windows/linux
Reporter: Littlestar
Assignee: Robert Muir
 Fix For: 5.0, 4.7, 4.6.1

 Attachments: LUCENE-5377.patch, LUCENE-5377.patch, TestCore_45.java, 
 TestCore_46.java


 my old facet index create by Lucene version=4.2
 use indexChecker ok.
 now I upgrade to Lucene 4.6 and put some new records to index.
 then reopen index, some files in indexdir missing
 no .si files.
 I debug into it,  new version format of segments.gen(segments_N) record bad 
 segments info.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Assigned] (LUCENE-5228) IndexWriter.addIndexes copies raw files but acquires no locks

2014-01-22 Thread Michael McCandless (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless reassigned LUCENE-5228:
--

Assignee: Michael McCandless

 IndexWriter.addIndexes copies raw files but acquires no locks
 -

 Key: LUCENE-5228
 URL: https://issues.apache.org/jira/browse/LUCENE-5228
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
Assignee: Michael McCandless
 Attachments: LUCENE-5228.patch


 I see stuff like: merge problem with lucene 3 and 4 indices (from solr 
 users list), and cannot even think how to respond to these users because so 
 many things can go wrong with IndexWriter.addIndexes(Directory)
 it currently has in its javadocs:
 NOTE: the index in each Directory must not be changed (opened by a writer) 
 while this method is running. This method does not acquire a write lock in 
 each input Directory, so it is up to the caller to enforce this. 
 This method should be acquiring locks: its copying *RAW FILES*. Otherwise we 
 should remove it. If someone doesnt like that, or is mad because its 10ns 
 slower, they can use NoLockFactory. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5228) IndexWriter.addIndexes copies raw files but acquires no locks

2014-01-22 Thread Michael McCandless (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless updated LUCENE-5228:
---

Attachment: LUCENE-5228.patch

How about something like this?

I fixed Lock to implement Closeable, and just added the acquire/close inside 
IW.addIndexes(Dir[]).

 IndexWriter.addIndexes copies raw files but acquires no locks
 -

 Key: LUCENE-5228
 URL: https://issues.apache.org/jira/browse/LUCENE-5228
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
Assignee: Michael McCandless
 Attachments: LUCENE-5228.patch


 I see stuff like: merge problem with lucene 3 and 4 indices (from solr 
 users list), and cannot even think how to respond to these users because so 
 many things can go wrong with IndexWriter.addIndexes(Directory)
 it currently has in its javadocs:
 NOTE: the index in each Directory must not be changed (opened by a writer) 
 while this method is running. This method does not acquire a write lock in 
 each input Directory, so it is up to the caller to enforce this. 
 This method should be acquiring locks: its copying *RAW FILES*. Otherwise we 
 should remove it. If someone doesnt like that, or is mad because its 10ns 
 slower, they can use NoLockFactory. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5413) remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE

2014-01-22 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-5413:


Attachment: LUCENE-5413.patch

patch: this only applies to branch_4x (trunk is not impacted, as it does not 
support 3.x indexes)

 remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE
 -

 Key: LUCENE-5413
 URL: https://issues.apache.org/jira/browse/LUCENE-5413
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
 Fix For: 4.7

 Attachments: LUCENE-5413.patch


 Somehow we have two impersonation booleans in 4.x. I feel we should only have 
 one. perhaps some logic in a test somewhere needs OLD_FORMAT  codec == 
 Lucene3x instead, but having too booleans is not good.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5228) IndexWriter.addIndexes copies raw files but acquires no locks

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879184#comment-13879184
 ] 

Robert Muir commented on LUCENE-5228:
-

Very simple approach! I like it.

 IndexWriter.addIndexes copies raw files but acquires no locks
 -

 Key: LUCENE-5228
 URL: https://issues.apache.org/jira/browse/LUCENE-5228
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
Assignee: Michael McCandless
 Attachments: LUCENE-5228.patch


 I see stuff like: merge problem with lucene 3 and 4 indices (from solr 
 users list), and cannot even think how to respond to these users because so 
 many things can go wrong with IndexWriter.addIndexes(Directory)
 it currently has in its javadocs:
 NOTE: the index in each Directory must not be changed (opened by a writer) 
 while this method is running. This method does not acquire a write lock in 
 each input Directory, so it is up to the caller to enforce this. 
 This method should be acquiring locks: its copying *RAW FILES*. Otherwise we 
 should remove it. If someone doesnt like that, or is mad because its 10ns 
 slower, they can use NoLockFactory. 



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5413) remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE

2014-01-22 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir updated LUCENE-5413:


Attachment: LUCENE-5413.patch

fix a couple test bugs in the first patch

 remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE
 -

 Key: LUCENE-5413
 URL: https://issues.apache.org/jira/browse/LUCENE-5413
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
 Fix For: 4.7

 Attachments: LUCENE-5413.patch, LUCENE-5413.patch


 Somehow we have two impersonation booleans in 4.x. I feel we should only have 
 one. perhaps some logic in a test somewhere needs OLD_FORMAT  codec == 
 Lucene3x instead, but having too booleans is not good.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5413) remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879200#comment-13879200
 ] 

Michael McCandless commented on LUCENE-5413:


+1 to the last patch!

 remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE
 -

 Key: LUCENE-5413
 URL: https://issues.apache.org/jira/browse/LUCENE-5413
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
 Fix For: 4.7

 Attachments: LUCENE-5413.patch, LUCENE-5413.patch


 Somehow we have two impersonation booleans in 4.x. I feel we should only have 
 one. perhaps some logic in a test somewhere needs OLD_FORMAT  codec == 
 Lucene3x instead, but having too booleans is not good.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5654) Create a synonym filter factory that is (re)configurable, and capable of reporting its configuration, via REST API

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879204#comment-13879204
 ] 

Robert Muir commented on SOLR-5654:
---

{quote}
The point of this issue is to provide REST API methods to interrogate and 
modify/persist synonym config and mappings. A different resourceloader would 
only allow for this info to be pulled from an alternate persistence store - it 
wouldn't do anything for the REST API and persistence part.
{quote}

It wouldnt prevent it either.

Reworded: why is a custom factory necessary?

 Create a synonym filter factory that is (re)configurable, and capable of 
 reporting its configuration, via REST API
 --

 Key: SOLR-5654
 URL: https://issues.apache.org/jira/browse/SOLR-5654
 Project: Solr
  Issue Type: Sub-task
  Components: Schema and Analysis
Reporter: Steve Rowe

 A synonym filter factory could be (re)configurable via REST API by 
 registering with the RESTManager described in SOLR-5653, and then responding 
 to REST API calls to modify its init params and its synonyms resource file.
 Read-only (GET) REST API calls should also be provided, both for init params 
 and the synonyms resource file.
 It should be possible to add/remove/modify one or more entries in the 
 synonyms resource file.
 We should probably use JSON for the REST request body, as is done in the 
 Schema REST API methods.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Resolved] (LUCENE-5413) remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE

2014-01-22 Thread Robert Muir (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robert Muir resolved LUCENE-5413.
-

Resolution: Fixed

 remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE
 -

 Key: LUCENE-5413
 URL: https://issues.apache.org/jira/browse/LUCENE-5413
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
 Fix For: 4.7

 Attachments: LUCENE-5413.patch, LUCENE-5413.patch


 Somehow we have two impersonation booleans in 4.x. I feel we should only have 
 one. perhaps some logic in a test somewhere needs OLD_FORMAT  codec == 
 Lucene3x instead, but having too booleans is not good.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5413) remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5413?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879207#comment-13879207
 ] 

ASF subversion and git services commented on LUCENE-5413:
-

Commit 1560519 from [~rcmuir] in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1560519 ]

LUCENE-5413: remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE

 remove LuceneTestCase.PREFLEX_IMPERSONATION_IS_ACTIVE
 -

 Key: LUCENE-5413
 URL: https://issues.apache.org/jira/browse/LUCENE-5413
 Project: Lucene - Core
  Issue Type: Bug
Reporter: Robert Muir
 Fix For: 4.7

 Attachments: LUCENE-5413.patch, LUCENE-5413.patch


 Somehow we have two impersonation booleans in 4.x. I feel we should only have 
 one. perhaps some logic in a test somewhere needs OLD_FORMAT  codec == 
 Lucene3x instead, but having too booleans is not good.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-5657) When a SolrCore starts on HDFS, it should gracefully handle HDFS being in safe mode.

2014-01-22 Thread Mark Miller (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mark Miller updated SOLR-5657:
--

Attachment: SOLR-5657.patch

 When a SolrCore starts on HDFS, it should gracefully handle HDFS being in 
 safe mode.
 

 Key: SOLR-5657
 URL: https://issues.apache.org/jira/browse/SOLR-5657
 Project: Solr
  Issue Type: Bug
Reporter: Mark Miller
Assignee: Mark Miller
 Fix For: 5.0, 4.7

 Attachments: SOLR-5657.patch


 Currently, things just fail. Ideally, the SolrCore would wait until safe mode 
 was exited and resume loading.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5654) Create a synonym filter factory that is (re)configurable, and capable of reporting its configuration, via REST API

2014-01-22 Thread Michael McCandless (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879209#comment-13879209
 ] 

Michael McCandless commented on SOLR-5654:
--

bq. I guess your point is that we shouldn't be making this easier. 

Right.

bq. I disagree: the point of the issue is to allow people more fine-grained 
control over an already-existing freedom.

Just because there's already an existing (not necessarily good) freedom doesn't 
mean it must be made easier.  Optimize is an existing freedom :)

Does Solr at least record somewhere that full re-index required?  So the user 
(if s/he knows to look in the right place on the admin UI) is informed that 
inconsistent results might be because they didn't fully re-index yet...

 Create a synonym filter factory that is (re)configurable, and capable of 
 reporting its configuration, via REST API
 --

 Key: SOLR-5654
 URL: https://issues.apache.org/jira/browse/SOLR-5654
 Project: Solr
  Issue Type: Sub-task
  Components: Schema and Analysis
Reporter: Steve Rowe

 A synonym filter factory could be (re)configurable via REST API by 
 registering with the RESTManager described in SOLR-5653, and then responding 
 to REST API calls to modify its init params and its synonyms resource file.
 Read-only (GET) REST API calls should also be provided, both for init params 
 and the synonyms resource file.
 It should be possible to add/remove/modify one or more entries in the 
 synonyms resource file.
 We should probably use JSON for the REST request body, as is done in the 
 Schema REST API methods.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-5657) When a SolrCore starts on HDFS, it should gracefully handle HDFS being in safe mode.

2014-01-22 Thread Mark Miller (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mark Miller updated SOLR-5657:
--

Fix Version/s: 4.7
   5.0

 When a SolrCore starts on HDFS, it should gracefully handle HDFS being in 
 safe mode.
 

 Key: SOLR-5657
 URL: https://issues.apache.org/jira/browse/SOLR-5657
 Project: Solr
  Issue Type: Bug
Reporter: Mark Miller
Assignee: Mark Miller
 Fix For: 5.0, 4.7

 Attachments: SOLR-5657.patch


 Currently, things just fail. Ideally, the SolrCore would wait until safe mode 
 was exited and resume loading.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5657) When a SolrCore starts on HDFS, it should gracefully handle HDFS being in safe mode.

2014-01-22 Thread Mark Miller (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879210#comment-13879210
 ] 

Mark Miller commented on SOLR-5657:
---

This significantly improves how the various hdfs components handle running into 
safe mode on startup. Also includes random testing that hits HdfsLockFactory, 
HdfsDirectory, and HdfsUpdateLog.

 When a SolrCore starts on HDFS, it should gracefully handle HDFS being in 
 safe mode.
 

 Key: SOLR-5657
 URL: https://issues.apache.org/jira/browse/SOLR-5657
 Project: Solr
  Issue Type: Bug
Reporter: Mark Miller
Assignee: Mark Miller
 Fix For: 5.0, 4.7

 Attachments: SOLR-5657.patch


 Currently, things just fail. Ideally, the SolrCore would wait until safe mode 
 was exited and resume loading.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (LUCENE-5410) Add fuzziness support to SimpleQueryParser

2014-01-22 Thread Lee Hinman (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-5410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lee Hinman updated LUCENE-5410:
---

Attachment: LUCENE-5410.patch

Attached patch that supports both edit distance for single terms and slop for 
phrases.

I chose to add only a single flag ({{FUZZINESS_OPERATOR}}) for 
enabling/disabling both behaviors, but it would be easy to separate them if 
desired.

 Add fuzziness support to SimpleQueryParser
 --

 Key: LUCENE-5410
 URL: https://issues.apache.org/jira/browse/LUCENE-5410
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/queryparser
Affects Versions: 4.7
Reporter: Lee Hinman
Priority: Minor
 Attachments: LUCENE-5410.patch

   Original Estimate: 168h
  Remaining Estimate: 168h

 It would be nice to add fuzzy query support to the {{SimpleQueryParser}} so 
 that:
 {{foo~2}}
 generates a {{FuzzyQuery}} with an max edit distance of 2 and:
 {{foo bar~2}}
 generates a {{PhraseQuery}} with a slop of 2.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5657) When a SolrCore starts on HDFS, it should gracefully handle HDFS being in safe mode.

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879288#comment-13879288
 ] 

ASF subversion and git services commented on SOLR-5657:
---

Commit 1560553 from [~markrmil...@gmail.com] in branch 'dev/trunk'
[ https://svn.apache.org/r1560553 ]

SOLR-5657: When a SolrCore starts on HDFS, it should gracefully handle HDFS 
being in safe mode.

 When a SolrCore starts on HDFS, it should gracefully handle HDFS being in 
 safe mode.
 

 Key: SOLR-5657
 URL: https://issues.apache.org/jira/browse/SOLR-5657
 Project: Solr
  Issue Type: Bug
Reporter: Mark Miller
Assignee: Mark Miller
 Fix For: 5.0, 4.7

 Attachments: SOLR-5657.patch


 Currently, things just fail. Ideally, the SolrCore would wait until safe mode 
 was exited and resume loading.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-5657) When a SolrCore starts on HDFS, it should gracefully handle HDFS being in safe mode.

2014-01-22 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879289#comment-13879289
 ] 

ASF subversion and git services commented on SOLR-5657:
---

Commit 1560554 from [~markrmil...@gmail.com] in branch 'dev/branches/branch_4x'
[ https://svn.apache.org/r1560554 ]

SOLR-5657: When a SolrCore starts on HDFS, it should gracefully handle HDFS 
being in safe mode.

 When a SolrCore starts on HDFS, it should gracefully handle HDFS being in 
 safe mode.
 

 Key: SOLR-5657
 URL: https://issues.apache.org/jira/browse/SOLR-5657
 Project: Solr
  Issue Type: Bug
Reporter: Mark Miller
Assignee: Mark Miller
 Fix For: 5.0, 4.7

 Attachments: SOLR-5657.patch


 Currently, things just fail. Ideally, the SolrCore would wait until safe mode 
 was exited and resume loading.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (LUCENE-5410) Add fuzziness support to SimpleQueryParser

2014-01-22 Thread Robert Muir (JIRA)

[ 
https://issues.apache.org/jira/browse/LUCENE-5410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13879319#comment-13879319
 ] 

Robert Muir commented on LUCENE-5410:
-

Hi Lee, in general this seems like a good approach.

Can we avoid the changes to QueryBuilder? The purpose of QueryBuilder is to 
interact with the analysis chain, and thats not typically how fuzzy queries are 
handled (for the same reason QueryBuilder has no prefix logic, so I think the 
case should be handled in a similar fashion). This way we can avoid adding 
methods like createFuzzyBooleanQuery, newTermOrFuzzyQuery and so on.

As far as the parser itself, I don't like having both newPhraseQuery(String) 
and newPhraseQuery(String, int). Can we just nuke the first one and have the 
latter instead? Exact phrase queries can just past 0 here.

Can we add the tilde operator to the list in testRandomQueries2 ?



 Add fuzziness support to SimpleQueryParser
 --

 Key: LUCENE-5410
 URL: https://issues.apache.org/jira/browse/LUCENE-5410
 Project: Lucene - Core
  Issue Type: Improvement
  Components: core/queryparser
Affects Versions: 4.7
Reporter: Lee Hinman
Priority: Minor
 Attachments: LUCENE-5410.patch

   Original Estimate: 168h
  Remaining Estimate: 168h

 It would be nice to add fuzzy query support to the {{SimpleQueryParser}} so 
 that:
 {{foo~2}}
 generates a {{FuzzyQuery}} with an max edit distance of 2 and:
 {{foo bar~2}}
 generates a {{PhraseQuery}} with a slop of 2.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[JENKINS] Lucene-Artifacts-4.x - Build # 534 - Failure

2014-01-22 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Artifacts-4.x/534/

No tests ran.

Build Log:
[...truncated 6284 lines...]
BUILD FAILED
/usr/home/hudson/hudson-slave/workspace/Lucene-Artifacts-4.x/lucene/build.xml:543:
 The following error occurred while executing this line:
/usr/home/hudson/hudson-slave/workspace/Lucene-Artifacts-4.x/lucene/common-build.xml:2352:
 Can't get https://issues.apache.org/jira/rest/api/2/project/LUCENE to 
/usr/home/hudson/hudson-slave/workspace/Lucene-Artifacts-4.x/lucene/build/docs/changes/jiraVersionList.json

Total time: 5 minutes 33 seconds
Build step 'Invoke Ant' marked build as failure
Archiving artifacts
Publishing Javadoc
Email was triggered for: Failure
Sending email for trigger: Failure



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[JENKINS-MAVEN] Lucene-Solr-Maven-4.x #570: POMs out of sync

2014-01-22 Thread Apache Jenkins Server
Build: https://builds.apache.org/job/Lucene-Solr-Maven-4.x/570/

No tests ran.

Build Log:
[...truncated 6624 lines...]
BUILD FAILED
/usr/home/hudson/hudson-slave/workspace/Lucene-Solr-Maven-4.x/build.xml:475: 
The following error occurred while executing this line:
/usr/home/hudson/hudson-slave/workspace/Lucene-Solr-Maven-4.x/build.xml:153: 
The following error occurred while executing this line:
/usr/home/hudson/hudson-slave/workspace/Lucene-Solr-Maven-4.x/lucene/common-build.xml:539:
 The following error occurred while executing this line:
/usr/home/hudson/hudson-slave/workspace/Lucene-Solr-Maven-4.x/lucene/common-build.xml:536:
 The following error occurred while executing this line:
/usr/home/hudson/hudson-slave/workspace/Lucene-Solr-Maven-4.x/lucene/build.xml:543:
 The following error occurred while executing this line:
/usr/home/hudson/hudson-slave/workspace/Lucene-Solr-Maven-4.x/lucene/common-build.xml:2352:
 Can't get https://issues.apache.org/jira/rest/api/2/project/LUCENE to 
/usr/home/hudson/hudson-slave/workspace/Lucene-Solr-Maven-4.x/lucene/build/docs/changes/jiraVersionList.json

Total time: 5 minutes 34 seconds
Build step 'Invoke Ant' marked build as failure
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[JENKINS] Lucene-Solr-trunk-Windows (64bit/jdk1.7.0_51) - Build # 3702 - Still Failing!

2014-01-22 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Windows/3702/
Java: 64bit/jdk1.7.0_51 -XX:+UseCompressedOops -XX:+UseG1GC

All tests passed

Build Log:
[...truncated 52343 lines...]
BUILD FAILED
C:\Users\JenkinsSlave\workspace\Lucene-Solr-trunk-Windows\build.xml:453: The 
following error occurred while executing this line:
C:\Users\JenkinsSlave\workspace\Lucene-Solr-trunk-Windows\build.xml:57: The 
following error occurred while executing this line:
C:\Users\JenkinsSlave\workspace\Lucene-Solr-trunk-Windows\lucene\build.xml:208: 
The following error occurred while executing this line:
C:\Users\JenkinsSlave\workspace\Lucene-Solr-trunk-Windows\lucene\build.xml:543: 
The following error occurred while executing this line:
C:\Users\JenkinsSlave\workspace\Lucene-Solr-trunk-Windows\lucene\common-build.xml:2321:
 Can't get https://issues.apache.org/jira/rest/api/2/project/LUCENE to 
C:\Users\JenkinsSlave\workspace\Lucene-Solr-trunk-Windows\lucene\build\docs\changes\jiraVersionList.json

Total time: 112 minutes 6 seconds
Build step 'Invoke Ant' marked build as failure
Description set: Java: 64bit/jdk1.7.0_51 -XX:+UseCompressedOops -XX:+UseG1GC
Archiving artifacts
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Re: Welcome Areek Zillur as Lucene/Solr committer!

2014-01-22 Thread Koji Sekiguchi

Welcome, Areek!

koji
--
http://soleami.com/blog/mahout-and-machine-learning-training-course-is-here.html

(14/01/22 4:26), Robert Muir wrote:

I'm pleased to announce that Areek Zillur has accepted to join our ranks as
a committer.

Areek has been improving suggester support in Lucene and Solr, including a
revamped Solr component slated for the 4.7 release. [1]

Areek, it is tradition that you introduce yourself with a brief bio.

Once your account is setup, you should then be able to add yourself to the
who we are page on the website as well.

Congratulations and welcome!

[1] https://issues.apache.org/jira/browse/SOLR-5378






-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[JENKINS] Lucene-Solr-4.x-Windows (64bit/jdk1.7.0_60-ea-b03) - Build # 3626 - Failure!

2014-01-22 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-4.x-Windows/3626/
Java: 64bit/jdk1.7.0_60-ea-b03 -XX:+UseCompressedOops -XX:+UseSerialGC

All tests passed

Build Log:
[...truncated 41961 lines...]
  [javadoc] Generating Javadoc
  [javadoc] Javadoc execution
  [javadoc] warning: [options] bootstrap class path not set in conjunction with 
-source 1.6
  [javadoc] Loading source files for package org.apache.lucene...
  [javadoc] Loading source files for package org.apache.lucene.analysis...
  [javadoc] Loading source files for package 
org.apache.lucene.analysis.tokenattributes...
  [javadoc] Loading source files for package org.apache.lucene.codecs...
  [javadoc] Loading source files for package 
org.apache.lucene.codecs.compressing...
  [javadoc] Loading source files for package 
org.apache.lucene.codecs.lucene3x...
  [javadoc] Loading source files for package 
org.apache.lucene.codecs.lucene40...
  [javadoc] Loading source files for package 
org.apache.lucene.codecs.lucene41...
  [javadoc] Loading source files for package 
org.apache.lucene.codecs.lucene42...
  [javadoc] Loading source files for package 
org.apache.lucene.codecs.lucene45...
  [javadoc] Loading source files for package 
org.apache.lucene.codecs.lucene46...
  [javadoc] Loading source files for package 
org.apache.lucene.codecs.perfield...
  [javadoc] Loading source files for package org.apache.lucene.document...
  [javadoc] Loading source files for package org.apache.lucene.index...
  [javadoc] Loading source files for package org.apache.lucene.search...
  [javadoc] Loading source files for package 
org.apache.lucene.search.payloads...
  [javadoc] Loading source files for package 
org.apache.lucene.search.similarities...
  [javadoc] Loading source files for package org.apache.lucene.search.spans...
  [javadoc] Loading source files for package org.apache.lucene.store...
  [javadoc] Loading source files for package org.apache.lucene.util...
  [javadoc] Loading source files for package org.apache.lucene.util.automaton...
  [javadoc] Loading source files for package org.apache.lucene.util.fst...
  [javadoc] Loading source files for package org.apache.lucene.util.mutable...
  [javadoc] Loading source files for package org.apache.lucene.util.packed...
  [javadoc] Constructing Javadoc information...
  [javadoc] Standard Doclet version 1.7.0_60-ea
  [javadoc] Building tree for all the packages and classes...
  [javadoc] Generating 
C:\Users\JenkinsSlave\workspace\Lucene-Solr-4.x-Windows\lucene\build\docs\core\org\apache\lucene\search\package-summary.html...
  [javadoc] Copying file 
C:\Users\JenkinsSlave\workspace\Lucene-Solr-4.x-Windows\lucene\core\src\java\org\apache\lucene\search\doc-files\nrq-formula-1.png
 to directory 
C:\Users\JenkinsSlave\workspace\Lucene-Solr-4.x-Windows\lucene\build\docs\core\org\apache\lucene\search\doc-files...
  [javadoc] Copying file 
C:\Users\JenkinsSlave\workspace\Lucene-Solr-4.x-Windows\lucene\core\src\java\org\apache\lucene\search\doc-files\nrq-formula-2.png
 to directory 
C:\Users\JenkinsSlave\workspace\Lucene-Solr-4.x-Windows\lucene\build\docs\core\org\apache\lucene\search\doc-files...
  [javadoc] Building index for all the packages and classes...
  [javadoc] Building index for all classes...
  [javadoc] Generating 
C:\Users\JenkinsSlave\workspace\Lucene-Solr-4.x-Windows\lucene\build\docs\core\help-doc.html...
  [javadoc] 1 warning

[...truncated 27 lines...]
  [javadoc] Generating Javadoc
  [javadoc] Javadoc execution
  [javadoc] Loading source files for package org.apache.lucene.analysis.ar...
  [javadoc] warning: [options] bootstrap class path not set in conjunction with 
-source 1.6
  [javadoc] Loading source files for package org.apache.lucene.analysis.bg...
  [javadoc] Loading source files for package org.apache.lucene.analysis.br...
  [javadoc] Loading source files for package org.apache.lucene.analysis.ca...
  [javadoc] Loading source files for package 
org.apache.lucene.analysis.charfilter...
  [javadoc] Loading source files for package org.apache.lucene.analysis.cjk...
  [javadoc] Loading source files for package org.apache.lucene.analysis.ckb...
  [javadoc] Loading source files for package org.apache.lucene.analysis.cn...
  [javadoc] Loading source files for package 
org.apache.lucene.analysis.commongrams...
  [javadoc] Loading source files for package 
org.apache.lucene.analysis.compound...
  [javadoc] Loading source files for package 
org.apache.lucene.analysis.compound.hyphenation...
  [javadoc] Loading source files for package org.apache.lucene.analysis.core...
  [javadoc] Loading source files for package org.apache.lucene.analysis.cz...
  [javadoc] Loading source files for package org.apache.lucene.analysis.da...
  [javadoc] Loading source files for package org.apache.lucene.analysis.de...
  [javadoc] Loading source files for package org.apache.lucene.analysis.el...
  [javadoc] Loading source files for package org.apache.lucene.analysis.en...
  [javadoc] Loading source files for package 

[JENKINS] Lucene-trunk-Linux-Java7-64-test-only - Build # 74595 - Failure!

2014-01-22 Thread builder
Build: builds.flonkings.com/job/Lucene-trunk-Linux-Java7-64-test-only/74595/

1 tests failed.
REGRESSION:  org.apache.lucene.search.TestSearchAfter.testQueries

Error Message:


Stack Trace:
java.lang.NullPointerException
at 
__randomizedtesting.SeedInfo.seed([3A4880EB941822D6:66C64C308E719778]:0)
at 
org.apache.lucene.search.TopFieldCollector$PagingFieldCollector.collect(TopFieldCollector.java:982)
at org.apache.lucene.search.Scorer.score(Scorer.java:65)
at 
org.apache.lucene.search.AssertingScorer.score(AssertingScorer.java:136)
at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:613)
at 
org.apache.lucene.search.AssertingIndexSearcher.search(AssertingIndexSearcher.java:93)
at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:573)
at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:525)
at 
org.apache.lucene.search.IndexSearcher.searchAfter(IndexSearcher.java:416)
at 
org.apache.lucene.search.TestSearchAfter.assertQuery(TestSearchAfter.java:259)
at 
org.apache.lucene.search.TestSearchAfter.assertQuery(TestSearchAfter.java:205)
at 
org.apache.lucene.search.TestSearchAfter.testQueries(TestSearchAfter.java:189)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1559)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.access$600(RandomizedRunner.java:79)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:737)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:773)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:787)
at 
org.apache.lucene.util.TestRuleSetupTeardownChained$1.evaluate(TestRuleSetupTeardownChained.java:50)
at 
org.apache.lucene.util.TestRuleFieldCacheSanity$1.evaluate(TestRuleFieldCacheSanity.java:51)
at 
org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
at 
com.carrotsearch.randomizedtesting.rules.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:55)
at 
org.apache.lucene.util.TestRuleThreadAndTestName$1.evaluate(TestRuleThreadAndTestName.java:49)
at 
org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:70)
at 
org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:358)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl.forkTimeoutingTask(ThreadLeakControl.java:782)
at 
com.carrotsearch.randomizedtesting.ThreadLeakControl$3.evaluate(ThreadLeakControl.java:442)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:746)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$3.evaluate(RandomizedRunner.java:648)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$4.evaluate(RandomizedRunner.java:682)
at 
com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:693)
at 
org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
at 
org.apache.lucene.util.TestRuleStoreClassName$1.evaluate(TestRuleStoreClassName.java:42)
at 
com.carrotsearch.randomizedtesting.rules.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:55)
at 
com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
at 
com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 
org.apache.lucene.util.TestRuleAssertionsRequired$1.evaluate(TestRuleAssertionsRequired.java:43)
at 
org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48)
at 
org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:70)
at 
org.apache.lucene.util.TestRuleIgnoreTestSuites$1.evaluate(TestRuleIgnoreTestSuites.java:55)
at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
at 

Re: [JENKINS] Lucene-trunk-Linux-Java7-64-test-only - Build # 74595 - Failure!

2014-01-22 Thread Robert Muir
1. this doesnt reproduce.
2. the line of the NPE makes no sense: its this code,
if (queueFull) {

queueFull is a primitive type (boolean)...


On Wed, Jan 22, 2014 at 8:41 PM, buil...@flonkings.com wrote:

 Build:
 builds.flonkings.com/job/Lucene-trunk-Linux-Java7-64-test-only/74595/

 1 tests failed.
 REGRESSION:  org.apache.lucene.search.TestSearchAfter.testQueries

 Error Message:


 Stack Trace:
 java.lang.NullPointerException
 at
 __randomizedtesting.SeedInfo.seed([3A4880EB941822D6:66C64C308E719778]:0)
 at
 org.apache.lucene.search.TopFieldCollector$PagingFieldCollector.collect(TopFieldCollector.java:982)
 at org.apache.lucene.search.Scorer.score(Scorer.java:65)
 at
 org.apache.lucene.search.AssertingScorer.score(AssertingScorer.java:136)
 at
 org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:613)
 at
 org.apache.lucene.search.AssertingIndexSearcher.search(AssertingIndexSearcher.java:93)
 at
 org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:573)
 at
 org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:525)
 at
 org.apache.lucene.search.IndexSearcher.searchAfter(IndexSearcher.java:416)
 at
 org.apache.lucene.search.TestSearchAfter.assertQuery(TestSearchAfter.java:259)
 at
 org.apache.lucene.search.TestSearchAfter.assertQuery(TestSearchAfter.java:205)
 at
 org.apache.lucene.search.TestSearchAfter.testQueries(TestSearchAfter.java:189)
 at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
 at
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
 at
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke(Method.java:606)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1559)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner.access$600(RandomizedRunner.java:79)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:737)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:773)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:787)
 at
 org.apache.lucene.util.TestRuleSetupTeardownChained$1.evaluate(TestRuleSetupTeardownChained.java:50)
 at
 org.apache.lucene.util.TestRuleFieldCacheSanity$1.evaluate(TestRuleFieldCacheSanity.java:51)
 at
 org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
 at
 com.carrotsearch.randomizedtesting.rules.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:55)
 at
 org.apache.lucene.util.TestRuleThreadAndTestName$1.evaluate(TestRuleThreadAndTestName.java:49)
 at
 org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:70)
 at
 org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48)
 at
 com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
 at
 com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:358)
 at
 com.carrotsearch.randomizedtesting.ThreadLeakControl.forkTimeoutingTask(ThreadLeakControl.java:782)
 at
 com.carrotsearch.randomizedtesting.ThreadLeakControl$3.evaluate(ThreadLeakControl.java:442)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:746)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner$3.evaluate(RandomizedRunner.java:648)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner$4.evaluate(RandomizedRunner.java:682)
 at
 com.carrotsearch.randomizedtesting.RandomizedRunner$5.evaluate(RandomizedRunner.java:693)
 at
 org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46)
 at
 org.apache.lucene.util.TestRuleStoreClassName$1.evaluate(TestRuleStoreClassName.java:42)
 at
 com.carrotsearch.randomizedtesting.rules.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:55)
 at
 com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
 at
 com.carrotsearch.randomizedtesting.rules.NoShadowingOrOverridesOnMethodsRule$1.evaluate(NoShadowingOrOverridesOnMethodsRule.java:39)
 at
 com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36)
 at
 org.apache.lucene.util.TestRuleAssertionsRequired$1.evaluate(TestRuleAssertionsRequired.java:43)
 at
 org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48)
 at
 

[JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.8.0-ea-b124) - Build # 9200 - Still Failing!

2014-01-22 Thread Policeman Jenkins Server
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/9200/
Java: 64bit/jdk1.8.0-ea-b124 -XX:-UseCompressedOops -XX:+UseG1GC

All tests passed

Build Log:
[...truncated 42833 lines...]
BUILD FAILED
/mnt/ssd/jenkins/workspace/Lucene-Solr-trunk-Linux/build.xml:453: The following 
error occurred while executing this line:
/mnt/ssd/jenkins/workspace/Lucene-Solr-trunk-Linux/build.xml:57: The following 
error occurred while executing this line:
/mnt/ssd/jenkins/workspace/Lucene-Solr-trunk-Linux/lucene/build.xml:208: The 
following error occurred while executing this line:
/mnt/ssd/jenkins/workspace/Lucene-Solr-trunk-Linux/lucene/build.xml:543: The 
following error occurred while executing this line:
/mnt/ssd/jenkins/workspace/Lucene-Solr-trunk-Linux/lucene/common-build.xml:2321:
 Can't get https://issues.apache.org/jira/rest/api/2/project/LUCENE to 
/mnt/ssd/jenkins/workspace/Lucene-Solr-trunk-Linux/lucene/build/docs/changes/jiraVersionList.json

Total time: 60 minutes 14 seconds
Build step 'Invoke Ant' marked build as failure
Description set: Java: 64bit/jdk1.8.0-ea-b124 -XX:-UseCompressedOops 
-XX:+UseG1GC
Archiving artifacts
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure



-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

  1   2   >