[jira] [Created] (PYLUCENE-31) JCC Parallel/Multiprocess Compilation + Caching
Lee Skillen created PYLUCENE-31: --- Summary: JCC Parallel/Multiprocess Compilation + Caching Key: PYLUCENE-31 URL: https://issues.apache.org/jira/browse/PYLUCENE-31 Project: PyLucene Issue Type: Improvement Environment: Linux 3.11.0-19-generic #33-Ubuntu SMP x86_64 GNU/Linux Reporter: Lee Skillen Priority: Minor

JCC uses distutils.Extension() to build both JCC itself and the packages it generates for Java wrapping. Unfortunately, distutils performs its build sequentially and doesn't take advantage of additional free cores for parallel building. As discussed on the list, this is likely a design decision due to potential issues that may arise when building projects with awkward, cyclic, or recursive dependencies. These issues shouldn't appear in JCC-based projects because of the generative nature of the build: all dependencies are resolved and generated before building, and the build itself only compiles and assembles the wrapper, whose files form a flat sequence of independent compilation units.

Enabling this requires monkey patching distutils, which was also discussed on the list as a potential source of issues, although we feel the risk is likely lower than that of the setuptools patching currently used. This would be optional functionality, enabled only if the monkey patching succeeds. Distutils is part of the standard library and might be less susceptible to change than setuptools, and the area of code being patched has barely changed since 2002 (see: http://hg.python.org/cpython/file/tip/Lib/distutils/ccompiler.py).

In addition to the distutils changes, this patch also changes the wrapper class generation to make it more cache friendly, the goal being that no change in the wrapped code means no change in the wrapper code.
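The parallelisation idea can be sketched as follows. Everything here is a hypothetical stand-in shown only to illustrate the pattern (the real patch targets distutils.ccompiler.CCompiler.compile); FakeCompiler is not actual JCC or distutils code:

```python
from multiprocessing.pool import ThreadPool

class FakeCompiler:
    """Stand-in for distutils' CCompiler, whose compile() is sequential."""

    def _compile_one(self, source):
        # Pretend to compile a single unit; the real code would invoke gcc here.
        return source.replace(".cpp", ".o")

    def compile(self, sources):
        # Original behaviour: one compilation unit at a time.
        return [self._compile_one(s) for s in sources]

def parallel_compile(self, sources, jobs=4):
    # Replacement: farm the independent units out to a worker pool. This is
    # safe for JCC because the generated wrapper units have no build-order
    # dependencies on each other.
    with ThreadPool(jobs) as pool:
        return pool.map(self._compile_one, sources)

# Only enable the optional behaviour if patching succeeds; otherwise the
# original sequential compile() is kept, mirroring the graceful fallback
# described in the issue.
try:
    FakeCompiler.compile = parallel_compile
except Exception:
    pass

objects = FakeCompiler().compile(["a.cpp", "b.cpp", "c.cpp"])
```

Note that pool.map preserves input order, so the object list comes back in the same sequence the sequential build would have produced.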
So with a tool such as ccache, minimal changes to the wrapped code significantly reduce rebuild time (almost to 1/n of the full build, where n is the number of files, when only one file has changed). Obviously the maintainers will have to assess this risk and decide whether to accept the patch. The code has only been tested on Linux with Python 2.7.5, but it should fail gracefully and disable parallelisation if any requirement isn't met (not on Linux, no multiprocessing support, or the monkey patching somehow fails). The caching change should still benefit everyone regardless. Please note that an additional dependency on orderedset has been added to achieve more deterministic ordering; this may not be desirable (another package such as ordered-set might be preferred, or the code might be inlined into the package instead), as per maintainer comments.

--- [following repeated from mailing list] ---

Performance statistics: The following are some quick and dirty statistics for building PyLucene itself with JCC (including Java Lucene, which accounts for roughly 30 seconds upfront). The JCC files are split using --files 8, and each build is preceded by a make clean:

Serial (unpatched):
real 5m1.502s
user 5m22.887s
sys 0m7.749s

Parallel (patched, 4 physical cores, 8 hyperthreads, 8 parallel jobs):
real 1m37.382s
user 7m16.658s
sys 0m8.697s

Furthermore, some additional changes were made to the wrapper file generation to make the generated code more ccache friendly (additional deterministic sorting for methods and some usage of an ordered set).
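The deterministic-ordering change can be illustrated with a minimal inlined stand-in for the orderedset dependency (the names below are illustrative, not actual JCC code):

```python
def ordered_unique(items):
    # Stand-in for the orderedset dependency mentioned above: deduplicate
    # while preserving first-seen order, so output never depends on set
    # iteration order.
    seen = set()
    result = []
    for item in items:
        if item not in seen:
            seen.add(item)
            result.append(item)
    return result

# Deterministic method ordering: identical wrapped input now always yields
# byte-identical generated wrapper output, which is what lets ccache hit.
methods = ["close", "add", "close", "search", "add"]
stable = sorted(ordered_unique(methods))
```

Combining an order-preserving dedup with an explicit sort is what makes the generated files stable across runs, regardless of how the methods were discovered.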
With these in place, with the CC and CCACHE_COMPILERCHECK environment variables set to "ccache gcc" and "content" respectively, and with ccache installed, subsequent compilation time is reduced again as follows:

Parallel (patched, 4 physical cores, 8 hyperthreads, 8 parallel jobs, ccache enabled):
real 0m43.051s
user 1m10.392s
sys 0m4.547s

This was a run in which nothing changed between runs, so a realistic run in which changes occur will land somewhere between 0m43.051s and 1m37.382s, depending on how drastic the change was. If many changes are expected and you want to keep the build cache friendly, a higher --files value would probably help (to an extent), or ideally --files separate, although that doesn't currently work for me (need to investigate). We're mostly using the PyLucene build as a test bed since it is repeatable for others, rather than just showing numbers for our own application's compilation; we also use it to run the unit test suite after changes to JCC itself to ensure it still works as intended for PyLucene. For illustrative purposes, though, our application takes 1m53s to compile with JCC from scratch serially, 0m31s in parallel (8 jobs), 0m14s in parallel with ccache enabled and minimal changes, and 0m8s with
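The environment described above could be set up like this before invoking the build. CC and CCACHE_COMPILERCHECK are real compiler/ccache variables; the commented-out build invocation is illustrative only:

```python
import os

# Configure the ccache environment described above before running the build.
env = dict(os.environ)
env["CC"] = "ccache gcc"                 # route C/C++ compilations through ccache
env["CCACHE_COMPILERCHECK"] = "content"  # key the cache on the compiler binary's
                                         # content rather than its mtime
# Hypothetical build invocation using this environment:
# subprocess.check_call(["python", "setup.py", "build"], env=env)
```

With compiler_check set to content, reinstalling the same gcc build won't invalidate the cache, which matters when the goal is byte-stable wrapper output.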
[jira] [Updated] (PYLUCENE-31) JCC Parallel/Multiprocess Compilation + Caching
[ https://issues.apache.org/jira/browse/PYLUCENE-31?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lee Skillen updated PYLUCENE-31: Attachment: feature-parallel-build.patch
[jira] [Updated] (PYLUCENE-31) JCC Parallel/Multiprocess Compilation + Caching
[ https://issues.apache.org/jira/browse/PYLUCENE-31?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lee Skillen updated PYLUCENE-31: Attachment: (was: feature-parallel-build.patch)
[jira] [Updated] (PYLUCENE-31) JCC Parallel/Multiprocess Compilation + Caching
[ https://issues.apache.org/jira/browse/PYLUCENE-31?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lee Skillen updated PYLUCENE-31: Attachment: feature-parallel-build.patch Corrected out-of-date patch.
[jira] [Assigned] (SOLR-5968) BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested
[ https://issues.apache.org/jira/browse/SOLR-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar reassigned SOLR-5968: --- Assignee: Shalin Shekhar Mangar BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested Key: SOLR-5968 URL: https://issues.apache.org/jira/browse/SOLR-5968 Project: Solr Issue Type: Bug Reporter: Gregg Donovan Assignee: Shalin Shekhar Mangar Priority: Minor Attachments: SOLR-5968-NOCOMMIT.diff, SOLR-5968.diff If fl contains only pseudo fields, BinaryResponseWriter will load unnecessary stored fields before discarding them. The call to SolrIndexSearcher#doc(int i, Set<String> fields) can be skipped for this case. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
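The optimization can be illustrated in Python pseudocode (Solr itself is Java; every name below is a hypothetical stand-in, not actual Solr code):

```python
# Sketch of the SOLR-5968 guard: skip the stored-field read entirely when
# every requested field is a pseudo-field.
CALLS = {"doc": 0}

def load_stored_doc(doc_id, fields):
    # Plays the role of SolrIndexSearcher#doc(int, Set<String>): the expensive
    # stored-field fetch we want to avoid when nothing stored is needed.
    CALLS["doc"] += 1
    return {f: "stored-value" for f in fields}

def is_pseudo(field):
    # e.g. "score" or transformer fields like "[docid]" carry no stored data
    return field == "score" or field.startswith("[")

def response_fields(doc_id, fl):
    real = {f for f in fl if not is_pseudo(f)}
    if not real:
        return {}  # all requested fields are pseudo: skip the stored read
    return load_stored_doc(doc_id, real)

response_fields(1, ["score", "[docid]"])  # skips the fetch entirely
response_fields(2, ["id", "score"])       # performs exactly one fetch
```

The guard costs one set comprehension per document but saves a full stored-document read whenever fl is purely pseudo-fields.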
[jira] [Commented] (LUCENE-5801) Resurrect org.apache.lucene.facet.util.OrdinalMappingAtomicReader
[ https://issues.apache.org/jira/browse/LUCENE-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068276#comment-14068276 ] ASF subversion and git services commented on LUCENE-5801: - Commit 1612197 from [~shaie] in branch 'dev/trunk' [ https://svn.apache.org/r1612197 ] LUCENE-5801: add default indexFieldName to facetFields Resurrect org.apache.lucene.facet.util.OrdinalMappingAtomicReader - Key: LUCENE-5801 URL: https://issues.apache.org/jira/browse/LUCENE-5801 Project: Lucene - Core Issue Type: Bug Affects Versions: 4.7 Reporter: Nicola Buso Assignee: Shai Erera Fix For: 5.0, 4.10 Attachments: LUCENE-5801.patch, LUCENE-5801.patch, LUCENE-5801.patch, LUCENE-5801.patch, LUCENE-5801_1.patch, LUCENE-5801_2.patch As of Lucene 4.6.1 the class org.apache.lucene.facet.util.OrdinalMappingAtomicReader was removed; resurrect it because it is used when merging indexes whose taxonomies have been merged.
[jira] [Commented] (LUCENE-5801) Resurrect org.apache.lucene.facet.util.OrdinalMappingAtomicReader
[ https://issues.apache.org/jira/browse/LUCENE-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068279#comment-14068279 ] Shai Erera commented on LUCENE-5801: Committed the fix.
[jira] [Commented] (LUCENE-5801) Resurrect org.apache.lucene.facet.util.OrdinalMappingAtomicReader
[ https://issues.apache.org/jira/browse/LUCENE-5801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068277#comment-14068277 ] ASF subversion and git services commented on LUCENE-5801: - Commit 1612198 from [~shaie] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1612198 ] LUCENE-5801: add default indexFieldName to facetFields
[jira] [Commented] (SOLR-5968) BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested
[ https://issues.apache.org/jira/browse/SOLR-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068284#comment-14068284 ] ASF subversion and git services commented on SOLR-5968: --- Commit 1612200 from sha...@apache.org in branch 'dev/trunk' [ https://svn.apache.org/r1612200 ] SOLR-5968: BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested
[jira] [Commented] (SOLR-6227) ChaosMonkeySafeLeaderTest failures on jenkins
[ https://issues.apache.org/jira/browse/SOLR-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068287#comment-14068287 ] Shalin Shekhar Mangar commented on SOLR-6227: - I haven't seen the failure mentioned in the issue description but my jenkins found the following failure yesterday: {code} java.lang.AssertionError: expected:<0> but was:<1> at __randomizedtesting.SeedInfo.seed([2D7931A1F137DAA5:AC9FBFB98668BA99]:0) at org.junit.Assert.fail(Assert.java:93) at org.junit.Assert.failNotEquals(Assert.java:647) at org.junit.Assert.assertEquals(Assert.java:128) at org.junit.Assert.assertEquals(Assert.java:472) at org.junit.Assert.assertEquals(Assert.java:456) at org.apache.solr.cloud.ChaosMonkeySafeLeaderTest.doTest(ChaosMonkeySafeLeaderTest.java:141) at org.apache.solr.BaseDistributedSearchTestCase.testDistribSearch(BaseDistributedSearchTestCase.java:863) {code} ChaosMonkeySafeLeaderTest failures on jenkins - Key: SOLR-6227 URL: https://issues.apache.org/jira/browse/SOLR-6227 Project: Solr Issue Type: Bug Components: SolrCloud, Tests Reporter: Shalin Shekhar Mangar Fix For: 4.10 This is happening very frequently. {code} 1 tests failed. REGRESSION: org.apache.solr.cloud.ChaosMonkeySafeLeaderTest.testDistribSearch Error Message: shard1 is not consistent. Got 143 from https://127.0.0.1:36610/xvv/collection1lastClient and got 142 from https://127.0.0.1:33168/xvv/collection1 Stack Trace: java.lang.AssertionError: shard1 is not consistent. 
Got 143 from https://127.0.0.1:36610/xvv/collection1lastClient and got 142 from https://127.0.0.1:33168/xvv/collection1 at __randomizedtesting.SeedInfo.seed([3C1FB6EAFE71:BDF938F2AA829E4D]:0) at org.junit.Assert.fail(Assert.java:93) at org.apache.solr.cloud.AbstractFullDistribZkTestBase.checkShardConsistency(AbstractFullDistribZkTestBase.java:1139) at org.apache.solr.cloud.AbstractFullDistribZkTestBase.checkShardConsistency(AbstractFullDistribZkTestBase.java:1118) at org.apache.solr.cloud.ChaosMonkeySafeLeaderTest.doTest(ChaosMonkeySafeLeaderTest.java:150) at org.apache.solr.BaseDistributedSearchTestCase.testDistribSearch(BaseDistributedSearchTestCase.java:865) {code}
[jira] [Commented] (SOLR-5968) BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested
[ https://issues.apache.org/jira/browse/SOLR-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068316#comment-14068316 ] ASF subversion and git services commented on SOLR-5968: --- Commit 1612210 from sha...@apache.org in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1612210 ] SOLR-5968: BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested
[jira] [Resolved] (SOLR-5968) BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested
[ https://issues.apache.org/jira/browse/SOLR-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shalin Shekhar Mangar resolved SOLR-5968. - Resolution: Fixed Fix Version/s: 4.10 5.0 Thanks Gregg. I've committed your patch without the test. I can't quite figure out how to test it better either and this optimization is really useful.
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068330#comment-14068330 ] Yonik Seeley commented on SOLR-6260: Reasonable people can disagree. One should not have to face hurtful personal attacks when they do. Rename DirectUpdateHandler2 --- Key: SOLR-6260 URL: https://issues.apache.org/jira/browse/SOLR-6260 Project: Solr Issue Type: Improvement Affects Versions: 5.0 Reporter: Tomás Fernández Löbbe Priority: Minor Attachments: SOLR-6260.patch, SOLR-6260.patch DirectUpdateHandler was removed, I think in Solr 4. DirectUpdateHandler2 should be renamed; at least remove that 2. I don't really know what direct means here. Maybe it could be renamed to DefaultUpdateHandler, or UpdateHandlerDefaultImpl, or other good suggestions.
[jira] [Commented] (SOLR-5473) Split clusterstate.json per collection and watch states selectively
[ https://issues.apache.org/jira/browse/SOLR-5473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068335#comment-14068335 ] Noble Paul commented on SOLR-5473: -- [~markrmil...@gmail.com] Can you look at the latest patch and let me know your comments Split clusterstate.json per collection and watch states selectively Key: SOLR-5473 URL: https://issues.apache.org/jira/browse/SOLR-5473 Project: Solr Issue Type: Sub-task Components: SolrCloud Reporter: Noble Paul Assignee: Noble Paul Fix For: 5.0 Attachments: SOLR-5473-74 .patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74_POC.patch, SOLR-5473-configname-fix.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473_undo.patch, ec2-23-20-119-52_solr.log, ec2-50-16-38-73_solr.log As defined in the parent issue, store the states of each collection under /collections/collectionname/state.json node -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
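The per-collection layout described in the issue can be shown with a small helper (a hypothetical function; only the path format comes from the issue text):

```python
def collection_state_path(collection_name):
    # Per-collection state node: /collections/<name>/state.json, replacing the
    # single shared clusterstate.json that every node previously watched.
    return "/collections/%s/state.json" % collection_name

path = collection_state_path("collection1")
```

Splitting state per collection lets each node watch only the collections it hosts instead of one global znode.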
[jira] [Commented] (SOLR-5958) Document (and fix) numShards and router selection parameter in SolrCloud
[ https://issues.apache.org/jira/browse/SOLR-5958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068340#comment-14068340 ] Noble Paul commented on SOLR-5958: -- bq.Though the bootstrap does not treat numShards as a required param, the Collection API does and throw an error if we don't specify numShards. The collection API is not used to create the collection, but the STATE command does that. The STATE command would fail if numShards is not present, but the nodes will always send the value 1 if nothing else is configured Document (and fix) numShards and router selection parameter in SolrCloud Key: SOLR-5958 URL: https://issues.apache.org/jira/browse/SOLR-5958 Project: Solr Issue Type: Task Components: SolrCloud Reporter: Anshum Gupta Assignee: Anshum Gupta Priority: Minor Right now numShards works in rather mysterious ways (unless you know how it works). We should clearly document the following: * If we start SolrCloud with bootstrapping, without mentioning numShards parameter, it defaults to 1 and also defaults the router to 'implicit'. * Mentioning numShards param, defaults the router to compositeId. * Though the bootstrap does not treat numShards as a required param, the Collection API does and throw an error if we don't specify numShards. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Comment Edited] (SOLR-5958) Document (and fix) numShards and router selection parameter in SolrCloud
[ https://issues.apache.org/jira/browse/SOLR-5958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068340#comment-14068340 ] Noble Paul edited comment on SOLR-5958 at 7/21/14 9:25 AM: --- bq.Though the bootstrap does not treat numShards as a required param, the Collection API does and throw an error if we don't specify numShards. The collection API is not used to create the collection, but the STATE command does that. The STATE command would fail if numShards is not present, but the nodes will always send the value 1 if nothing else is configured But the collection API has to use some numShards for a non 'implicit router' collection . Hence the router is implicit Probably we need our documentation to improve was (Author: noble.paul): bq.Though the bootstrap does not treat numShards as a required param, the Collection API does and throw an error if we don't specify numShards. The collection API is not used to create the collection, but the STATE command does that. The STATE command would fail if numShards is not present, but the nodes will always send the value 1 if nothing else is configured Document (and fix) numShards and router selection parameter in SolrCloud Key: SOLR-5958 URL: https://issues.apache.org/jira/browse/SOLR-5958 Project: Solr Issue Type: Task Components: SolrCloud Reporter: Anshum Gupta Assignee: Anshum Gupta Priority: Minor Right now numShards works in rather mysterious ways (unless you know how it works). We should clearly document the following: * If we start SolrCloud with bootstrapping, without mentioning numShards parameter, it defaults to 1 and also defaults the router to 'implicit'. * Mentioning numShards param, defaults the router to compositeId. * Though the bootstrap does not treat numShards as a required param, the Collection API does and throw an error if we don't specify numShards. 
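The defaulting behavior described in SOLR-5958 can be summed up as a small fallback rule. The sketch below is a paraphrase of the documented behavior only, not Solr source; `routerFor` and `effectiveNumShards` are hypothetical helper names:

```java
class RouterDefaults {
    // Per the discussion: bootstrapping SolrCloud without a numShards
    // parameter defaults to 1 shard with the 'implicit' router; specifying
    // numShards selects the 'compositeId' router instead.
    static String routerFor(Integer numShards) {
        return numShards == null ? "implicit" : "compositeId";
    }

    static int effectiveNumShards(Integer numShards) {
        return numShards == null ? 1 : numShards;
    }

    public static void main(String[] args) {
        System.out.println(routerFor(null) + " / " + effectiveNumShards(null));
        System.out.println(routerFor(2) + " / " + effectiveNumShards(2));
    }
}
```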
[jira] [Commented] (LUCENE-5819) Add block tree postings format that supports term ords
[ https://issues.apache.org/jira/browse/LUCENE-5819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068341#comment-14068341 ] ASF subversion and git services commented on LUCENE-5819: - Commit 1612213 from [~mikemccand] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1612213 ] LUCENE-5819: add terms dict and postings format that implement term ordinals Add block tree postings format that supports term ords -- Key: LUCENE-5819 URL: https://issues.apache.org/jira/browse/LUCENE-5819 Project: Lucene - Core Issue Type: Improvement Components: modules/other Reporter: Michael McCandless Assignee: Michael McCandless Fix For: 5.0, 4.10 Attachments: LUCENE-5819.patch, LUCENE-5819.patch BlockTree is our default terms dictionary today, but it doesn't support term ords, which is an optional API in the postings format to retrieve the ordinal for the currently seek'd term, and also later seek by that ordinal e.g. to lookup the term. This can possibly be useful for e.g. faceting, and maybe at some point we can share the postings terms dict with the one used by sorted/set DV for cases when app wants to invert and facet on a given field. The older (3.x) block terms dict can easily support ords, and we have a Lucene41OrdsPF in test-framework, but it's not as fast / compact as block-tree, and doesn't (can't easily) implement an optimized intersect, but it could be for fields we'd want to facet on, these tradeoffs don't matter. It's nice to have options... -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Resolved] (LUCENE-5819) Add block tree postings format that supports term ords
[ https://issues.apache.org/jira/browse/LUCENE-5819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless resolved LUCENE-5819. Resolution: Fixed Add block tree postings format that supports term ords -- Key: LUCENE-5819 URL: https://issues.apache.org/jira/browse/LUCENE-5819 Project: Lucene - Core Issue Type: Improvement Components: modules/other Reporter: Michael McCandless Assignee: Michael McCandless Fix For: 5.0, 4.10 Attachments: LUCENE-5819.patch, LUCENE-5819.patch BlockTree is our default terms dictionary today, but it doesn't support term ords, which is an optional API in the postings format to retrieve the ordinal for the currently seek'd term, and also later seek by that ordinal e.g. to lookup the term. This can possibly be useful for e.g. faceting, and maybe at some point we can share the postings terms dict with the one used by sorted/set DV for cases when app wants to invert and facet on a given field. The older (3.x) block terms dict can easily support ords, and we have a Lucene41OrdsPF in test-framework, but it's not as fast / compact as block-tree, and doesn't (can't easily) implement an optimized intersect, but it could be for fields we'd want to facet on, these tradeoffs don't matter. It's nice to have options... -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-5819) Add block tree postings format that supports term ords
[ https://issues.apache.org/jira/browse/LUCENE-5819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068355#comment-14068355 ] ASF subversion and git services commented on LUCENE-5819: - Commit 1612217 from [~mikemccand] in branch 'dev/trunk' [ https://svn.apache.org/r1612217 ] LUCENE-5819: fix ord bug; add test case; remove dead code Add block tree postings format that supports term ords -- Key: LUCENE-5819 URL: https://issues.apache.org/jira/browse/LUCENE-5819 Project: Lucene - Core Issue Type: Improvement Components: modules/other Reporter: Michael McCandless Assignee: Michael McCandless Fix For: 5.0, 4.10 Attachments: LUCENE-5819.patch, LUCENE-5819.patch BlockTree is our default terms dictionary today, but it doesn't support term ords, which is an optional API in the postings format to retrieve the ordinal for the currently seek'd term, and also later seek by that ordinal e.g. to lookup the term. This can possibly be useful for e.g. faceting, and maybe at some point we can share the postings terms dict with the one used by sorted/set DV for cases when app wants to invert and facet on a given field. The older (3.x) block terms dict can easily support ords, and we have a Lucene41OrdsPF in test-framework, but it's not as fast / compact as block-tree, and doesn't (can't easily) implement an optimized intersect, but it could be for fields we'd want to facet on, these tradeoffs don't matter. It's nice to have options... -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
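For readers unfamiliar with the ord API under discussion: a term ordinal is the rank of a term in the dictionary's sorted order, and the two optional operations are "ordinal of the currently seek'd term" and "seek by ordinal". A minimal self-contained model of that contract — a sorted array, not Lucene's block-tree implementation — looks like this:

```java
import java.util.Arrays;

// Toy model of the term-ord contract (cf. TermsEnum.ord() and
// seekExact(long)), backed by a sorted array rather than a block-tree
// terms dictionary. Illustration only.
class OrdTermDict {
    private final String[] sortedTerms;

    OrdTermDict(String... terms) {
        sortedTerms = terms.clone();
        Arrays.sort(sortedTerms);
    }

    // ord -> term: seek by ordinal, then return the term found there
    String seekExact(long ord) {
        return sortedTerms[(int) ord];
    }

    // term -> ord: ordinal of the term, or -1 if the term is absent
    long ord(String term) {
        int idx = Arrays.binarySearch(sortedTerms, term);
        return idx >= 0 ? idx : -1;
    }
}
```

Faceting code can then store the compact ordinal instead of the term bytes and resolve it back to a term only when labels are needed.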
[JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_11) - Build # 10862 - Failure!
Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10862/ Java: 32bit/jdk1.8.0_11 -client -XX:+UseConcMarkSweepGC 1 tests failed. REGRESSION: org.apache.lucene.uninverting.TestFieldCacheSort.testEmptyStringVsNullStringSort Error Message: Stack Trace: java.lang.AssertionError at __randomizedtesting.SeedInfo.seed([852BB84974DA2250:E4C88A9E0CBB24E0]:0) at org.apache.lucene.index.AssertingAtomicReader.getSortedDocValues(AssertingAtomicReader.java:638) at org.apache.lucene.index.MultiDocValues.getSortedValues(MultiDocValues.java:312) at org.apache.lucene.index.SlowCompositeReaderWrapper.getSortedDocValues(SlowCompositeReaderWrapper.java:128) at org.apache.lucene.index.FilterAtomicReader.getSortedDocValues(FilterAtomicReader.java:414) at org.apache.lucene.index.FieldFilterAtomicReader.getSortedDocValues(FieldFilterAtomicReader.java:132) at org.apache.lucene.index.ParallelAtomicReader.getSortedDocValues(ParallelAtomicReader.java:288) at org.apache.lucene.index.FilterAtomicReader.getSortedDocValues(FilterAtomicReader.java:414) at org.apache.lucene.index.FieldFilterAtomicReader.getSortedDocValues(FieldFilterAtomicReader.java:132) at org.apache.lucene.index.ParallelAtomicReader.getSortedDocValues(ParallelAtomicReader.java:288) at org.apache.lucene.index.FilterAtomicReader.getSortedDocValues(FilterAtomicReader.java:414) at org.apache.lucene.index.AssertingAtomicReader.getSortedDocValues(AssertingAtomicReader.java:635) at org.apache.lucene.index.ParallelAtomicReader.getSortedDocValues(ParallelAtomicReader.java:288) at org.apache.lucene.index.DocValues.getSorted(DocValues.java:273) at org.apache.lucene.search.FieldComparator$TermOrdValComparator.getSortedDocValues(FieldComparator.java:821) at org.apache.lucene.search.FieldComparator$TermOrdValComparator.setNextReader(FieldComparator.java:826) at org.apache.lucene.search.TopFieldCollector$OneComparatorNonScoringCollector.doSetNextReader(TopFieldCollector.java:97) at 
org.apache.lucene.search.SimpleCollector.getLeafCollector(SimpleCollector.java:33) at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:605) at org.apache.lucene.search.AssertingIndexSearcher.search(AssertingIndexSearcher.java:94) at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:573) at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:525) at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:502) at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:318) at org.apache.lucene.uninverting.TestFieldCacheSort.testEmptyStringVsNullStringSort(TestFieldCacheSort.java:1029) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:483) at com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1618) at com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:827) at com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:863) at com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:877) at org.apache.lucene.util.TestRuleSetupTeardownChained$1.evaluate(TestRuleSetupTeardownChained.java:50) at org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46) at com.carrotsearch.randomizedtesting.rules.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:55) at org.apache.lucene.util.TestRuleThreadAndTestName$1.evaluate(TestRuleThreadAndTestName.java:49) at org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:65) at org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48) at 
com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36) at com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:365) at com.carrotsearch.randomizedtesting.ThreadLeakControl.forkTimeoutingTask(ThreadLeakControl.java:798) at com.carrotsearch.randomizedtesting.ThreadLeakControl$3.evaluate(ThreadLeakControl.java:458) at com.carrotsearch.randomizedtesting.RandomizedRunner.runSingleTest(RandomizedRunner.java:836) at com.carrotsearch.randomizedtesting.RandomizedRunner$3.evaluate(RandomizedRunner.java:738) at com.carrotsearch.randomizedtesting.RandomizedRunner$4.evaluate(RandomizedRunner.java:772) at
[jira] [Commented] (SOLR-3619) Rename 'example' dir to 'server' and pull examples into an 'examples' directory
[ https://issues.apache.org/jira/browse/SOLR-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068403#comment-14068403 ] Grant Ingersoll commented on SOLR-3619: --- bq. Yeah, it feels like one should still be able to start the server and then index a document (as they can do now) without any other mandatory steps. +1 Agree, single core dies, collection1 dies. Everything should just work out of the box! The first 5 minute experience should be all about the user and their data and very little to do about solr configs, schemas, etc. Same for pretty much the whole first day. By the end of the first week, a new user should have a thorough understanding of what they need to do to get to production. Easy to start, easy to finish. Rename 'example' dir to 'server' and pull examples into an 'examples' directory --- Key: SOLR-3619 URL: https://issues.apache.org/jira/browse/SOLR-3619 Project: Solr Issue Type: Improvement Reporter: Mark Miller Fix For: 4.9, 5.0 Attachments: SOLR-3619.patch, server-name-layout.png -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-5835) Add sortMissingLast support to TermValComparator
[ https://issues.apache.org/jira/browse/LUCENE-5835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068404#comment-14068404 ] ASF subversion and git services commented on LUCENE-5835: - Commit 1612245 from [~jpountz] in branch 'dev/trunk' [ https://svn.apache.org/r1612245 ] LUCENE-5835: TermValComparator can sort missing values last. Add sortMissingLast support to TermValComparator Key: LUCENE-5835 URL: https://issues.apache.org/jira/browse/LUCENE-5835 Project: Lucene - Core Issue Type: Improvement Reporter: Adrien Grand Assignee: Adrien Grand Priority: Minor Fix For: 5.0, 4.10 Attachments: LUCENE-5835.patch It would be nice to allow to configure the behavior on missing values for this comparator, similarly to what TermOrdValComparator does. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-5835) Add sortMissingLast support to TermValComparator
[ https://issues.apache.org/jira/browse/LUCENE-5835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068414#comment-14068414 ] ASF subversion and git services commented on LUCENE-5835: - Commit 1612249 from [~jpountz] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1612249 ] LUCENE-5835: TermValComparator can sort missing values last. Add sortMissingLast support to TermValComparator Key: LUCENE-5835 URL: https://issues.apache.org/jira/browse/LUCENE-5835 Project: Lucene - Core Issue Type: Improvement Reporter: Adrien Grand Assignee: Adrien Grand Priority: Minor Fix For: 5.0, 4.10 Attachments: LUCENE-5835.patch It would be nice to allow to configure the behavior on missing values for this comparator, similarly to what TermOrdValComparator does. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
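The sortMissingLast semantics being added here can be illustrated with plain java.util comparators. This models the ordering behavior only; it is not the TermValComparator patch itself:

```java
import java.util.Arrays;
import java.util.Comparator;

// Models sortMissingLast: documents with no value for the sort field
// (represented here as null) sort after all documents that have one,
// instead of the default missing-first ordering.
class MissingLast {
    static String[] sorted(String[] values, boolean missingLast) {
        String[] copy = values.clone();
        Comparator<String> cmp = missingLast
                ? Comparator.nullsLast(Comparator.<String>naturalOrder())
                : Comparator.nullsFirst(Comparator.<String>naturalOrder());
        Arrays.sort(copy, cmp);
        return copy;
    }
}
```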
Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_11) - Build # 10862 - Failure!
I'm looking into it. On Mon, Jul 21, 2014 at 12:46 PM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10862/ Java: 32bit/jdk1.8.0_11 -client -XX:+UseConcMarkSweepGC 1 tests failed. REGRESSION: org.apache.lucene.uninverting.TestFieldCacheSort.testEmptyStringVsNullStringSort
Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_11) - Build # 10862 - Failure!
This is due to https://issues.apache.org/jira/browse/LUCENE-5788. On Mon, Jul 21, 2014 at 1:16 PM, Adrien Grand jpou...@gmail.com wrote: I'm looking into it. On Mon, Jul 21, 2014 at 12:46 PM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10862/ Java: 32bit/jdk1.8.0_11 -client -XX:+UseConcMarkSweepGC 1 tests failed. REGRESSION: org.apache.lucene.uninverting.TestFieldCacheSort.testEmptyStringVsNullStringSort
[jira] [Resolved] (LUCENE-5835) Add sortMissingLast support to TermValComparator
[ https://issues.apache.org/jira/browse/LUCENE-5835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrien Grand resolved LUCENE-5835. -- Resolution: Fixed Add sortMissingLast support to TermValComparator Key: LUCENE-5835 URL: https://issues.apache.org/jira/browse/LUCENE-5835 Project: Lucene - Core Issue Type: Improvement Reporter: Adrien Grand Assignee: Adrien Grand Priority: Minor Fix For: 5.0, 4.10 Attachments: LUCENE-5835.patch It would be nice to allow to configure the behavior on missing values for this comparator, similarly to what TermOrdValComparator does. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-5834) Make empty doc values impls singletons
[ https://issues.apache.org/jira/browse/LUCENE-5834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068424#comment-14068424 ] ASF subversion and git services commented on LUCENE-5834: - Commit 1612251 from [~jpountz] in branch 'dev/trunk' [ https://svn.apache.org/r1612251 ] LUCENE-5834: Empty sorted set and numeric doc values are now singletons. Make empty doc values impls singletons -- Key: LUCENE-5834 URL: https://issues.apache.org/jira/browse/LUCENE-5834 Project: Lucene - Core Issue Type: Improvement Reporter: Adrien Grand Assignee: Adrien Grand Priority: Trivial Fix For: 5.0, 4.10 Attachments: LUCENE-5834.patch Making these empty instances singletons would allow to use {{unwrapSingleton}} to check if they are single-valued. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Resolved] (LUCENE-5834) Make empty doc values impls singletons
[ https://issues.apache.org/jira/browse/LUCENE-5834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrien Grand resolved LUCENE-5834. -- Resolution: Fixed Make empty doc values impls singletons -- Key: LUCENE-5834 URL: https://issues.apache.org/jira/browse/LUCENE-5834 Project: Lucene - Core Issue Type: Improvement Reporter: Adrien Grand Assignee: Adrien Grand Priority: Trivial Fix For: 5.0, 4.10 Attachments: LUCENE-5834.patch Making these empty instances singletons would allow to use {{unwrapSingleton}} to check if they are single-valued. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-5834) Make empty doc values impls singletons
[ https://issues.apache.org/jira/browse/LUCENE-5834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068425#comment-14068425 ] ASF subversion and git services commented on LUCENE-5834: - Commit 1612252 from [~jpountz] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1612252 ] LUCENE-5834: Empty sorted set and numeric doc values are now singletons. Make empty doc values impls singletons -- Key: LUCENE-5834 URL: https://issues.apache.org/jira/browse/LUCENE-5834 Project: Lucene - Core Issue Type: Improvement Reporter: Adrien Grand Assignee: Adrien Grand Priority: Trivial Fix For: 5.0, 4.10 Attachments: LUCENE-5834.patch Making these empty instances singletons would allow to use {{unwrapSingleton}} to check if they are single-valued. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
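The point of making the empty doc-values implementations singletons is that callers (such as unwrapSingleton-style checks) can then test for emptiness by object identity, and no fresh empty instance is allocated per field or segment. A generic sketch of the pattern — not the LUCENE-5834 patch itself:

```java
import java.util.Collections;
import java.util.Iterator;

// Sketch of the singleton-empty-instance pattern: one shared immutable
// "empty" object lets callers test for emptiness with ==, and avoids
// allocating a new empty impl each time one is requested.
class EmptyValues implements Iterable<Long> {
    private static final EmptyValues EMPTY = new EmptyValues();

    private EmptyValues() {}  // no other instances can ever be created

    static EmptyValues empty() {
        return EMPTY;  // always the same object
    }

    static boolean isEmpty(Iterable<Long> values) {
        return values == EMPTY;  // identity check is safe: it's a singleton
    }

    @Override
    public Iterator<Long> iterator() {
        return Collections.<Long>emptyList().iterator();
    }
}
```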
[jira] [Created] (SOLR-6262) Make the name attribute optional for components in solrconfig.xml
Noble Paul created SOLR-6262: Summary: Make the name attribute optional for components in solrconfig.xml Key: SOLR-6262 URL: https://issues.apache.org/jira/browse/SOLR-6262 Project: Solr Issue Type: Improvement Reporter: Noble Paul Assignee: Noble Paul It is not a good idea to let people decide the names of our standard components such as update, replication, /get etc. These names can be hard coded in the Java file itself. and let us remove the names from solrconfig.xml. However it should be possible to override the name by specifying the 'name' attribute -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6262) Make the name attribute optional for components in solrconfig.xml
[ https://issues.apache.org/jira/browse/SOLR-6262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068434#comment-14068434 ] Yonik Seeley commented on SOLR-6262: +1 Make the name attribute optional for components in solrconfig.xml --- Key: SOLR-6262 URL: https://issues.apache.org/jira/browse/SOLR-6262 Project: Solr Issue Type: Improvement Reporter: Noble Paul Assignee: Noble Paul It is not a good idea to let people decide the names of our standard components such as update, replication, /get etc. These names can be hard coded in the Java file itself. and let us remove the names from solrconfig.xml. However it should be possible to override the name by specifying the 'name' attribute -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6262) Make the name attribute optional for components in solrconfig.xml
[ https://issues.apache.org/jira/browse/SOLR-6262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068439#comment-14068439 ] Shalin Shekhar Mangar commented on SOLR-6262: - +1 I'd argue the same for the primitive types (int/float/long/double/date/boolean/string) in our schema.xml but that can be another issue. Make the name attribute optional for components in solrconfig.xml --- Key: SOLR-6262 URL: https://issues.apache.org/jira/browse/SOLR-6262 Project: Solr Issue Type: Improvement Reporter: Noble Paul Assignee: Noble Paul It is not a good idea to let people decide the names of our standard components such as update, replication, /get etc. These names can be hard coded in the Java file itself. and let us remove the names from solrconfig.xml. However it should be possible to override the name by specifying the 'name' attribute -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
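The SOLR-6262 proposal — hard-coded default names for standard components, with the name attribute in solrconfig.xml as an optional override — amounts to a simple fallback rule. The following is a hypothetical sketch of that rule, not Solr code, and the class-to-name mappings are illustrative:

```java
import java.util.HashMap;
import java.util.Map;

// Sketch of the proposed behavior: standard components carry a built-in
// default name, and an explicit name attribute, when present, wins.
class ComponentNames {
    private static final Map<String, String> DEFAULTS = new HashMap<>();
    static {
        // illustrative defaults, not an authoritative list
        DEFAULTS.put("ReplicationHandler", "/replication");
        DEFAULTS.put("RealTimeGetHandler", "/get");
    }

    static String resolveName(String className, String nameAttr) {
        if (nameAttr != null) {
            return nameAttr;            // explicit override from solrconfig.xml
        }
        return DEFAULTS.get(className); // otherwise the hard-coded default
    }
}
```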
[jira] [Updated] (LUCENE-5838) hunspell buggy with over 64k affixes
[ https://issues.apache.org/jira/browse/LUCENE-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated LUCENE-5838: Attachment: LUCENE-5838.patch Patch with a test that generates its own file, so it doesn't need a 1MB test data file. hunspell buggy with over 64k affixes Key: LUCENE-5838 URL: https://issues.apache.org/jira/browse/LUCENE-5838 Project: Lucene - Core Issue Type: Bug Reporter: Robert Muir Attachments: LUCENE-5838.patch, LUCENE-5838.patch Currently we build a TreeMap<String,List<Character>> in RAM, to sort before adding to the FST (which encodes the list as an IntsRef). char overflows here if there are more than 64k affixes (e.g. Basque).
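To illustrate the overflow Robert describes, here is a stdlib-only sketch (my own illustration, not code from the patch): a Java char is an unsigned 16-bit value, so using it to number affixes silently wraps once you pass 65535.

```java
public class CharOverflowDemo {
    public static void main(String[] args) {
        // A char can only represent 0..65535, so it works as an affix ordinal
        // only while there are at most 64k affixes.
        char id = (char) 65535;   // the largest ordinal a char can hold
        id++;                     // one more affix...
        System.out.println((int) id);  // wraps around to 0
    }
}
```

This is why a dictionary like Basque, with more than 64k affixes, ends up with colliding affix ids rather than a crash: the wrap-around is silent.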
[jira] [Commented] (SOLR-6227) ChaosMonkeySafeLeaderTest failures on jenkins
[ https://issues.apache.org/jira/browse/SOLR-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068486#comment-14068486 ] Deepak Kumar commented on SOLR-6227: Hello all, while invoking http://localhost:9090/solr/admin/collections?action=CREATE&name=core&numShards=1&replicationFactor=1&collection.configName=coreconf I am getting the exception below. This has been happening consistently for the 4.7.2 and 4.7.1 versions; please help me understand if it's the very same thing:
-- solr.log --
[ERROR] [2014-07-21 18:06:20,960] [Overseer-92140072928280576-localhost:9090_solr-n_03] [cloud.OverseerCollectionProcessor] - [Collection createcollection of createcollection failed:org.apache.solr.common.SolrException
  at org.apache.solr.cloud.OverseerCollectionProcessor.createCollection(OverseerCollectionProcessor.java:1687)
  at org.apache.solr.cloud.OverseerCollectionProcessor.processMessage(OverseerCollectionProcessor.java:387)
  at org.apache.solr.cloud.OverseerCollectionProcessor.run(OverseerCollectionProcessor.java:200)
  at java.lang.Thread.run(Thread.java:662)
Caused by: org.apache.zookeeper.KeeperException$NodeExistsException: KeeperErrorCode = NodeExists for /collections/usersearches
  at org.apache.zookeeper.KeeperException.create(KeeperException.java:119)
  at org.apache.zookeeper.KeeperException.create(KeeperException.java:51)
  at org.apache.zookeeper.ZooKeeper.create(ZooKeeper.java:783)
  at org.apache.solr.common.cloud.SolrZkClient$10.execute(SolrZkClient.java:429)
  at org.apache.solr.common.cloud.ZkCmdExecutor.retryOperation(ZkCmdExecutor.java:73)
  at org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:426)
  at org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:383)
  at org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:370)
  at org.apache.solr.common.cloud.SolrZkClient.makePath(SolrZkClient.java:357)
  at org.apache.solr.cloud.OverseerCollectionProcessor.createConfNode(OverseerCollectionProcessor.java:1711)
  at org.apache.solr.cloud.OverseerCollectionProcessor.createCollection(OverseerCollectionProcessor.java:1624)
  ... 3 more ]
[ERROR] [2014-07-21 16:39:55,906] [http-9090-1] [servlet.SolrDispatchFilter] - [null:org.apache.solr.common.SolrException
  at org.apache.solr.handler.admin.CollectionsHandler.handleResponse(CollectionsHandler.java:248)
  at org.apache.solr.handler.admin.CollectionsHandler.handleResponse(CollectionsHandler.java:233)
  at org.apache.solr.handler.admin.CollectionsHandler.handleCreateAction(CollectionsHandler.java:368)
  at org.apache.solr.handler.admin.CollectionsHandler.handleRequestBody(CollectionsHandler.java:141)
  at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:135)
  at org.apache.solr.servlet.SolrDispatchFilter.handleAdminRequest(SolrDispatchFilter.java:720)
  at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:265)
  at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:205)
  at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:235)
  at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:206)
  at org.apache.catalina.core.StandardWrapperValve.invoke(StandardWrapperValve.java:233)
  at org.apache.catalina.core.StandardContextValve.invoke(StandardContextValve.java:191)
  at org.apache.catalina.core.StandardHostValve.invoke(StandardHostValve.java:127)
  at org.apache.catalina.valves.ErrorReportValve.invoke(ErrorReportValve.java:102)
  at org.apache.catalina.core.StandardEngineValve.invoke(StandardEngineValve.java:109)
  at org.apache.catalina.connector.CoyoteAdapter.service(CoyoteAdapter.java:298)
  at org.apache.coyote.http11.Http11Processor.process(Http11Processor.java:857)
  at org.apache.coyote.http11.Http11Protocol$Http11ConnectionHandler.process(Http11Protocol.java:588)
  at org.apache.tomcat.util.net.JIoEndpoint$Worker.run(JIoEndpoint.java:489)
  at java.lang.Thread.run(Thread.java:662) ]
-- SOLR http response --
<?xml version="1.0" encoding="UTF-8"?>
<response>
  <lst name="responseHeader"><int name="status">500</int><int name="QTime">210</int></lst>
  <str name="Operation createcollection caused exception:">org.apache.solr.common.SolrException:org.apache.solr.common.SolrException</str>
  <lst name="exception"><null name="msg"/><int name="rspCode">500</int></lst>
  <lst name="error"><str name="trace">org.apache.solr.common.SolrException
  at org.apache.solr.handler.admin.CollectionsHandler.handleResponse(CollectionsHandler.java:248)
  at org.apache.solr.handler.admin.CollectionsHandler.handleResponse(CollectionsHandler.java:233) at
[jira] [Commented] (SOLR-5473) Split clusterstate.json per collection and watch states selectively
[ https://issues.apache.org/jira/browse/SOLR-5473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068504#comment-14068504 ] Alan Woodward commented on SOLR-5473: - I think part of the reason this is so unwieldy is that ClusterState itself is monolithic - you call ZkStateReader.getClusterState() and it goes and gets the state of the entire cluster, and then you typically only need information for a single collection. So ClusterState needs to know about all the different state versions, which bloats it up, and then leaves you with API warts like ZkController being responsible for removing watches. What I think should really happen here is that we should add an intermediate layer, CollectionState. This has three implementations, one that is a singleton watching the master clusterstate.json, one that is a separate object with watchers for each 'external' collection, one that just directly fetches data from ZK whenever it's queried. When ZkStateReader starts up (and can we maybe move createClusterStateWatchersAndUpdate() into the constructor?) it works out which CollectionState type it needs for each collection in the cluster. Users of the API just call ZkStateReader.getCollection() and they get the right kind of CollectionState object, no need to have external knowledge of what the state version of the collection is. Having stuck my oar in here, I'm now going offline for a couple of weeks :-) Maybe this API change should be a separate issue, but I think it should be nailed down before this one is committed. 
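The three-implementation design Alan describes can be sketched in a few lines. All names below (CollectionState, getCollection, the three classes) are hypothetical, assumed for illustration only; this is not actual Solr API, just the shape of the proposal: callers ask for a collection and never see which state format backs it.

```java
// Hypothetical sketch of the proposed intermediate layer (names assumed).
interface CollectionState {
    String getName();
}

// Backed by the singleton watcher on the shared clusterstate.json.
class SharedClusterStateCollection implements CollectionState {
    private final String name;
    SharedClusterStateCollection(String name) { this.name = name; }
    public String getName() { return name; }
}

// Backed by a dedicated watcher on /collections/<name>/state.json.
class WatchedCollection implements CollectionState {
    private final String name;
    WatchedCollection(String name) { this.name = name; }
    public String getName() { return name; }
}

// No watcher: fetches state straight from ZooKeeper whenever queried.
class LazyCollection implements CollectionState {
    private final String name;
    LazyCollection(String name) { this.name = name; }
    public String getName() { return name; }
}

public class CollectionStateSketch {
    // Stand-in for ZkStateReader.getCollection(): ZkStateReader would work out
    // at startup which implementation each collection needs.
    static CollectionState getCollection(String name, boolean external, boolean watched) {
        if (!external) return new SharedClusterStateCollection(name);
        return watched ? new WatchedCollection(name) : new LazyCollection(name);
    }

    public static void main(String[] args) {
        // Callers just ask for the collection; the state-version detail is hidden.
        System.out.println(getCollection("legacy", false, false).getClass().getSimpleName());
        System.out.println(getCollection("external", true, true).getClass().getSimpleName());
    }
}
```

The design benefit is exactly the one Alan names: ClusterState no longer has to know about every state version, and ZkController no longer has to manage watch removal on behalf of callers.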
Split clusterstate.json per collection and watch states selectively Key: SOLR-5473 URL: https://issues.apache.org/jira/browse/SOLR-5473 Project: Solr Issue Type: Sub-task Components: SolrCloud Reporter: Noble Paul Assignee: Noble Paul Fix For: 5.0 Attachments: SOLR-5473-74 .patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74.patch, SOLR-5473-74_POC.patch, SOLR-5473-configname-fix.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473.patch, SOLR-5473_undo.patch, ec2-23-20-119-52_solr.log, ec2-50-16-38-73_solr.log As defined in the parent issue, store the states of each collection under /collections/collectionname/state.json node -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6248) MoreLikeThis Query Parser
[ https://issues.apache.org/jira/browse/SOLR-6248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068501#comment-14068501 ] Steve Molloy commented on SOLR-6248: Would that approach also support sending in text that isn't in the index? This is the main reason we're using the MLT handler, which we need to be distributed (thus SOLR-5480). But if we can have a single approach for both, I agree that not maintaining 2 configurations (and 2 handlers in the code) would be much better. Let me know if I can help out. MoreLikeThis Query Parser - Key: SOLR-6248 URL: https://issues.apache.org/jira/browse/SOLR-6248 Project: Solr Issue Type: New Feature Reporter: Anshum Gupta The MLT Component doesn't let people highlight/paginate, and the handler comes with a cost of maintaining another piece in the config. Also, any changes to the default /select handler (number of results to be fetched etc.) need to be copied/synced with this handler too. Having an MLT QParser would let users get back docs based on a query for them to paginate, highlight etc. It would also give them the flexibility to use this anywhere, i.e. q, fq, bq etc. A bit of history about MLT (thanks to Hoss): the MLT Handler pre-dates the existence of QParsers and was meant to take an arbitrary query as input, find docs that match that query, club them together to find interesting terms, and then use those terms as if they were my main query to generate a main result set. This result would then be used as the set to facet, highlight etc. The flow: Query -> DocList(m) -> Bag(terms) -> Query -> DocList(y) The MLT component on the other hand solved a very different purpose of augmenting the main result set. It is used to get similar docs for each of the docs in the main result set: DocSet(n) -> n * Bag(terms) -> n * Query -> n * DocList(m) The new approach: all of this can be done better and cleaner (and makes more sense too) using an MLT QParser.
An important thing to handle here is the case where the user doesn't have TermVectors, in which case it does what happens right now, i.e. parsing stored fields. Also, in case the user doesn't have a field (to be used for MLT) indexed, the field would need to be a TextField with an index analyzer defined. This analyzer will then be used to extract terms for MLT. In case of SolrCloud mode, '/get-termvectors' can be used after looking at the schema (if TermVectors are enabled for the field). If not, a /get call can be used to fetch the field and parse it.
[jira] [Created] (LUCENE-5839) Regex bug in AnalyzingQueryParser
Tim Allison created LUCENE-5839: --- Summary: Regex bug in AnalyzingQueryParser Key: LUCENE-5839 URL: https://issues.apache.org/jira/browse/LUCENE-5839 Project: Lucene - Core Issue Type: Bug Components: core/queryparser Affects Versions: 4.9 Reporter: Tim Allison Priority: Trivial Dennis Walter recently pointed out this bug on dev@lucene.apache.org: // gobble escaped chars or find a wildcard character private final Pattern wildcardPattern = Pattern.compile("(\\.)|([?*]+)"); The first group will match a literal dot (.), while its intention seems to be to match a backslash and a single character. So the expression should instead be "(\\\\.)|([?*]+)".
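The difference is easy to demonstrate with plain java.util.regex (a standalone sketch; the escaped-input string is my own example, not taken from the parser's tests):

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class WildcardRegexDemo {
    public static void main(String[] args) {
        // Buggy pattern: the Java string "(\\.)" is the regex (\.) -- a literal
        // dot, not "backslash followed by any character".
        Pattern buggy = Pattern.compile("(\\.)|([?*]+)");
        // Corrected pattern: the Java string "(\\\\.)" is the regex (\\.) --
        // a backslash followed by any single character.
        Pattern fixed = Pattern.compile("(\\\\.)|([?*]+)");

        // The user escaped the '*', so it should NOT be treated as a wildcard.
        String input = "te\\*t";

        Matcher mb = buggy.matcher(input);
        mb.find();
        System.out.println(mb.group());  // "*"  -- buggy: sees a bare wildcard

        Matcher mf = fixed.matcher(input);
        mf.find();
        System.out.println(mf.group());  // "\*" -- fixed: consumes the escape
    }
}
```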
[GitHub] lucene-solr pull request: LUCENE-5839: Fix regex in AnalyzingQuery...
GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/67 LUCENE-5839: Fix regex in AnalyzingQueryParser You can merge this pull request into a Git repository by running: $ git pull https://github.com/tballison/lucene-solr trunk Alternatively you can review and apply these changes as the patch at: https://github.com/apache/lucene-solr/pull/67.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #67 commit 1eac4382dd1ee7a4319096499335d7f7f28f526a Author: tballison talli...@mitre.org Date: 2014-07-21T13:22:38Z Fix regex in AnalyzingQueryParser --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Commented] (LUCENE-5839) Regex bug in AnalyzingQueryParser
[ https://issues.apache.org/jira/browse/LUCENE-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068511#comment-14068511 ] ASF GitHub Bot commented on LUCENE-5839: GitHub user tballison opened a pull request: https://github.com/apache/lucene-solr/pull/67 LUCENE-5839: Fix regex in AnalyzingQueryParser
[jira] [Commented] (LUCENE-5839) Regex bug in AnalyzingQueryParser
[ https://issues.apache.org/jira/browse/LUCENE-5839?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068520#comment-14068520 ] Tim Allison commented on LUCENE-5839: - Created an edge case to test this. LUCENE-5504 is a cleaner way to deal with this. While looking at the code again, I noticed that AnalyzingQueryParser is using the superclass' getXQuery. This will have the effect of lowercasing whatever happens after analysis. If someone's analyzer is upcasing, AnalyzingQueryParser will fail. I can submit another patch to fix this, but I think something along the lines of LUCENE-5504 is a better way to go.
RE: Bug in AnalyzingQueryParser Pattern
Thank you, Dennis: https://issues.apache.org/jira/i#browse/LUCENE-5839 From: Dennis Walter [mailto:dennis.wal...@gmail.com] Sent: Sunday, July 20, 2014 2:52 PM To: dev@lucene.apache.org Subject: Bug in AnalyzingQueryParser Pattern Hi there, While reading the source code of AnalyzingQueryParser to understand what it does, I think I found a bug in the regular expression used to detect wildcards. It is defined as // gobble escaped chars or find a wildcard character private final Pattern wildcardPattern = Pattern.compile("(\\.)|([?*]+)"); The first group will match a literal dot (.), while its intention seems to be to match a backslash and a single character. So the expression should instead be "(\\\\.)|([?*]+)". Best Regards Dennis
[GitHub] lucene-solr pull request: Lucene5205
Github user tballison closed the pull request at: https://github.com/apache/lucene-solr/pull/64
[jira] [Updated] (LUCENE-5836) BytesRef.copyBytes and copyChars don't oversize
[ https://issues.apache.org/jira/browse/LUCENE-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrien Grand updated LUCENE-5836: - Attachment: LUCENE-5836.patch And what if we move the StringBuilder-like methods to a different class? (See e.g. the attached patch, which doesn't compile but should give an idea of the change.) Would that work for you? BytesRef.copyBytes and copyChars don't oversize --- Key: LUCENE-5836 URL: https://issues.apache.org/jira/browse/LUCENE-5836 Project: Lucene - Core Issue Type: Bug Reporter: Adrien Grand Assignee: Adrien Grand Attachments: LUCENE-5836.patch When copying data from another BytesRef/CharSequence, these methods don't oversize. This is not an issue if a method is used only once per BytesRef instance, but I just reviewed the usage of these methods and they are very frequently used in loops to do things like: - keep track of the top values in comparators - keep track of the previous terms in various loops over a terms enum (lucene49 DV consumer, BlockTreeTermsWriter) - etc. Although unlikely, it might be possible to hit a worst case and to resize the underlying byte[] on every call to copyBytes. Should we oversize the underlying array in these methods?
[jira] [Commented] (LUCENE-5836) BytesRef.copyBytes and copyChars don't oversize
[ https://issues.apache.org/jira/browse/LUCENE-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068538#comment-14068538 ] Robert Muir commented on LUCENE-5836: - that would be a fantastic improvement IMO.
[jira] [Commented] (LUCENE-5836) BytesRef.copyBytes and copyChars don't oversize
[ https://issues.apache.org/jira/browse/LUCENE-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068539#comment-14068539 ] Adrien Grand commented on LUCENE-5836: -- OK, I'll give it a try.
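To see why oversizing matters in the copy-in-a-loop pattern Adrien describes, here is a stdlib-only sketch. The growth policy is illustrative only (it is not Lucene's actual ArrayUtil.oversize): the point is that adding headroom past the requested size turns per-copy reallocations into a small, amortized number of them.

```java
import java.util.Arrays;

public class OversizeDemo {
    // Sketch of the oversizing idea: grow by ~1/8th past the requested length
    // so that repeated growth amortizes instead of reallocating every call.
    static byte[] grow(byte[] current, int minLength) {
        if (current.length >= minLength) {
            return current;  // already big enough: no copy at all
        }
        int newLength = minLength + (minLength >>> 3);  // ~12.5% headroom
        return Arrays.copyOf(current, newLength);
    }

    public static void main(String[] args) {
        byte[] buf = new byte[0];
        int resizes = 0;
        // Simulate copying progressively longer terms into the same buffer,
        // as a comparator tracking the current top value would.
        for (int len = 1; len <= 10_000; len++) {
            byte[] grown = grow(buf, len);
            if (grown != buf) {
                resizes++;
            }
            buf = grown;
        }
        // Exact-size allocation would reallocate on every one of the 10,000
        // iterations; with headroom, only a small fraction trigger a copy.
        System.out.println(resizes);
    }
}
```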
[jira] [Commented] (SOLR-5473) Split clusterstate.json per collection and watch states selectively
[ https://issues.apache.org/jira/browse/SOLR-5473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068548#comment-14068548 ] Noble Paul commented on SOLR-5473: -- [~romseygeek] Is this comment made after seeing the latest patch or is it based on the old one? I have implemented it in more or less the way you suggested.
Re: SolrCloud on HDFS empty tlog hence doesn't replay after Solr process crash and restart
Any thoughts about this issue? Solr on HDFS generates empty tlog files when documents are added without a commit. Thanks, Tom
On Fri, Jul 18, 2014 at 12:21 PM, Tom Chen tomchen1...@gmail.com wrote: Hi, This seems to be a bug for Solr running on HDFS. Reproduce steps:
1) Set up Solr to run on HDFS like this: java -Dsolr.directoryFactory=HdfsDirectoryFactory -Dsolr.lock.type=hdfs -Dsolr.hdfs.home=hdfs://host:port/path For the purpose of this testing, turn off the default auto commit in solrconfig.xml, i.e. comment out autoCommit like this:
<!--
<autoCommit>
<maxTime>${solr.autoCommit.maxTime:15000}</maxTime>
<openSearcher>false</openSearcher>
</autoCommit>
-->
2) Add a document without commit: curl "http://localhost:8983/solr/collection1/update?commit=false" -H "Content-type:text/xml; charset=utf-8" --data-binary @solr.xml
3) Solr generates empty tlog files (0 file size; the last one ends with 6):
[hadoop@hdtest042 exampledocs]$ hadoop fs -ls /path/collection1/core_node1/data/tlog
Found 5 items
-rw-r--r-- 1 hadoop hadoop 667 2014-07-18 08:47 /path/collection1/core_node1/data/tlog/tlog.001
-rw-r--r-- 1 hadoop hadoop 67 2014-07-18 08:47 /path/collection1/core_node1/data/tlog/tlog.003
-rw-r--r-- 1 hadoop hadoop 667 2014-07-18 08:47 /path/collection1/core_node1/data/tlog/tlog.004
-rw-r--r-- 1 hadoop hadoop 0 2014-07-18 09:02 /path/collection1/core_node1/data/tlog/tlog.005
-rw-r--r-- 1 hadoop hadoop 0 2014-07-18 09:02 /path/collection1/core_node1/data/tlog/tlog.006
4) Simulate a Solr crash by killing the process with the -9 option.
5) Restart the Solr process. The observation is that uncommitted documents are not replayed and files in the tlog directory are cleaned up; hence the uncommitted document(s) are lost. Am I missing anything, or is this a bug? BTW, additional observations: a) If in step 4) Solr is stopped gracefully (i.e. without the -9 option), a non-empty tlog file is generated and after restarting Solr the uncommitted document is replayed as expected. b) If Solr doesn't run on HDFS (i.e. on the local file system), this issue is not observed either. Thanks, Tom
[jira] [Commented] (SOLR-3619) Rename 'example' dir to 'server' and pull examples into an 'examples' directory
[ https://issues.apache.org/jira/browse/SOLR-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14068591#comment-14068591 ] Mark Miller commented on SOLR-3619: --- bq. Maybe it would be good to repeat that exercise and drop the discussion down to the lower level. No, please no! Let's just talk about a server directory. Let's not screw this issue by trying to do more than that. Been there. That can come later. bq. I am trying to understand what will be the new first command for a new user instead of java -jar start.jar. No! :) This issue doesn't change that. If we let this issue get bogged down with all that shit, it's dead just like last time and just like in other issues. bq. Rename 'example' dir to 'server' and pull examples into an 'examples' directory This doesn't require crazy new layouts! This doesn't require removing collection1! This doesn't require a new way to start Solr! Let's focus on this first. Rename 'example' dir to 'server' and pull examples into an 'examples' directory --- Key: SOLR-3619 URL: https://issues.apache.org/jira/browse/SOLR-3619 Project: Solr Issue Type: Improvement Reporter: Mark Miller Fix For: 4.9, 5.0 Attachments: SOLR-3619.patch, server-name-layout.png
[jira] [Commented] (LUCENE-5205) [PATCH] SpanQueryParser with recursion, analysis and syntax very similar to classic QueryParser
[ https://issues.apache.org/jira/browse/LUCENE-5205?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068596#comment-14068596 ] Tim Allison commented on LUCENE-5205: - Agreed on way ahead. Looks like I forgot to commit the few manual changes: BasicAutomata-Automata. I closed the broken pull request. Let me know if you have luck now. Sorry about that! [PATCH] SpanQueryParser with recursion, analysis and syntax very similar to classic QueryParser --- Key: LUCENE-5205 URL: https://issues.apache.org/jira/browse/LUCENE-5205 Project: Lucene - Core Issue Type: Improvement Components: core/queryparser Reporter: Tim Allison Labels: patch Fix For: 4.9 Attachments: LUCENE-5205-cleanup-tests.patch, LUCENE-5205-date-pkg-prvt.patch, LUCENE-5205.patch.gz, LUCENE-5205.patch.gz, LUCENE-5205_dateTestReInitPkgPrvt.patch, LUCENE-5205_improve_stop_word_handling.patch, LUCENE-5205_smallTestMods.patch, LUCENE_5205.patch, SpanQueryParser_v1.patch.gz, patch.txt This parser extends QueryParserBase and includes functionality from: * Classic QueryParser: most of its syntax * SurroundQueryParser: recursive parsing for near and not clauses. * ComplexPhraseQueryParser: can handle near queries that include multiterms (wildcard, fuzzy, regex, prefix), * AnalyzingQueryParser: has an option to analyze multiterms. At a high level, there's a first pass BooleanQuery/field parser and then a span query parser handles all terminal nodes and phrases. Same as classic syntax: * term: test * fuzzy: roam~0.8, roam~2 * wildcard: te?t, test*, t*st * regex: /\[mb\]oat/ * phrase: jakarta apache * phrase with slop: jakarta apache~3 * default or clause: jakarta apache * grouping or clause: (jakarta apache) * boolean and +/-: (lucene OR apache) NOT jakarta; +lucene +apache -jakarta * multiple fields: title:lucene author:hatcher Main additions in SpanQueryParser syntax vs. 
classic syntax: * Can require in order for phrases with slop with the \~ operator: jakarta apache\~3 * Can specify not near: fever bieber!\~3,10 :: find fever but not if bieber appears within 3 words before or 10 words after it. * Fully recursive phrasal queries with \[ and \]; as in: \[\[jakarta apache\]~3 lucene\]\~4 :: find jakarta within 3 words of apache, and that hit has to be within four words before lucene * Can also use \[\] for single level phrasal queries instead of " " as in: \[jakarta apache\] * Can use or grouping clauses in phrasal queries: apache (lucene solr)\~3 :: find apache and then either lucene or solr within three words. * Can use multiterms in phrasal queries: jakarta\~1 ap*che\~2 * Did I mention full recursion: \[\[jakarta\~1 ap*che\]\~2 (solr~ /l\[ou\]\+\[cs\]\[en\]\+/)]\~10 :: Find something like jakarta within two words of ap*che and that hit has to be within ten words of something like solr or that lucene regex. * Can require at least x number of hits at boolean level: apache AND (lucene solr tika)~2 * Can use negative only query: -jakarta :: Find all docs that don't contain jakarta * Can use an edit distance 2 for fuzzy query via SlowFuzzyQuery (beware of potential performance issues!). Trivial additions: * Can specify prefix length in fuzzy queries: jakarta~1,2 (edit distance =1, prefix =2) * Can specify Optimal String Alignment (OSA) vs Levenshtein for distance =2: jakarta~1 (OSA) vs jakarta~1 (Levenshtein) This parser can be very useful for concordance tasks (see also LUCENE-5317 and LUCENE-5318) and for analytical search. Until LUCENE-2878 is closed, this might have a use for fans of SpanQuery. Most of the documentation is in the javadoc for SpanQueryParser. Any and all feedback is welcome. Thank you.
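The "not near" rule (fever bieber!\~3,10) can be sketched in plain Java to make the asymmetric window concrete. This is my own illustration of the described semantics over a token list, not code from the patch:

```java
import java.util.Arrays;
import java.util.List;

public class NotNearDemo {
    // A hit on `hit` is rejected if `notNear` occurs within `pre` tokens
    // before it or `post` tokens after it; the query matches if at least
    // one hit survives.
    static boolean matches(List<String> tokens, String hit, String notNear, int pre, int post) {
        for (int i = 0; i < tokens.size(); i++) {
            if (!tokens.get(i).equals(hit)) continue;
            boolean blocked = false;
            for (int j = Math.max(0, i - pre); j <= Math.min(tokens.size() - 1, i + post); j++) {
                if (tokens.get(j).equals(notNear)) { blocked = true; break; }
            }
            if (!blocked) return true;  // found an unblocked hit
        }
        return false;
    }

    public static void main(String[] args) {
        List<String> doc1 = Arrays.asList("bieber", "fever", "is", "catching");
        List<String> doc2 = Arrays.asList("spring", "fever", "is", "catching");
        // "bieber" one word before "fever" blocks the hit:
        System.out.println(matches(doc1, "fever", "bieber", 3, 10));  // false
        System.out.println(matches(doc2, "fever", "bieber", 3, 10));  // true
    }
}
```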
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068645#comment-14068645 ] Mark Miller commented on SOLR-6260: --- There is nothing personal in anything I wrote. Feel free to pull out what you think is and I'll correct it. Reasonable people can disagree, and the same culture that had Lucene locked up for years in the mid 2000's can lock up Solr here. We need a couple of things: * To transition to a SolrCloud-first world where everything is sensible and not because of history baggage. * To encourage and develop new contributors. Pushing back on this simple issue and issues like it is part of an overall culture that hurts both those things. When I joined, I wanted to fix crap like this too. I was slowly beaten down over time until I just accepted that a lot of our code was crap. I stopped even trying to fix it. I won't be part of beating that out of a new generation of committers. This needs to be fixed. The benefits are bigger than you understand IMO. And the benefits of not doing it are super minor. Solr needs to worry about the future, not the past. Rename DirectUpdateHandler2 --- Key: SOLR-6260 URL: https://issues.apache.org/jira/browse/SOLR-6260 Project: Solr Issue Type: Improvement Affects Versions: 5.0 Reporter: Tomás Fernández Löbbe Priority: Minor Attachments: SOLR-6260.patch, SOLR-6260.patch DirectUpdateHandler was removed, I think in Solr 4. DirectUpdateHandler2 should be renamed, at least remove that 2. I don't know really what direct means here. Maybe it could be renamed to DefaultUpdateHandler, or UpdateHandlerDefaultImpl, or other good suggestions -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068652#comment-14068652 ] Mark Miller commented on SOLR-6260: --- If we cannot come to consensus on this issue, I think we should put it to a vote. Rename DirectUpdateHandler2 --- Key: SOLR-6260 URL: https://issues.apache.org/jira/browse/SOLR-6260 Project: Solr Issue Type: Improvement Affects Versions: 5.0 Reporter: Tomás Fernández Löbbe Priority: Minor Attachments: SOLR-6260.patch, SOLR-6260.patch DirectUpdateHandler was removed, I think in Solr 4. DirectUpdateHandler2 should be renamed, at least remove that 2. I don't know really what direct means here. Maybe it could be renamed to DefaultUpdateHandler, or UpdateHandlerDefaultImpl, or other good suggestions -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-5836) BytesRef.copyBytes and copyChars don't oversize
[ https://issues.apache.org/jira/browse/LUCENE-5836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068664#comment-14068664 ] Michael McCandless commented on LUCENE-5836: +1 for BytesRefBuilder! BytesRef.copyBytes and copyChars don't oversize --- Key: LUCENE-5836 URL: https://issues.apache.org/jira/browse/LUCENE-5836 Project: Lucene - Core Issue Type: Bug Reporter: Adrien Grand Assignee: Adrien Grand Attachments: LUCENE-5836.patch When copying data from another BytesRef/CharSequence, these methods don't oversize. This is not an issue if this method is used only once per BytesRef instance but I just reviewed the usage of these methods and they are very frequently used in loops to do things like: - keep track of the top values in comparators - keep track of the previous terms in various loops over a terms enum (lucene49 DV consumer, BlockTreeTermsWriter) - etc. Although unlikely, it might be possible to hit a worst-case and to resize the underlying byte[] on every call to copyBytes? Should we oversize the underlying array in these methods? -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
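The oversizing being proposed can be sketched in isolation. This is a simplified stand-in, not Lucene's code: Lucene's ArrayUtil.oversize grows by roughly 1/8 with pointer-size alignment, whereas the 50% growth factor and the class name here are assumptions chosen purely for illustration of the amortization effect.

```java
// Illustrative sketch (not Lucene's actual code) of oversizing on copy:
// instead of allocating exactly the requested length on every copy,
// grow the backing array past the minimum so repeated copyBytes calls
// in a loop rarely reallocate.
public class OversizeDemo {
    byte[] bytes = new byte[0];
    int length;
    int allocations; // counts reallocations, for illustration only

    // grow 50% past the requested minimum (assumed policy; Lucene's
    // ArrayUtil.oversize uses a different one)
    static int oversize(int minSize) {
        return minSize + (minSize >>> 1);
    }

    void copyBytes(byte[] other, int otherLen) {
        if (bytes.length < otherLen) {
            bytes = new byte[oversize(otherLen)];
            allocations++;
        }
        System.arraycopy(other, 0, bytes, 0, otherLen);
        length = otherLen;
    }

    // copy progressively longer values, as a comparator tracking top
    // values might; returns how many reallocations that cost
    static int allocationsFor(int copies) {
        OversizeDemo ref = new OversizeDemo();
        for (int len = 1; len <= copies; len++) {
            ref.copyBytes(new byte[len], len);
        }
        return ref.allocations;
    }

    public static void main(String[] args) {
        // without oversizing, 100 growing copies mean 100 allocations;
        // with it, only a handful
        System.out.println(OversizeDemo.allocationsFor(100));
    }
}
```

The same buffer-reuse idea is what the BytesRefBuilder suggestion in the comment above formalizes: the builder owns a growable array, and callers stop reasoning about allocation at all.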
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068669#comment-14068669 ] Noble Paul commented on SOLR-6260: -- We should not hang on to an old name because it was there for some time +1 to make a name change Rename DirectUpdateHandler2 --- Key: SOLR-6260 URL: https://issues.apache.org/jira/browse/SOLR-6260 Project: Solr Issue Type: Improvement Affects Versions: 5.0 Reporter: Tomás Fernández Löbbe Priority: Minor Attachments: SOLR-6260.patch, SOLR-6260.patch DirectUpdateHandler was removed, I think in Solr 4. DirectUpdateHandler2 should be renamed, at least remove that 2. I don't know really what direct means here. Maybe it could be renamed to DefaultUpdateHandler, or UpdateHandlerDefaultImpl, or other good suggestions -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_05) - Build # 10820 - Failure!
Ah ... this appears to be a slight variation on LUCENE-5788. : Date: Tue, 15 Jul 2014 09:01:42 -0700 (MST) : From: Chris Hostetter hossman_luc...@fucit.org : To: Lucene Dev dev@lucene.apache.org : Subject: Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_05) - Build # : 10820 - Failure! : : : FYI: Seed reproduces for me on my 64bit linux java7... : : ant test -Dtestcase=TestFieldCacheSort : -Dtests.method=testEmptyStringVsNullStringSort : -Dtests.seed=A82FCB68C76B741B -Dtests.multiplier=3 -Dtests.slow=true : -Dtests.locale=ru -Dtests.timezone=Africa/Lusaka : -Dtests.file.encoding=UTF-8 : : : The failing assert is assert fi != null : : : public SortedDocValues getSortedDocValues(String field) throws IOException { : SortedDocValues dv = super.getSortedDocValues(field); : FieldInfo fi = getFieldInfos().fieldInfo(field); : if (dv != null) { : assert fi != null; : assert fi.getDocValuesType() == FieldInfo.DocValuesType.SORTED; : return new AssertingSortedDocValues(dv, maxDoc()); : } else { : assert fi == null || fi.getDocValuesType() != FieldInfo.DocValuesType.SORTED; : return null; : } : } : : : : : : : Date: Tue, 15 Jul 2014 05:36:03 + (UTC) : : From: Policeman Jenkins Server jenk...@thetaphi.de : : Reply-To: dev@lucene.apache.org : : To: dev@lucene.apache.org : : Subject: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_05) - Build # 10820 : : - Failure! : : : : Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10820/ : : Java: 32bit/jdk1.8.0_05 -client -XX:+UseParallelGC : : : : 1 tests failed. 
: : REGRESSION: org.apache.lucene.uninverting.TestFieldCacheSort.testEmptyStringVsNullStringSort : : : : Error Message: : : : : : : Stack Trace: : : java.lang.AssertionError : : at __randomizedtesting.SeedInfo.seed([A82FCB68C76B741B:C9CCF9BFBF0A72AB]:0) : : at org.apache.lucene.index.AssertingAtomicReader.getSortedDocValues(AssertingAtomicReader.java:638) : : at org.apache.lucene.index.FilterAtomicReader.getSortedDocValues(FilterAtomicReader.java:414) : : at org.apache.lucene.index.AssertingAtomicReader.getSortedDocValues(AssertingAtomicReader.java:635) : : at org.apache.lucene.index.DocValues.getSorted(DocValues.java:273) : : at org.apache.lucene.search.FieldComparator$TermOrdValComparator.getSortedDocValues(FieldComparator.java:821) : : at org.apache.lucene.search.FieldComparator$TermOrdValComparator.setNextReader(FieldComparator.java:826) : : at org.apache.lucene.search.TopFieldCollector$OneComparatorNonScoringCollector.doSetNextReader(TopFieldCollector.java:97) : : at org.apache.lucene.search.SimpleCollector.getLeafCollector(SimpleCollector.java:33) : : at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:605) : : at org.apache.lucene.search.AssertingIndexSearcher.search(AssertingIndexSearcher.java:94) : : at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:573) : : at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:525) : : at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:502) : : at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:318) : : at org.apache.lucene.uninverting.TestFieldCacheSort.testEmptyStringVsNullStringSort(TestFieldCacheSort.java:1029) : : at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) : : at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62) : : at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) : : at java.lang.reflect.Method.invoke(Method.java:483) : : at 
com.carrotsearch.randomizedtesting.RandomizedRunner.invoke(RandomizedRunner.java:1618) : : at com.carrotsearch.randomizedtesting.RandomizedRunner$6.evaluate(RandomizedRunner.java:827) : : at com.carrotsearch.randomizedtesting.RandomizedRunner$7.evaluate(RandomizedRunner.java:863) : : at com.carrotsearch.randomizedtesting.RandomizedRunner$8.evaluate(RandomizedRunner.java:877) : : at org.apache.lucene.util.TestRuleSetupTeardownChained$1.evaluate(TestRuleSetupTeardownChained.java:50) : : at org.apache.lucene.util.AbstractBeforeAfterRule$1.evaluate(AbstractBeforeAfterRule.java:46) : : at com.carrotsearch.randomizedtesting.rules.SystemPropertiesInvariantRule$1.evaluate(SystemPropertiesInvariantRule.java:55) : : at org.apache.lucene.util.TestRuleThreadAndTestName$1.evaluate(TestRuleThreadAndTestName.java:49) : : at org.apache.lucene.util.TestRuleIgnoreAfterMaxFailures$1.evaluate(TestRuleIgnoreAfterMaxFailures.java:65) : : at org.apache.lucene.util.TestRuleMarkFailure$1.evaluate(TestRuleMarkFailure.java:48) : : at com.carrotsearch.randomizedtesting.rules.StatementAdapter.evaluate(StatementAdapter.java:36) : : at com.carrotsearch.randomizedtesting.ThreadLeakControl$StatementRunner.run(ThreadLeakControl.java:365) : :
[jira] [Commented] (SOLR-3345) BaseDistributedSearchTestCase should always ignore QTime
[ https://issues.apache.org/jira/browse/SOLR-3345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068697#comment-14068697 ] Mark Miller commented on SOLR-3345: --- Could you put up an SVN patch Vamsee? I'm having a tough time getting this to apply to an SVN checkout properly. BaseDistributedSearchTestCase should always ignore QTime Key: SOLR-3345 URL: https://issues.apache.org/jira/browse/SOLR-3345 Project: Solr Issue Type: Bug Components: SolrCloud Affects Versions: 4.0-ALPHA Reporter: Benson Margulies Assignee: Mark Miller Attachments: SOLR-3345.patch The existing subclasses of BaseDistributedSearchTestCase all skip QTime. I can't see any way in which those numbers will ever match. Why not make this the default, or only, behavior? (This is really a question, in that I will provide a patch if no one tells me that it is a bad idea.) -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-3619) Rename 'example' dir to 'server' and pull examples into an 'examples' directory
[ https://issues.apache.org/jira/browse/SOLR-3619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068711#comment-14068711 ] Mark Miller commented on SOLR-3619: --- {quote}I do have lots of comments and ideas on general UX/experience improvement but it can wait for a personal project, another JIRA or an evening huddle at the conference.{quote} Start filing JIRAs and linking them as related to this :) We have a lot to do. The problem is, every issue can be contentious depending on who gets involved. I think to start, we will have to go one step at a time. Like here: let's see what we have to do to get a server folder while making as few other changes as possible. Even if it can only go into 5x. We can figure out how to do that with minimal disruption I think. And that gives a toehold to start pushing from. In the meantime, let's discuss all these other ideas in related issues. We should discuss them, it's just dangerous to keep weighing this issue down with that discussion - that type of thing can put an issue on ice. Some JIRAs are already out there, like adding start scripts, and I think removing collection1, etc. Rename 'example' dir to 'server' and pull examples into an 'examples' directory --- Key: SOLR-3619 URL: https://issues.apache.org/jira/browse/SOLR-3619 Project: Solr Issue Type: Improvement Reporter: Mark Miller Fix For: 4.9, 5.0 Attachments: SOLR-3619.patch, server-name-layout.png -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
Re: SolrCloud on HDFS empty tlog hence doesn't replay after Solr process crash and restart
It’s on my list to investigate. -- Mark Miller about.me/markrmiller On July 21, 2014 at 10:26:09 AM, Tom Chen (tomchen1...@gmail.com) wrote: Any thought about this issue: Solr on HDFS generates empty tlog when adding documents without commit. Thanks, Tom On Fri, Jul 18, 2014 at 12:21 PM, Tom Chen wrote: Hi, This seems to be a bug for Solr running on HDFS. Reproduce steps: 1) Set up Solr to run on HDFS like this: java -Dsolr.directoryFactory=HdfsDirectoryFactory -Dsolr.lock.type=hdfs -Dsolr.hdfs.home=hdfs://host:port/path For the purpose of this testing, turn off the default auto commit in solrconfig.xml, i.e. comment out autoCommit like this: 2) Add a document without commit: curl "http://localhost:8983/solr/collection1/update?commit=false" -H "Content-type:text/xml; charset=utf-8" --data-binary @solr.xml 3) Solr generates empty tlog files (0 file size, the last one ends with 6): [hadoop@hdtest042 exampledocs]$ hadoop fs -ls /path/collection1/core_node1/data/tlog Found 5 items -rw-r--r-- 1 hadoop hadoop 667 2014-07-18 08:47 /path/collection1/core_node1/data/tlog/tlog.001 -rw-r--r-- 1 hadoop hadoop 67 2014-07-18 08:47 /path/collection1/core_node1/data/tlog/tlog.003 -rw-r--r-- 1 hadoop hadoop 667 2014-07-18 08:47 /path/collection1/core_node1/data/tlog/tlog.004 -rw-r--r-- 1 hadoop hadoop 0 2014-07-18 09:02 /path/collection1/core_node1/data/tlog/tlog.005 -rw-r--r-- 1 hadoop hadoop 0 2014-07-18 09:02 /path/collection1/core_node1/data/tlog/tlog.006 4) Simulate Solr crash by killing the process with -9 option. 5) Restart the Solr process. Observation is that uncommitted documents are not replayed, files in tlog directory are cleaned up. Hence uncommitted document(s) are lost. Am I missing anything, or is this a bug? BTW, additional observations: a) If in step 4) Solr is stopped gracefully (i.e. without -9 option), non-empty tlog file is generated and after re-starting Solr, uncommitted document is replayed as expected. b) If Solr doesn't run on HDFS (i.e. 
on local file system), this issue is not observed either. Thanks, Tom - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-5596) OverseerTest.testOverseerFailure - leader node already exists.
[ https://issues.apache.org/jira/browse/SOLR-5596?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068747#comment-14068747 ] Mark Miller commented on SOLR-5596: --- Yeah, I think this is the same result as when I tried to remove the forceSync - still happens: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Windows/4201/ OverseerTest.testOverseerFailure - leader node already exists. -- Key: SOLR-5596 URL: https://issues.apache.org/jira/browse/SOLR-5596 Project: Solr Issue Type: Bug Reporter: Mark Miller Assignee: Shalin Shekhar Mangar Fix For: 4.9, 5.0 Seeing this a bunch on jenkins - previous leader ephemeral node is still around for some reason. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6252) Simplify UnInvertedField#getUnInvertedField synchronization module
[ https://issues.apache.org/jira/browse/SOLR-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068762#comment-14068762 ] Mark Miller commented on SOLR-6252: --- Yeah, sounds reasonable to me. Can you take one more pass [~vamsee]? Simplify UnInvertedField#getUnInvertedField synchronization module -- Key: SOLR-6252 URL: https://issues.apache.org/jira/browse/SOLR-6252 Project: Solr Issue Type: Improvement Components: search Affects Versions: 5.0 Reporter: Vamsee Yarlagadda Assignee: Mark Miller Priority: Minor Attachments: SOLR-6252.patch, SOLR-6252v2.patch Looks like UnInvertedField#getUnInvertedField has implemented a bit additional synchronization module rather than what is required, and thereby increasing the complexity. https://github.com/apache/lucene-solr/blob/trunk/solr/core/src/java/org/apache/solr/request/UnInvertedField.java#L667 As pointed out in the above link, as the synchronization is performed on the cache variable(which itself will protect the threads from obtaining access to the cache), we can safely remove all the placeholder flags. As long as cache.get() is in synchronized block, we can simply populate the cache with new entries and other threads will be able to see the changes. This change has been introduced in https://issues.apache.org/jira/browse/SOLR-2548 (Multithreaded faceting) -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6252) Simplify UnInvertedField#getUnInvertedField synchronization module
[ https://issues.apache.org/jira/browse/SOLR-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068795#comment-14068795 ] Gregory Chanan commented on SOLR-6252: -- Also, the synchronized lock is held when checking the placeholder, when it really only needs to be held for the get/put. Simplify UnInvertedField#getUnInvertedField synchronization module -- Key: SOLR-6252 URL: https://issues.apache.org/jira/browse/SOLR-6252 Project: Solr Issue Type: Improvement Components: search Affects Versions: 5.0 Reporter: Vamsee Yarlagadda Assignee: Mark Miller Priority: Minor Attachments: SOLR-6252.patch, SOLR-6252v2.patch Looks like UnInvertedField#getUnInvertedField has implemented a bit additional synchronization module rather than what is required, and thereby increasing the complexity. https://github.com/apache/lucene-solr/blob/trunk/solr/core/src/java/org/apache/solr/request/UnInvertedField.java#L667 As pointed out in the above link, as the synchronization is performed on the cache variable(which itself will protect the threads from obtaining access to the cache), we can safely remove all the placeholder flags. As long as cache.get() is in synchronized block, we can simply populate the cache with new entries and other threads will be able to see the changes. This change has been introduced in https://issues.apache.org/jira/browse/SOLR-2548 (Multithreaded faceting) -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
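The pattern being debated can be sketched abstractly. This is not Solr's UnInvertedField code — the class name, the Supplier-based builder, and the single-map monitor are assumptions — but it shows what the placeholder buys: the map itself is the only lock, a placeholder claims the slot so concurrent readers wait instead of building twice, the expensive build runs with the lock released, and (per the comment above) waiters must be notified when the real value replaces the placeholder, which is exactly the subtle part.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Supplier;

// Illustrative sketch (not Solr's actual code) of the cache-plus-
// placeholder pattern discussed above: the map is the only monitor,
// a placeholder marks an in-progress build so other threads wait
// rather than duplicating work, and the expensive computation runs
// outside the synchronized block.
public class PlaceholderCache<K, V> {
    private static final Object PLACEHOLDER = new Object();
    private final Map<K, Object> cache = new HashMap<>();

    @SuppressWarnings("unchecked")
    public V get(K key, Supplier<V> builder) {
        synchronized (cache) {
            Object v = cache.get(key);
            while (v == PLACEHOLDER) {
                try {
                    cache.wait(); // another thread is building this entry
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    throw new RuntimeException(e);
                }
                v = cache.get(key);
            }
            if (v != null) {
                return (V) v; // cache hit
            }
            cache.put(key, PLACEHOLDER); // claim the slot
        }
        V built = builder.get(); // expensive work, lock not held
        synchronized (cache) {
            cache.put(key, built);
            cache.notifyAll(); // wake threads waiting on the placeholder
        }
        return built;
    }
}
```

Dropping the placeholder entirely, as the issue proposes, trades the duplicate-work protection for simplicity; the sketch makes visible what would be given up.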
[jira] [Commented] (LUCENE-5838) hunspell buggy with over 64k affixes
[ https://issues.apache.org/jira/browse/LUCENE-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068799#comment-14068799 ] ASF subversion and git services commented on LUCENE-5838: - Commit 1612349 from [~rcmuir] in branch 'dev/trunk' [ https://svn.apache.org/r1612349 ] LUCENE-5838: fix hunspell when .aff file has over 64k affixes hunspell buggy with over 64k affixes Key: LUCENE-5838 URL: https://issues.apache.org/jira/browse/LUCENE-5838 Project: Lucene - Core Issue Type: Bug Reporter: Robert Muir Attachments: LUCENE-5838.patch, LUCENE-5838.patch currently we build TreeMap<String,List<Character>> in ram, to sort before adding to the FST (which encodes the list as IntsRef). char overflows here if there are more than 64k affixes (e.g. basque). -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
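The overflow described in the issue is easy to demonstrate in isolation. The class and method names below are illustrative, not from the patch: a Java char holds only 16 bits, so any ordinal past 65535 silently wraps and collides with a lower one.

```java
// Minimal demonstration of the overflow described above: a char holds
// only 16 bits, so storing an affix ordinal in a char is lossy once
// more than 64k affixes exist (as with Basque dictionaries).
public class CharOverflowDemo {
    static int roundTrip(int affixOrdinal) {
        char c = (char) affixOrdinal; // silent truncation past 0xFFFF
        return c;
    }

    public static void main(String[] args) {
        System.out.println(roundTrip(65535)); // last safe ordinal: 65535
        System.out.println(roundTrip(65536)); // wraps around to 0
    }
}
```

The fix in the commit avoids the char-sized bottleneck; the sketch only shows why 64k is the magic threshold.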
[jira] [Comment Edited] (SOLR-6252) Simplify UnInvertedField#getUnInvertedField synchronization module
[ https://issues.apache.org/jira/browse/SOLR-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068795#comment-14068795 ] Gregory Chanan edited comment on SOLR-6252 at 7/21/14 5:13 PM: --- Also, the synchronized lock is held when checking the placeholder, when it really only needs to be held for the get/put. (it's subtle to get that correct though, because the non-placeholder needs to be notified correctly, so probably not worth doing.) was (Author: gchanan): Also, the synchronized lock is held when checking the placeholder, when it really only needs to be held for the get/put. Simplify UnInvertedField#getUnInvertedField synchronization module -- Key: SOLR-6252 URL: https://issues.apache.org/jira/browse/SOLR-6252 Project: Solr Issue Type: Improvement Components: search Affects Versions: 5.0 Reporter: Vamsee Yarlagadda Assignee: Mark Miller Priority: Minor Attachments: SOLR-6252.patch, SOLR-6252v2.patch Looks like UnInvertedField#getUnInvertedField has implemented a bit additional synchronization module rather than what is required, and thereby increasing the complexity. https://github.com/apache/lucene-solr/blob/trunk/solr/core/src/java/org/apache/solr/request/UnInvertedField.java#L667 As pointed out in the above link, as the synchronization is performed on the cache variable(which itself will protect the threads from obtaining access to the cache), we can safely remove all the placeholder flags. As long as cache.get() is in synchronized block, we can simply populate the cache with new entries and other threads will be able to see the changes. This change has been introduced in https://issues.apache.org/jira/browse/SOLR-2548 (Multithreaded faceting) -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Created] (SOLR-6263) Add handler name accessibility
Erik Hatcher created SOLR-6263: -- Summary: Add handler name accessibility Key: SOLR-6263 URL: https://issues.apache.org/jira/browse/SOLR-6263 Project: Solr Issue Type: Improvement Components: contrib - DataImportHandler Affects Versions: 4.9 Reporter: Erik Hatcher Fix For: 5.0, 4.10 Currently the DIH handler name is buried and private/package accessible. It's useful to allow plugins access to it though. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (LUCENE-5838) hunspell buggy with over 64k affixes
[ https://issues.apache.org/jira/browse/LUCENE-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068824#comment-14068824 ] ASF subversion and git services commented on LUCENE-5838: - Commit 1612351 from [~rcmuir] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1612351 ] LUCENE-5838: fix hunspell when .aff file has over 64k affixes hunspell buggy with over 64k affixes Key: LUCENE-5838 URL: https://issues.apache.org/jira/browse/LUCENE-5838 Project: Lucene - Core Issue Type: Bug Reporter: Robert Muir Fix For: 5.0, 4.10 Attachments: LUCENE-5838.patch, LUCENE-5838.patch currently we build TreeMap<String,List<Character>> in ram, to sort before adding to the FST (which encodes the list as IntsRef). char overflows here if there are more than 64k affixes (e.g. basque). -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Resolved] (LUCENE-5838) hunspell buggy with over 64k affixes
[ https://issues.apache.org/jira/browse/LUCENE-5838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir resolved LUCENE-5838. - Resolution: Fixed Fix Version/s: 4.10 5.0 hunspell buggy with over 64k affixes Key: LUCENE-5838 URL: https://issues.apache.org/jira/browse/LUCENE-5838 Project: Lucene - Core Issue Type: Bug Reporter: Robert Muir Fix For: 5.0, 4.10 Attachments: LUCENE-5838.patch, LUCENE-5838.patch currently we build TreeMap<String,List<Character>> in ram, to sort before adding to the FST (which encodes the list as IntsRef). char overflows here if there are more than 64k affixes (e.g. basque). -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6263) Add handler name accessibility
[ https://issues.apache.org/jira/browse/SOLR-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068828#comment-14068828 ] ASF subversion and git services commented on SOLR-6263: --- Commit 1612353 from [~ehatcher] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1612353 ] SOLR-6263: Add DIH handler name to variable resolver as ${dih.handlerName} Add handler name accessibility -- Key: SOLR-6263 URL: https://issues.apache.org/jira/browse/SOLR-6263 Project: Solr Issue Type: Improvement Components: contrib - DataImportHandler Affects Versions: 4.9 Reporter: Erik Hatcher Fix For: 5.0, 4.10 Currently the DIH handler name is buried and private/package accessible. It's useful to allow plugins access to it though. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6263) Add handler name accessibility
[ https://issues.apache.org/jira/browse/SOLR-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14068839#comment-14068839 ] ASF subversion and git services commented on SOLR-6263: --- Commit 1612354 from [~ehatcher] in branch 'dev/trunk' [ https://svn.apache.org/r1612354 ] SOLR-6263: Add DIH handler name to variable resolver as ${dih.handlerName} Add handler name accessibility -- Key: SOLR-6263 URL: https://issues.apache.org/jira/browse/SOLR-6263 Project: Solr Issue Type: Improvement Components: contrib - DataImportHandler Affects Versions: 4.9 Reporter: Erik Hatcher Fix For: 5.0, 4.10 Currently the DIH handler name is buried and private/package accessible. It's useful to allow plugins access to it though. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Resolved] (SOLR-6263) Add handler name accessibility
[ https://issues.apache.org/jira/browse/SOLR-6263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Erik Hatcher resolved SOLR-6263. Resolution: Fixed Add handler name accessibility -- Key: SOLR-6263 URL: https://issues.apache.org/jira/browse/SOLR-6263 Project: Solr Issue Type: Improvement Components: contrib - DataImportHandler Affects Versions: 4.9 Reporter: Erik Hatcher Fix For: 5.0, 4.10 Currently the DIH handler name is buried and private/package accessible. It's useful to allow plugins access to it though. -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069029#comment-14069029 ] Yonik Seeley commented on SOLR-6260: -1 The benefits to this simple class rename are blown way out of proportion by fallaciously linking it to real improvements. Further, if you're worried about attracting new committers, fostering a more civil (and less aggressive and combative) environment would go a long way. Although there are multiple minor downsides (countering the minor upside), I'll remove my -1 if we can at least implement back compat with existing solrconfigs. It should be trivial. Rename DirectUpdateHandler2 --- Key: SOLR-6260 URL: https://issues.apache.org/jira/browse/SOLR-6260 Project: Solr Issue Type: Improvement Affects Versions: 5.0 Reporter: Tomás Fernández Löbbe Priority: Minor Attachments: SOLR-6260.patch, SOLR-6260.patch DirectUpdateHandler was removed, I think in Solr 4. DirectUpdateHandler2 should be renamed, at least remove that 2. I don't know really what direct means here. Maybe it could be renamed to DefaultUpdateHandler, or UpdateHandlerDefaultImpl, or other good suggestions -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069051#comment-14069051 ] Shalin Shekhar Mangar commented on SOLR-6260: - -1 to breaking compatibility in 4.x (but I'm quite sure we weren't debating that, were we?) bq. Further, if you're worried about attracting new committers, fostering a more civil (and less aggressive and combative) environment would go a long way. Big +1 and thank you for putting it so well. This has become a big pain point. We're all working towards the same goal of making Solr better, we can surely be more courteous to each other. Rename DirectUpdateHandler2 --- Key: SOLR-6260 URL: https://issues.apache.org/jira/browse/SOLR-6260 Project: Solr Issue Type: Improvement Affects Versions: 5.0 Reporter: Tomás Fernández Löbbe Priority: Minor Attachments: SOLR-6260.patch, SOLR-6260.patch DirectUpdateHandler was removed, I think in Solr 4. DirectUpdateHandler2 should be renamed, at least remove that 2. I don't know really what direct means here. Maybe it could be renamed to DefaultUpdateHandler, or UpdateHandlerDefaultImpl, or other good suggestions -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14069062#comment-14069062 ] Tomás Fernández Löbbe commented on SOLR-6260: - bq. I'll remove my -1 if we can at least implement back compat with existing solrconfigs. Sure, I can do this bq. -1 to breaking compatibility in 4.x (but I'm quite sure we weren't debating that, were we?) Yes, I was thinking on doing this change in trunk only. If merged to 4.x it should have been in a backward compatible way. Rename DirectUpdateHandler2 --- Key: SOLR-6260 URL: https://issues.apache.org/jira/browse/SOLR-6260 Project: Solr Issue Type: Improvement Affects Versions: 5.0 Reporter: Tomás Fernández Löbbe Priority: Minor Attachments: SOLR-6260.patch, SOLR-6260.patch DirectUpdateHandler was removed, I think in Solr 4. DirectUpdateHandler2 should be renamed, at least remove that 2. I don't know really what direct means here. Maybe it could be renamed to DefaultUpdateHandler, or UpdateHandlerDefaultImpl, or other good suggestions -- This message was sent by Atlassian JIRA (v6.2#6252) - To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Commented] (SOLR-5968) BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested
[ https://issues.apache.org/jira/browse/SOLR-5968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069063#comment-14069063 ] Gregg Donovan commented on SOLR-5968: - Thanks, Shalin! BinaryResponseWriter fetches unnecessary stored fields when only pseudo-fields are requested Key: SOLR-5968 URL: https://issues.apache.org/jira/browse/SOLR-5968 Project: Solr Issue Type: Bug Reporter: Gregg Donovan Assignee: Shalin Shekhar Mangar Priority: Minor Fix For: 5.0, 4.10 Attachments: SOLR-5968-NOCOMMIT.diff, SOLR-5968.diff If fl contains only pseudo-fields, BinaryResponseWriter will load unnecessary stored fields before discarding them. The call to SolrIndexSearcher#doc(int i, Set<String> fields) can be skipped for this case.
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069064#comment-14069064 ] Mark Miller commented on SOLR-6260: --- bq. I'll remove my -1 if we can at least implement back compat with existing solrconfigs. It should be trivial. Of course we would handle back compat where it makes sense. That's an absurd condition. In any case, if you want to -1 over a vote that shows Solr developers want this change, be my guest. We will see how long that works out for you. bq. Further, if you're worried about attracting new committers, I think the current culture of push back is worse than combating those that foster the pushback. I plan on continuing to push very hard for change in our obstructionist culture - one that I've played my part in at times too. It's necessary. Being aggressive is necessary. I can find a variety of issues that Tomas has filed that have a quick wet blanket thrown on them. We have done everything we can to push him away rather than pull him in, from where I am standing.
[jira] [Assigned] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller reassigned SOLR-6260: - Assignee: Mark Miller
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069076#comment-14069076 ] Yonik Seeley commented on SOLR-6260: bq. Being aggressive is necessary. I disagree. It wasn't necessary in the early days of Lucene/Solr. Although, I'll partially agree - the culture has changed such that one often does now need to be aggressive, else those more aggressive will run right over you. It's sad. It makes me enjoy participating here much less. But it's only currently necessary because the culture has changed to allow it.
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069078#comment-14069078 ] Yonik Seeley commented on SOLR-6260: bq. Yes, I was thinking on doing this change in trunk only. If merged to 4.x it should have been in a backward compatible way. I meant implementing back compat in trunk too - making it easy for users to upgrade is important (as long as it's not a burden, of course.)
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069079#comment-14069079 ] Mark Miller commented on SOLR-6260: --- It's currently necessary because the old culture was obstructionist. Same as the old Lucene culture. It's a great culture for maintenance mode. Lucene changed that because of a couple aggressive people. The results speak for themselves. I've watched people spin in circles for years trying another approach. We have made very little progress on many simple, common-sense issues. I'm ready to make some progress or go home. To let contributors help us. To get their patches in. To help them work around problems rather than just put up easy-to-lay-down road blocks. We have a lot to do. Our current pace, the pace of the past 5+ years, won't cut it. It's no good.
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069110#comment-14069110 ] Mark Miller commented on SOLR-6260: --- bq. We're all working towards the same goal of making Solr better ... more courteous to each other. We all have a variety of different goals and motivations, actually. Let's not pretend everyone is simply working towards making Solr better. That's a lot of people and changing motivations to account for. Where are people not being courteous to each other in this thread? The absence of lively debate tends to mean one thing: people don't care. Now Yonik seems to have taken my broader message personally - the message of "We tend to push back on everything and not help contributors, especially promising contributors, get things done. We favor obstructionism because it's low effort." But it's not meant for him personally. Everyone has played their part in this. I'm not quite sure where I called anyone out for any personal behavior. I called out what I think is happening to Tomas - the wet blanket responses he has received. I think the behavior of the project must change. That these issues must get in. And if you think that happens easily, guess again. I can find a lot of issues that are killed with a comment or two. It takes something like this to get these issues over the hump - to make an issue out of this. To cause discussion around this. To move forward.
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069167#comment-14069167 ] Yonik Seeley commented on SOLR-6260: bq. It takes something like this to get these issues over the hump - to make an issue out of this. See, I was looking at this issue just based on its technical merits. That should always be OK.
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069183#comment-14069183 ] Mark Miller commented on SOLR-6260: --- That was okay. But so is my coming to try and make the issue happen anyway. When someone with enough merit comes in and only mentions "I'm pretty much against this", it has a strong chilling effect! If you had come in with "I have these concerns, but if we do this or this and/or figure out this, we could probably make this happen", I wouldn't have felt so much that I have to try and make sure this issue still happens. I've just seen too many of these issues die summarily with light and easy pushback. For some crap JIRAs, perhaps that is sensible. Given the history of Tomas and his recent contributions and the quality of work I have seen, and my own personal history and interest in this change...here we are! However, I'm not saying you shouldn't have done whatever you want to do. I'm responding to it. I'm trying to set a tone going forward that will let us dump single-core and non-cloud mode and all the baggage that makes the system not as beautiful as it needs to be. Developers want to work on beautiful, not ugly. That's our largest issue. Not any internal community communication. Now, when I look at the pace of this happening, we will be done in 5-10 years. Perhaps. It's not going to cut it. We will be marginalized.
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069198#comment-14069198 ] Jack Krupansky commented on SOLR-6260: -- I noticed that the SolrCore code does in fact default the update handler class to DIH2/SIH if the class attribute is not specified, so maybe the upgrade instructions can simply be for users to remove the updateHandler class attribute, rather than for them to have to learn yet another internal name. And I would reiterate my proposal to remove the class attribute from the example solrconfig.xml files, for both 5.0 and 4.x. Either way, the patch should include changes to the Upgrading section of CHANGES.txt. Do those three things and then I'm an easy +1!
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069202#comment-14069202 ] Yonik Seeley commented on SOLR-6260: bq. I noticed that the SolrCore code does in fact default the update handler class to DIH2/SIH if the class attribute is not specified Then we should probably remove it in the example solrconfigs. That will also lessen the pain next time someone comes up with a better name ;-)
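For illustration, the change Jack and Yonik are converging on would look something like the following sketch of an example solrconfig.xml (the autoCommit block here is a hypothetical placeholder, not copied from the shipped examples):

```xml
<!-- Before: the example config pins the internal class name -->
<updateHandler class="solr.DirectUpdateHandler2">
  <autoCommit>
    <maxTime>15000</maxTime>
  </autoCommit>
</updateHandler>

<!-- After: no class attribute; SolrCore falls back to its default
     implementation, so a future rename never touches user configs -->
<updateHandler>
  <autoCommit>
    <maxTime>15000</maxTime>
  </autoCommit>
</updateHandler>
```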
[jira] [Comment Edited] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069198#comment-14069198 ] Jack Krupansky edited comment on SOLR-6260 at 7/21/14 8:20 PM: --- I noticed that the SolrCore code does in fact default the update handler class to DUH2/SUH if the class attribute is not specified, so maybe the upgrade instructions can simply be for users to remove the updateHandler class attribute, rather than for them to have to learn yet another internal name. And I would reiterate my proposal to remove the class attribute from the example solrconfig.xml files, for both 5.0 and 4.x. Either way, the patch should include changes to the Upgrading section of CHANGES.txt. Do those three things and then I'm an easy +1!
[jira] [Commented] (SOLR-5986) Don't allow runaway queries from harming Solr cluster health or search performance
[ https://issues.apache.org/jira/browse/SOLR-5986?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069225#comment-14069225 ] Jim Walker commented on SOLR-5986: -- Steve, good point regarding automation. That should come first. I just talked to Anshum, who has this covered; he will know best what needs to happen here. Cheers Don't allow runaway queries from harming Solr cluster health or search performance -- Key: SOLR-5986 URL: https://issues.apache.org/jira/browse/SOLR-5986 Project: Solr Issue Type: Improvement Components: search Reporter: Steve Davids Priority: Critical Fix For: 4.9 The intent of this ticket is to have all distributed search requests stop wasting CPU cycles on requests that have already timed out or are so complicated that they won't be able to execute. We have come across a case where a nasty wildcard query within a proximity clause was causing the cluster to enumerate terms for hours even though the query timeout was set to minutes. This caused a noticeable slowdown within the system and made us restart the replicas that happened to service that one request; in the worst-case scenario, users with a relatively low zk timeout value will have nodes start dropping from the cluster due to long GC pauses. [~amccurry] built a mechanism into Apache Blur to help with the issue in BLUR-142 (see the commit comment for code, though look at the latest code on trunk for newer bug fixes). Solr should be able to either prevent these problematic queries from running by some heuristic (possibly estimated size of heap usage) or be able to execute a thread interrupt on all query threads once the time threshold is met.
This issue mirrors what others have discussed on the mailing list: http://mail-archives.apache.org/mod_mbox/lucene-solr-user/200903.mbox/%3c856ac15f0903272054q2dbdbd19kea3c5ba9e105b...@mail.gmail.com%3E
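The second option the ticket describes, a thread interrupt on query threads once the time threshold is met, can be sketched roughly like this (a hypothetical helper, not Solr code; a real query thread would also need to poll its interrupt status during term enumeration):

```java
import java.util.concurrent.Callable;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.ScheduledFuture;
import java.util.concurrent.TimeUnit;

public class QueryWatchdog {
  // Daemon thread so a forgotten watchdog never keeps the JVM alive.
  private final ScheduledExecutorService reaper =
      Executors.newSingleThreadScheduledExecutor(r -> {
        Thread t = new Thread(r, "query-reaper");
        t.setDaemon(true);
        return t;
      });

  /** Runs the query on the calling thread, interrupting it after timeoutMs. */
  public <T> T run(Callable<T> query, long timeoutMs) throws Exception {
    final Thread caller = Thread.currentThread();
    ScheduledFuture<?> kill =
        reaper.schedule(caller::interrupt, timeoutMs, TimeUnit.MILLISECONDS);
    try {
      return query.call();   // the query must poll its interrupt status
    } finally {
      kill.cancel(false);    // finished (or failed) in time: disarm
      Thread.interrupted();  // clear a late interrupt so it can't leak out
    }
  }
}
```

A runaway enumeration that sleeps or checks `Thread.interrupted()` inside its loop is then bounded by the timeout rather than by how pathological the query is.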
[jira] [Commented] (SOLR-6260) Rename DirectUpdateHandler2
[ https://issues.apache.org/jira/browse/SOLR-6260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069233#comment-14069233 ] Mark Miller commented on SOLR-6260: --- bq. That will also lessen the pain next time someone comes up with a better name That's an important part of this issue too. We should clearly document that this is an expert and internal class. We need to own this, and right now it's in a grey area. What better time to clear up that it's internal and not user supported, and rename it, all in one. Take it out of the config, document it as expert/internal. Unless someone has a burning desire to call it NewUpdateHandler3 in a couple years, I think a new name will stick for some time. Anytime you name a class Foo2, I think you can expect someone is going to want to fix it later. Call it Foo or Bar and you will get less name-change demand.
[jira] [Commented] (SOLR-5746) solr.xml parsing of str vs int vs bool is brittle; fails silently; expects odd type for shareSchema
[ https://issues.apache.org/jira/browse/SOLR-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069345#comment-14069345 ] Hoss Man commented on SOLR-5746: bq. My only review comment would be to prefer using hamcrest assertThat() functions in tests, rather than assertTrue(), just because you get nicer error messages when they fail. ... the assertTrue's actually do have messages on them, and they are actual value asserts that just happen to be true -- but i'll change them to assertEquals so it's more obvious what's happening if/when they fail. Going to commit as soon as my checkout, updated to trunk, passes all tests again. solr.xml parsing of str vs int vs bool is brittle; fails silently; expects odd type for shareSchema -- Key: SOLR-5746 URL: https://issues.apache.org/jira/browse/SOLR-5746 Project: Solr Issue Type: Bug Affects Versions: 4.3, 4.4, 4.5, 4.6 Reporter: Hoss Man Attachments: SOLR-5746.patch, SOLR-5746.patch, SOLR-5746.patch, SOLR-5746.patch, SOLR-5746.patch A comment in the ref guide got me looking at ConfigSolrXml.java and noticing that the parsing of solr.xml options here is very brittle and confusing. In particular: * if a boolean option foo is expected along the lines of {{<bool name="foo">true</bool>}} it will silently ignore {{<str name="foo">true</str>}} * likewise for an int option {{<int name="bar">32</int>}} vs {{<str name="bar">32</str>}} ... this is inconsistent with the way solrconfig.xml is parsed. In solrconfig.xml, the xml nodes are parsed into a NamedList, and the above options will work in either form, but an invalid value such as {{<bool name="foo">NOT A BOOLEAN</bool>}} will generate an error earlier (when parsing config) than {{<str name="foo">NOT A BOOLEAN</str>}} (attempt to parse the string as a bool the first time the config value is needed) In addition, i notice this really confusing line...
{code} propMap.put(CfgProp.SOLR_SHARESCHEMA, doSub("solr/str[@name='shareSchema']")); {code} shareSchema is used internally as a boolean option, but as written the parsing code will ignore it unless the user explicitly configures it as a {{<str/>}}
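The lenient behaviour the issue asks for - accept either element form, fail fast on values that are not booleans - can be sketched like this (a minimal standalone example, not Solr's actual ConfigSolrXml code):

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;

public class LenientSolrXmlParser {

  public static Document parse(String xml) throws Exception {
    return DocumentBuilderFactory.newInstance().newDocumentBuilder()
        .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
  }

  /** Accepts the option whether it was written as <bool> or <str>. */
  public static Boolean getBool(Document doc, String name) throws Exception {
    String raw = (String) XPathFactory.newInstance().newXPath().evaluate(
        "/solr/*[local-name()='bool' or local-name()='str'][@name='" + name + "']",
        doc, XPathConstants.STRING);
    if (raw == null || raw.isEmpty()) {
      return null;  // option absent: caller applies its default
    }
    if (raw.equalsIgnoreCase("true")) return Boolean.TRUE;
    if (raw.equalsIgnoreCase("false")) return Boolean.FALSE;
    // Fail while parsing config, not silently and not on first use.
    throw new IllegalArgumentException(
        "solr.xml option '" + name + "' is not a boolean: " + raw);
  }
}
```

The key point mirrored from the comment: matching on name rather than on element type makes `<str name="shareSchema">true</str>` and `<bool name="shareSchema">true</bool>` equivalent, while `NOT A BOOLEAN` is rejected at parse time in both forms.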
[jira] [Assigned] (SOLR-6261) Run checkIfIamLeader in a separate thread
[ https://issues.apache.org/jira/browse/SOLR-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mark Miller reassigned SOLR-6261: - Assignee: Mark Miller Run checkIfIamLeader in a separate thread - Key: SOLR-6261 URL: https://issues.apache.org/jira/browse/SOLR-6261 Project: Solr Issue Type: Improvement Components: SolrCloud Affects Versions: 4.9 Reporter: Ramkumar Aiyengar Assignee: Mark Miller Priority: Minor Currently checking for leadership (due to the leader's ephemeral node going away) happens in ZK's event thread. If there are many cores and all of them are due leadership, then they would have to serially go through the two-way sync and leadership takeover. For tens of cores, this could mean 30-40s without leadership before the last in the list even gets to start the leadership process. If the leadership process happens in a separate thread, then the cores could all take over in parallel.
[jira] [Commented] (SOLR-6261) Run checkIfIamLeader in a separate thread
[ https://issues.apache.org/jira/browse/SOLR-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069369#comment-14069369 ] Mark Miller commented on SOLR-6261: --- Hmm...I'm a little hesitant to fire up a new thread for every one rather than use the update executor or something. Seems like a good step forward though.
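The executor-based alternative Mark alludes to can be sketched like this (hypothetical class and method names, not Solr's actual LeaderElector code): the ZK watcher only hands the work off, so the single event thread never blocks on the two-way sync and many cores can run their checks in parallel.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class LeaderWatcher {
  // A shared pool; how to size it (per-core threads vs. reusing something
  // like the update executor) is the open question in this thread.
  private final ExecutorService leaderExecutor =
      Executors.newCachedThreadPool(r -> {
        Thread t = new Thread(r, "leader-check");
        t.setDaemon(true);
        return t;
      });

  /** Invoked on ZK's event thread when the leader's ephemeral node goes away. */
  public Future<?> onLeaderNodeDeleted(String coreName) {
    // Hand off immediately and return; the event thread stays free to
    // deliver the same notification for the next core.
    return leaderExecutor.submit(() -> checkIfIamLeader(coreName));
  }

  private void checkIfIamLeader(String coreName) {
    // Placeholder for the two-way peer sync and leadership takeover logic.
  }
}
```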
[jira] [Updated] (SOLR-6252) Simplify UnInvertedField#getUnInvertedField synchronization module
[ https://issues.apache.org/jira/browse/SOLR-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vamsee Yarlagadda updated SOLR-6252: Attachment: SOLR-6252-v3.patch Simplify UnInvertedField#getUnInvertedField synchronization module -- Key: SOLR-6252 URL: https://issues.apache.org/jira/browse/SOLR-6252 Project: Solr Issue Type: Improvement Components: search Affects Versions: 5.0 Reporter: Vamsee Yarlagadda Assignee: Mark Miller Priority: Minor Attachments: SOLR-6252-v3.patch, SOLR-6252.patch, SOLR-6252v2.patch It looks like UnInvertedField#getUnInvertedField implements a bit more synchronization than is required, thereby increasing complexity. https://github.com/apache/lucene-solr/blob/trunk/solr/core/src/java/org/apache/solr/request/UnInvertedField.java#L667 As pointed out in the above link, since the synchronization is performed on the cache variable (which itself protects the threads from concurrent access to the cache), we can safely remove all the placeholder flags. As long as cache.get() is in a synchronized block, we can simply populate the cache with new entries and other threads will be able to see the changes. This change was introduced in https://issues.apache.org/jira/browse/SOLR-2548 (Multithreaded faceting)
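The simplification the issue proposes reduces to a plain get-or-create inside one synchronized block; a minimal sketch (hypothetical names, not the actual UnInvertedField code) looks like:

```java
import java.util.HashMap;
import java.util.Map;

public class FieldValueCache {
  private final Map<String, long[]> cache = new HashMap<>();

  /** Get-or-create entirely inside one synchronized block on the cache. */
  public long[] getUnInvertedField(String field) {
    synchronized (cache) {
      long[] uif = cache.get(field);
      if (uif == null) {
        uif = unInvert(field);  // built while holding the lock
        cache.put(field, uif);  // visible to every later synchronized reader
      }
      return uif;
    }
  }

  // Stand-in for the expensive un-inversion work.
  private long[] unInvert(String field) {
    return new long[] { field.length() };
  }
}
```

One tradeoff worth noting: building the entry while holding the lock serializes creation across different fields, which is presumably what the original placeholder flags were trying to avoid, at the cost of the extra complexity this issue removes.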
[jira] [Commented] (SOLR-6252) Simplify UnInvertedField#getUnInvertedField synchronization module
[ https://issues.apache.org/jira/browse/SOLR-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069392#comment-14069392 ] Vamsee Yarlagadda commented on SOLR-6252: - [~markrmil...@gmail.com] Good point! Updated the patch to reflect the suggestions. Thanks [~gchanan].
[jira] [Commented] (SOLR-5746) solr.xml parsing of str vs int vs bool is brittle; fails silently; expects odd type for shareSchema
[ https://issues.apache.org/jira/browse/SOLR-5746?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069422#comment-14069422 ]

ASF subversion and git services commented on SOLR-5746:
-------------------------------------------------------

Commit 1612419 from hoss...@apache.org in branch 'dev/trunk' [ https://svn.apache.org/r1612419 ]

SOLR-5746: Bugs in solr.xml parsing have been fixed to more correctly deal with the various datatypes of options people can specify; additional error handling of duplicated/unidentified options has also been added.

solr.xml parsing of str vs int vs bool is brittle; fails silently; expects odd type for shareSchema
---------------------------------------------------------------------------------------------------
Key: SOLR-5746
URL: https://issues.apache.org/jira/browse/SOLR-5746
Project: Solr
Issue Type: Bug
Affects Versions: 4.3, 4.4, 4.5, 4.6
Reporter: Hoss Man
Attachments: SOLR-5746.patch, SOLR-5746.patch, SOLR-5746.patch, SOLR-5746.patch, SOLR-5746.patch

A comment in the ref guide got me looking at ConfigSolrXml.java and noticing that the parsing of solr.xml options here is very brittle and confusing. In particular:
* if a boolean option foo is expected along the lines of {{<bool name="foo">true</bool>}}, it will silently ignore {{<str name="foo">true</str>}}
* likewise for an int option {{<int name="bar">32</int>}} vs {{<str name="bar">32</str>}}

...this is inconsistent with the way solrconfig.xml is parsed. In solrconfig.xml, the xml nodes are parsed into a NamedList, and the above options will work in either form, but an invalid value such as {{<bool name="foo">NOT A BOOLEAN</bool>}} will generate an error earlier (when parsing the config) than {{<str name="foo">NOT A BOOLEAN</str>}} (which attempts to parse the string as a bool the first time the config value is needed).

In addition, I notice this really confusing line...

{code}
propMap.put(CfgProp.SOLR_SHARESCHEMA, doSub("solr/str[@name='shareSchema']"));
{code}

shareSchema is used internally as a boolean option, but as written the parsing code will ignore it unless the user explicitly configures it as a {{<str/>}}.
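The inconsistency described above could be avoided by coercing option values the way solrconfig.xml's NamedList parsing does: accept the text regardless of whether it arrived in a {{<bool>}} or {{<str>}} node, but reject unparseable values eagerly at config-parse time. A minimal, hypothetical sketch (not the actual ConfigSolrXml code; the class and method names are invented for illustration):

```java
// Hypothetical sketch of lenient-but-strict option parsing: the node type
// (<bool> vs <str>) is ignored, but an unparseable value fails fast when
// the config is parsed instead of being silently dropped.
final class OptionParser {
    static boolean parseBool(String name, String text) {
        if ("true".equalsIgnoreCase(text)) return true;
        if ("false".equalsIgnoreCase(text)) return false;
        throw new IllegalArgumentException(
            "Option '" + name + "' must be true/false, got: " + text);
    }

    static int parseInt(String name, String text) {
        try {
            return Integer.parseInt(text.trim());
        } catch (NumberFormatException e) {
            throw new IllegalArgumentException(
                "Option '" + name + "' must be an int, got: " + text, e);
        }
    }
}
```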
[jira] [Commented] (SOLR-6261) Run checkIfIamLeader in a separate thread
[ https://issues.apache.org/jira/browse/SOLR-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069428#comment-14069428 ]

Ramkumar Aiyengar commented on SOLR-6261:
-----------------------------------------

Yeah, I thought of pooling this up as well initially, but then this is really a function of the number of cores in the instance, and a lot of thread pools are a function of the number of cores already? Can still look into changing it..

Run checkIfIamLeader in a separate thread
-----------------------------------------
Key: SOLR-6261
URL: https://issues.apache.org/jira/browse/SOLR-6261
Project: Solr
Issue Type: Improvement
Components: SolrCloud
Affects Versions: 4.9
Reporter: Ramkumar Aiyengar
Assignee: Mark Miller
Priority: Minor

Currently, checking for leadership (due to the leader's ephemeral node going away) happens in ZK's event thread. If there are many cores and all of them are due leadership, they would have to serially go through the two-way sync and leadership takeover. For tens of cores, this could mean 30-40s without leadership before the last in the list even gets to start the leadership process. If the leadership process happens in a separate thread, the cores could all take over in parallel.
[jira] [Comment Edited] (SOLR-6261) Run checkIfIamLeader in a separate thread
[ https://issues.apache.org/jira/browse/SOLR-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069428#comment-14069428 ]

Ramkumar Aiyengar edited comment on SOLR-6261 at 7/21/14 10:28 PM:
-------------------------------------------------------------------

Yeah, I thought of pooling this up as well initially, but then this is really a function of the number of cores in the instance, and a lot of the threads used by Solr are really a function of the number of cores anyway? Can still look into changing it..

was (Author: andyetitmoves): Yeah, I thought of pooling this up as well initially, but then this is really a function of number of cores in the instance and a lot of threadpools are a function of the number of cores already? Can still look into changing it..
[jira] [Commented] (SOLR-6261) Run checkIfIamLeader in a separate thread
[ https://issues.apache.org/jira/browse/SOLR-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069436#comment-14069436 ]

Mark Miller commented on SOLR-6261:
-----------------------------------

I dunno - I struggled with it when I first saw it and quickly got lazy about it. There is something nicer about it - I think it's best to use pools to spin up threads - but I have a hard time worrying about it too much in this case.
[jira] [Commented] (SOLR-6261) Run checkIfIamLeader in a separate thread
[ https://issues.apache.org/jira/browse/SOLR-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069438#comment-14069438 ]

Mark Miller commented on SOLR-6261:
-----------------------------------

We should look across our process methods and make sure there are not other obvious spots where we are holding things up.
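The change this issue proposes - moving the leadership check off ZooKeeper's event thread so that cores can take over leadership in parallel - amounts to something like the following sketch. Names here are illustrative only; the real code lives in Solr's leader-election machinery, not in a class like this:

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Illustrative sketch: instead of running the (potentially slow) two-way
// sync and leadership takeover serially on the ZK event thread, hand each
// check to a separate thread so cores can proceed in parallel.
class LeaderWatcher {
    private final ExecutorService pool = Executors.newCachedThreadPool();
    final List<String> leaders = new CopyOnWriteArrayList<>();

    // Called from the ZK event thread when the leader's ephemeral node goes away.
    void onLeaderGone(String coreName) {
        pool.execute(() -> checkIfIamLeader(coreName));
    }

    private void checkIfIamLeader(String coreName) {
        // stands in for the real two-way sync and leadership takeover
        leaders.add(coreName);
    }

    void shutdown() {
        pool.shutdown();
        try {
            pool.awaitTermination(10, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
    }
}
```

The cached thread pool mirrors the thread-vs-pool discussion in the comments above: a pool is used, but its size effectively tracks the number of cores that need leadership at once.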
[jira] [Commented] (SOLR-6252) Simplify UnInvertedField#getUnInvertedField synchronization module
[ https://issues.apache.org/jira/browse/SOLR-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069439#comment-14069439 ]

ASF subversion and git services commented on SOLR-6252:
-------------------------------------------------------

Commit 1612422 from [~markrmil...@gmail.com] in branch 'dev/trunk' [ https://svn.apache.org/r1612422 ]

SOLR-6252: A couple of small improvements to the UnInvertedField class.
[jira] [Commented] (SOLR-6252) Simplify UnInvertedField#getUnInvertedField synchronization module
[ https://issues.apache.org/jira/browse/SOLR-6252?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14069446#comment-14069446 ]

ASF subversion and git services commented on SOLR-6252:
-------------------------------------------------------

Commit 1612423 from [~markrmil...@gmail.com] in branch 'dev/branches/branch_4x' [ https://svn.apache.org/r1612423 ]

SOLR-6252: A couple of small improvements to the UnInvertedField class.
[jira] [Created] (LUCENE-5840) hunspell zero-affix handling doesn't work with continuation classes
Robert Muir created LUCENE-5840:
--------------------------------

Summary: hunspell zero-affix handling doesn't work with continuation classes
Key: LUCENE-5840
URL: https://issues.apache.org/jira/browse/LUCENE-5840
Project: Lucene - Core
Issue Type: Bug
Reporter: Robert Muir
Attachments: LUCENE-5840.patch

We fixed handling for this in LUCENE-5817, for the simple case:
{noformat}
SFX 322 eer 0 .
{noformat}
But it's broken in the case where the zero affix has continuation flags:
{noformat}
SFX 322 eer 0/100 .
{noformat}
Because we look for 0 before we parse those away...
[jira] [Updated] (LUCENE-5840) hunspell zero-affix handling doesn't work with continuation classes
[ https://issues.apache.org/jira/browse/LUCENE-5840?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Robert Muir updated LUCENE-5840:
--------------------------------

Attachment: LUCENE-5840.patch

Simple patch: just move the zero-affix handling after the parsing.
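The fix described - strip the continuation classes from the affix field before testing for the zero affix - can be illustrated with a small sketch. This is not the actual Lucene hunspell parser, just a demonstration of the ordering the patch adopts:

```java
// Illustrative sketch: an affix field such as "eer", "0", or "0/100" is
// first split into the affix text and optional continuation classes;
// only then is a bare "0" treated as the zero (empty) affix, so "0/100"
// is handled correctly.
final class AffixField {
    final String affix;
    final String continuationClasses; // "" if none

    AffixField(String field) {
        int slash = field.indexOf('/');
        String raw = slash >= 0 ? field.substring(0, slash) : field;
        this.continuationClasses = slash >= 0 ? field.substring(slash + 1) : "";
        // the zero-affix check happens AFTER the continuation flags are parsed away
        this.affix = raw.equals("0") ? "" : raw;
    }
}
```

With the check done in the original (broken) order, "0/100" would not equal "0" and the zero affix would be missed, which is the bug the issue describes.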
[jira] [Updated] (SOLR-3345) BaseDistributedSearchTestCase should always ignore QTime
[ https://issues.apache.org/jira/browse/SOLR-3345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Vamsee Yarlagadda updated SOLR-3345:
------------------------------------

Attachment: SOLR-3345-SVN.patch

BaseDistributedSearchTestCase should always ignore QTime
--------------------------------------------------------
Key: SOLR-3345
URL: https://issues.apache.org/jira/browse/SOLR-3345
Project: Solr
Issue Type: Bug
Components: SolrCloud
Affects Versions: 4.0-ALPHA
Reporter: Benson Margulies
Assignee: Mark Miller
Attachments: SOLR-3345-SVN.patch, SOLR-3345.patch

The existing subclasses of BaseDistributedSearchTestCase all skip QTime. I can't see any way in which those numbers will ever match. Why not make this the default, or only, behavior? (This is really a question, in that I will provide a patch if no one tells me that it is a bad idea.)
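Making "ignore QTime" the default behavior boils down to dropping the timing field before the control response and the distributed response are compared. A rough, hypothetical sketch (the helper class is invented; "QTime" is the real field name in a Solr response header):

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper: remove fields (such as QTime) that can never be
// expected to match between a control response and a distributed response,
// so the remaining fields can be compared directly.
final class ResponseCompare {
    static Map<String, Object> withoutQTime(Map<String, Object> responseHeader) {
        Map<String, Object> copy = new HashMap<>(responseHeader);
        copy.remove("QTime");
        return copy;
    }
}
```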