Re: [JENKINS] Lucene-Solr-trunk-Linux-Java7-64 - Build # 438 - Failure!

2012-06-29 Thread david.w.smi...@gmail.com
I added the missing ASL header. On Thu, Jun 28, 2012 at 4:54 PM, Policeman Jenkins Server jenk...@sd-datasolutions.de wrote: Build: http://jenkins.sd-datasolutions.de/job/Lucene-Solr-trunk-Linux-Java7-64/438/ All tests passed Build Log: [...truncated 15182 lines...] BUILD FAILED

BooleanFilter MUST clauses and getDocIdSet(acceptDocs)

2012-11-07 Thread david.w.smi...@gmail.com
I am about to write a Filter that only operates on a set of documents that have already passed other filter(s). It's rather expensive, since it has to use DocValues to examine a value and then determine if it's a match. So it scales O(n) where n is the number of documents it must see. The 2nd
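
The cost argument above can be illustrated with a minimal sketch. It is not Lucene's Filter API: a java.util.BitSet stands in for the documents accepted by earlier filters, a plain long[] stands in for per-document DocValues, and the class and method names are hypothetical. The point is only that the expensive check runs once per accepted document, so the work is O(n) in the number of documents it must see rather than in the index size.

```java
import java.util.BitSet;
import java.util.function.LongPredicate;

/** Hypothetical illustration; not part of Lucene. */
final class AcceptedDocsFilterSketch {

    /**
     * Visits only the documents set in {@code accepted} and keeps those whose
     * per-document value satisfies {@code matches}.
     */
    static BitSet filter(BitSet accepted, long[] valuesByDoc, LongPredicate matches) {
        BitSet result = new BitSet(valuesByDoc.length);
        // nextSetBit skips documents that earlier filters already rejected.
        for (int doc = accepted.nextSetBit(0);
             doc >= 0 && doc < valuesByDoc.length;
             doc = accepted.nextSetBit(doc + 1)) {
            if (matches.test(valuesByDoc[doc])) { // the "expensive" per-document check
                result.set(doc);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        BitSet accepted = new BitSet();
        accepted.set(1);
        accepted.set(3);
        accepted.set(4);
        long[] values = {10L, 20L, 30L, 40L, 50L};
        System.out.println(filter(accepted, values, v -> v >= 40L));
    }
}
```

Documents 0 and 2 are never examined in this example because the earlier filters did not accept them.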

Changes as we approach v4

2012-09-21 Thread david.w.smi...@gmail.com
Rob, It appears you are in effect the Release Manager for v4.0 so I'm asking you this question. Clearly v4 is going to be out soon and consequently we're not pushing new features to the v4 branch. Regarding the new spatial codebase, there isn't a backwards compatibility concern to changes until

Re: Welcome back, Wolfgang Hoschek!

2013-09-26 Thread david.w.smi...@gmail.com
Nice! Welcome back Wolfgang! On Thu, Sep 26, 2013 at 6:21 AM, Uwe Schindler uschind...@apache.org wrote: Hi, I'm pleased to announce that after a long abstinence, Wolfgang Hoschek rejoined the Lucene/Solr committer team. He is working now at Cloudera and plans to help with the integration

Fwd: [JENKINS] Lucene-Solr-4.x-Linux (32bit/jdk1.6.0_45) - Build # 6066 - Still Failing!

2013-06-14 Thread david.w.smi...@gmail.com
Dawid, Could you please take a look at the reproducibility of this test failure in lucene/spatial? I tried to reproduce it but couldn't, and I thought perhaps you might have some insight because I'm using some RandomizedTesting features that aren't as often used, like @Repeat. For example, one

Re: [JENKINS] Lucene-Solr-4.x-Linux (32bit/jdk1.7.0_51) - Build # 9725 - Still Failing!

2014-03-18 Thread david.w.smi...@gmail.com
I'll look into this one and get it fixed ASAP. On Tue, Mar 18, 2014 at 2:26 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-4.x-Linux/9725/ Java: 32bit/jdk1.7.0_51 -server -XX:+UseSerialGC 2 tests failed. FAILED:

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.7.0_60-ea-b10) - Build # 9867 - Failure!

2014-03-21 Thread david.w.smi...@gmail.com
I'm definitely looking at it and I've found the problem. I'm working on a fix right now. On Fri, Mar 21, 2014 at 3:27 PM, Michael McCandless luc...@mikemccandless.com wrote: Is someone looking at this test failure? Should we @BadApple it, or revert recent spatial changes, or something?

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.7.0_60-ea-b10) - Build # 9882 - Still Failing!

2014-03-23 Thread david.w.smi...@gmail.com
I'm looking into this. On Sun, Mar 23, 2014 at 5:45 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/9882/ Java: 64bit/jdk1.7.0_60-ea-b10 -XX:-UseCompressedOops -XX:+UseSerialGC 1 tests failed. FAILED:

Re: [VOTE] Lucene / Solr 4.7.1 RC1

2014-03-26 Thread david.w.smi...@gmail.com
+1 SUCCESS! [2:13:44.301402] On Tue, Mar 25, 2014 at 6:46 PM, Steve Rowe sar...@gmail.com wrote: Please vote for the first Release Candidate for Lucene/Solr 4.7.1. Download it here: http://people.apache.org/~sarowe/staging_area/lucene-solr-4.7.1-RC1-rev1581444/ Smoke tester cmdline:

Re: [VOTE] Lucene / Solr 4.7.1 RC2

2014-03-31 Thread david.w.smi...@gmail.com
+1 SUCCESS! [1:51:37.952160] On Sat, Mar 29, 2014 at 4:46 AM, Steve Rowe sar...@gmail.com wrote: Please vote for the second Release Candidate for Lucene/Solr 4.7.1. Download it here: https://people.apache.org/~sarowe/staging_area/lucene-solr-4.7.1-RC2-rev1582953/ Smoke tester cmdline

Re: Welcome Alan Woodward to the PMC

2014-04-02 Thread david.w.smi...@gmail.com
Welcome Alan! ~ David On Wed, Apr 2, 2014 at 8:23 AM, Steve Rowe sar...@gmail.com wrote: I'm pleased to announce that Alan Woodward has accepted the PMC's invitation to join. Welcome Alan! - Steve - To unsubscribe,

Re: Anticipating a benchmark for direct posting format

2014-04-07 Thread david.w.smi...@gmail.com
Benson, I like your idea. I think your idea can be achieved as a codec, one that wraps another codec that establishes the on-disk format. By default the wrapped codec can be Lucene's default codec. I think, if implemented, this would be a change to DPF instead of an additional DPF-variant

Re: Anticipating a benchmark for direct posting format

2014-04-07 Thread david.w.smi...@gmail.com
a DirectAtomicReader which does that? I believe it can share some code w/ DPF, as long as we don't make these APIs public, or make them @super.experimental and @super.expert. Just throwing some ideas... Shai On Mon, Apr 7, 2014 at 5:35 PM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote

Re: Welcome Tim Potter as Lucene/Solr committer

2014-04-07 Thread david.w.smi...@gmail.com
Welcome Tim! On Tue, Apr 8, 2014 at 12:40 AM, Steve Rowe sar...@gmail.com wrote: I'm pleased to announce that Tim Potter has accepted the PMC's invitation to become a committer. Tim, it's tradition that you introduce yourself with a brief bio. Once your account has been created - could

Re: 4.7.2

2014-04-08 Thread david.w.smi...@gmail.com
LOL indeed ;-) But in all seriousness, that should have no bearing on this conversation. On Tue, Apr 8, 2014 at 3:00 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Let's hope nobody is trying to finish any books right now. :-) Personal website: http://www.outerthoughts.com/ Current

Blog post: Indexing Polygons In Lucene With Accuracy

2014-04-11 Thread david.w.smi...@gmail.com
FYI I published this blog post today: http://www.opensourceconnections.com/2014/04/11/indexing-polygons-in-lucene-with-accuracy/ There's a strong Spatial4j connection because the SerializedDVStrategy referenced uses the new BinaryCodec from Spatial4j 0.4. ~ David

Re: Solr: Serving Javadoc from Jetty server

2014-04-17 Thread david.w.smi...@gmail.com
Alex, Yes it would be useful (of course)! The admin UI should also have a link to it, in addition to the generic documentation link. Create an issue and I’ll commit it. ~ David On Thu, Apr 17, 2014 at 6:54 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Hello, The binary Solr

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.7.0_55) - Build # 10106 - Still Failing!

2014-04-18 Thread david.w.smi...@gmail.com
This build started before I fixed the issue; it’s already fixed. On Fri, Apr 18, 2014 at 9:12 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10106/ Java: 64bit/jdk1.7.0_55 -XX:+UseCompressedOops -XX:+UseConcMarkSweepGC

Re: maximum number of shards per SolrCloud

2014-04-21 Thread david.w.smi...@gmail.com
Zhifeng, Please ask Solr questions on the solr-user list. Thanks. ~ David On Mon, Apr 21, 2014 at 9:54 PM, Zhifeng Wang zhifeng.wang...@gmail.com wrote: Hi, We are facing a high incoming rate of usually small documents (logs). The incoming rate is initially assumed at 2K/sec but could reach

DocumentsWriterPerThread architecture

2014-04-30 Thread david.w.smi...@gmail.com
Is this still up to date?: https://blog.trifork.com/2011/04/01/gimme-all-resources-you-have-i-can-use-them/ I thought at some point subsequently, some significant work was done, and perhaps it was blogged. But I can’t find it. ~ David

Encoding data in terms; UTF8 concerns?

2014-05-10 Thread david.w.smi...@gmail.com
I’m working on an encoding of numbers / data into indexed terms. In the past I limited the encoding to ASCII but now I’m doing it at a more raw/byte level. Do I have to be aware of UTF8 / sorting issues when I do this? I noticed the following code in NumericUtils.java, line 186: while
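
As a hedged illustration of the sorting concern raised above: if terms are compared as raw unsigned bytes, a numeric encoding sorts correctly only when its byte order matches numeric order. The sketch below shows one common way to get that for a signed long (big-endian bytes with the sign bit flipped). The class and method names are hypothetical, and this is not the code from NumericUtils.java that the message refers to.

```java
/** Hypothetical helper; not Lucene's NumericUtils. */
final class SortableLongBytes {

    /** Encodes a long as 8 big-endian bytes whose unsigned byte order matches numeric order. */
    static byte[] encode(long value) {
        long flipped = value ^ Long.MIN_VALUE; // flip the sign bit so negatives sort first
        byte[] out = new byte[8];
        for (int i = 7; i >= 0; i--) {
            out[i] = (byte) flipped; // lowest byte goes last (big-endian)
            flipped >>>= 8;
        }
        return out;
    }

    /** Reverses {@link #encode(long)}. */
    static long decode(byte[] bytes) {
        long v = 0;
        for (int i = 0; i < 8; i++) {
            v = (v << 8) | (bytes[i] & 0xFFL);
        }
        return v ^ Long.MIN_VALUE;
    }

    /** Compares two byte arrays as sequences of unsigned bytes. */
    static int compareUnsigned(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int diff = (a[i] & 0xFF) - (b[i] & 0xFF);
            if (diff != 0) {
                return diff;
            }
        }
        return a.length - b.length;
    }

    public static void main(String[] args) {
        long[] samples = {Long.MIN_VALUE, -5L, 0L, 3L, Long.MAX_VALUE};
        for (int i = 1; i < samples.length; i++) {
            int cmp = compareUnsigned(encode(samples[i - 1]), encode(samples[i]));
            System.out.println(samples[i - 1] + " sorts before " + samples[i] + ": " + (cmp < 0));
        }
    }
}
```

Because every byte value from 0x00 to 0xFF can appear, such terms are not valid UTF-8 in general, so this only works where terms are treated as opaque bytes rather than as strings.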

Re: Encoding data in terms; UTF8 concerns?

2014-05-11 Thread david.w.smi...@gmail.com
[mailto:david.w.smi...@gmail.com] *Sent:* Sunday, May 11, 2014 1:17 AM *To:* dev@lucene.apache.org *Cc:* Uwe Schindler; Michael McCandless *Subject:* Encoding data

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.7.0_60-ea-b15) - Build # 10394 - Still Failing!

2014-05-26 Thread david.w.smi...@gmail.com
I’ll dig. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Tue, May 27, 2014 at 12:04 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10394/ Java:

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_20-ea-b15) - Build # 10472 - Failure!

2014-06-04 Thread david.w.smi...@gmail.com
Thanks for fixing, Rob. ~ David On Wed, Jun 4, 2014 at 10:49 PM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10472/ Java: 32bit/jdk1.8.0_20-ea-b15 -server -XX:+UseG1GC 1 tests failed. FAILED:

Re: Trappy behavior with default search field

2014-06-05 Thread david.w.smi...@gmail.com
In my view, solrconfig.xml shouldn’t refer to any field by name out of the box, except for the /browse handler, and perhaps pre-filling the query form in the admin GUI. That’s it. A couple years ago at about the time I became a committer, I finally did something about a feature I am very

Re: Extract values from custom function for ValueSource with multiple indexable fields

2014-06-08 Thread david.w.smi...@gmail.com
I suggest investigating this using a known example that does this, such as LatLonType and geodist(). LatLonType registers the field in a custom way too. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Sun, Jun 8, 2014 at 7:54

Re: Adding Morphline support to DIH - worth the effort?

2014-06-08 Thread david.w.smi...@gmail.com
One of the ideas over DIH discussed earlier is making it standalone. Yeah; my beef with the DIH is that it’s tied to Solr. But I’d rather see something other than the DIH outside Solr; it’s not worthy IMO. Why have something Solr specific even? A great pipeline shouldn’t tie itself to any

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.8.0_20-ea-b15) - Build # 10516 - Still Failing!

2014-06-09 Thread david.w.smi...@gmail.com
I’m on it. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Mon, Jun 9, 2014 at 10:36 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10516/ Java:

Re: solr client sdk's/libraries for native platforms

2014-11-24 Thread david.w.smi...@gmail.com
FYI see https://wiki.apache.org/solr/IntegratingSolr for a list. This is a great use of the wiki. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Mon, Nov 24, 2014 at 10:35 AM, Alexandre Rafalovitch arafa...@gmail.com wrote:

IntelliJ build

2014-11-24 Thread david.w.smi...@gmail.com
On trunk I cleaned and re-created my IntelliJ based build (ant clean-idea, idea). IntelliJ didn’t get the memo about Java 8 so I changed that (locally). Then I found that the Solr velocity contrib couldn’t resolve a ResourceLoader class in analysis-common. So I simply checked the “export”

Re: Where is the SVN repository only for Lucene project ?

2014-11-26 Thread david.w.smi...@gmail.com
GitHub offers SVN access: svn checkout https://github.com/apache/lucene-solr ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Wed, Nov 26, 2014 at 4:19 AM, Yosuke Yamatani s151...@center.wakayama-u.ac.jp wrote: Dear sir/madam

Re: IntelliJ build

2014-11-26 Thread david.w.smi...@gmail.com
misalignments with the Ant build, and because unwanted transitive deps could improperly influence the IntelliJ build. But if you feel strongly about it, go ahead: -0. Thanks for working on it. Steve On Nov 24, 2014, at 10:37 PM, david.w.smi...@gmail.com wrote: On trunk I cleaned and re

Re: svn commit: r1642294 - in /lucene/dev/trunk/lucene: ./ highlighter/src/java/org/apache/lucene/search/highlight/ highlighter/src/test/org/apache/lucene/search/highlight/ test-framework/src/java/org

2014-11-29 Thread david.w.smi...@gmail.com
Reposting my comment on JIRA: Ouch; so sorry I failed the build! In my checkout I have several pending issues related to highlighting, and apparently the Solr one, SOLR-6680 https://issues.apache.org/jira/browse/SOLR-6680, is dependent. I should have monitored the dev list closely; I recall

Re: solr client sdk's/libraries for native platforms

2014-12-01 Thread david.w.smi...@gmail.com
I meant to reply earlier... On Mon, Nov 24, 2014 at 11:37 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: They are super-stale Yup but it’s a wiki so feel free to freshen it up. I’ll be doing that in a bit. It may also be helpful if these particular pages got more prominence/visibility

Re: solr client sdk's/libraries for native platforms

2014-12-01 Thread david.w.smi...@gmail.com
in the vein of a “do-it-tocracy”, getting the Wiki updated is a perfectly good first step, and then if there is a better approach, hopefully that occurs.… ;-) On Dec 1, 2014, at 10:51 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: On 1 December 2014 at 10:02, david.w.smi...@gmail.com

Re: [JENKINS] Lucene-Solr-Tests-5.x-Java7 - Build # 2267 - Failure

2014-12-02 Thread david.w.smi...@gmail.com
I’ll dig. On Tue, Dec 2, 2014 at 11:57 AM, Apache Jenkins Server jenk...@builds.apache.org wrote: Build: https://builds.apache.org/job/Lucene-Solr-Tests-5.x-Java7/2267/ 1 tests failed. FAILED: org.apache.lucene.spatial.prefix.DateNRStrategyTest.testContains {#9

Re: VOTE: Solr Reference Guide for 4.10

2014-09-03 Thread david.w.smi...@gmail.com
+1, at least for the spatial part I looked at ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Wed, Sep 3, 2014 at 1:20 PM, Chris Hostetter hossman_luc...@fucit.org wrote: +1 :

Re: [VOTE] Move trunk to Java 8

2014-09-12 Thread david.w.smi...@gmail.com
Your arguments really resonate with me, Ryan… +1 to Java 8 (FWIW I’m coding in Java 8 these days already) ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Fri, Sep 12, 2014 at 1:39 PM, Ryan Ernst r...@iernst.net wrote:

Re: [VOTE] Move trunk to Java 8

2014-09-15 Thread david.w.smi...@gmail.com
Ryan, I’m unclear on what makes a “procedural vote” as such. This seems to me to be about code modifications — in a big way as it’s a large change to the codebase. ~ David

Re: Adding Morphline support to DIH - worth the effort?

2014-06-11 Thread david.w.smi...@gmail.com
LOL I had the very same reaction Alexandre. Most of us don’t have all this big data software sitting around, even if it is free. Complexity. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Thu, Jun 12, 2014 at 12:44 AM,

Re: [JENKINS] Lucene-Solr-trunk-Windows (32bit/jdk1.8.0_20-ea-b15) - Build # 4119 - Failure!

2014-06-15 Thread david.w.smi...@gmail.com
I’m on it. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Sun, Jun 15, 2014 at 10:30 PM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Windows/4119/ Java:

Re: facet.mincount in SolrCloud

2014-06-16 Thread david.w.smi...@gmail.com
That doesn’t make sense to me either, Toke. Have you tried changing it and running tests to see that they pass? ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Mon, Jun 16, 2014 at 8:39 AM, Toke Eskildsen

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_20-ea-b15) - Build # 10597 - Still Failing!

2014-06-18 Thread david.w.smi...@gmail.com
This is not a spatial bug; it’s another case of: https://issues.apache.org/jira/browse/LUCENE-5713 ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley

Re: [VOTE] 4.9.0

2014-06-21 Thread david.w.smi...@gmail.com
The smoke tester failed for me: *lucene-solr_4x_svn*$ python3.3 -u dev-tools/scripts/smokeTestRelease.py http://people.apache.org/~rmuir/staging_area/lucene_solr_4_9_0_r1604085/ 1604085 4.9.0 /Volumes/RamDisk/tmp JAVA7_HOME is /Library/Java/JavaVirtualMachines/jdk1.7.0_51.jdk/Contents/Home

Re: [VOTE] 4.9.0

2014-06-22 Thread david.w.smi...@gmail.com
, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: The smoke tester failed for me: lucene-solr_4x_svn$ python3.3 -u dev-tools/scripts/smokeTestRelease.py http://people.apache.org/~rmuir/staging_area/lucene_solr_4_9_0_r1604085/ 1604085 4.9.0 /Volumes/RamDisk/tmp JAVA7_HOME

Re: [JENKINS] Lucene-Solr-trunk-MacOSX (64bit/jdk1.7.0) - Build # 1682 - Still Failing!

2014-07-01 Thread david.w.smi...@gmail.com
Another case of: https://issues.apache.org/jira/browse/LUCENE-5713 (cause unknown) ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Tue, Jul 1, 2014 at 6:08 PM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build:

Re: [JENKINS] Lucene-Solr-trunk-MacOSX (64bit/jdk1.7.0) - Build # 1682 - Still Failing!

2014-07-01 Thread david.w.smi...@gmail.com
, 2014 at 6:13 PM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: Another case of: https://issues.apache.org/jira/browse/LUCENE-5713 (cause unknown) ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley

Re: [JENKINS] Lucene-Solr-trunk-MacOSX (64bit/jdk1.7.0) - Build # 1682 - Still Failing!

2014-07-01 Thread david.w.smi...@gmail.com
On Tue, Jul 1, 2014 at 6:54 PM, Robert Muir rcm...@gmail.com wrote: FieldCache is historically lenient, it allows all kinds of nonsense, such as uninverting a multi-valued field as single-valued (e.g. leaves gaps in ordinals and other bullshit that will cause this assertion to fail). I can

Single Field instance for both DocValues and indexed?

2014-07-03 Thread david.w.smi...@gmail.com
I was experimenting with having a user-provided/customized FieldType for indexing code of (mostly) a set of numeric fields that are of a common type. The user/developer might want the type to both be indexed and have docValues, or perhaps just one. Or maybe stored hypothetically for the purposes of

Re: Single Field instance for both DocValues and indexed?

2014-07-03 Thread david.w.smi...@gmail.com
officially? Anyway, I’ll go with this for now. FYI this very class is going to show up in spatial BBoxStrategy in a new patch soon. ~ David On Thu, Jul 3, 2014 at 12:48 PM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: I was experimenting with having a user-provided/customized FieldType

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/ibm-j9-jdk7) - Build # 10760 - Still Failing!

2014-07-08 Thread david.w.smi...@gmail.com
I’m on it; this’ll get fixed momentarily. Some co-related JIRA issues; one got committed without the other. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Tue, Jul 8, 2014 at 2:54 PM, Policeman Jenkins Server

Re: Using a patch review tool for Lucene / Solr development.

2014-07-10 Thread david.w.smi...@gmail.com
On Wed, Jul 9, 2014 at 1:34 PM, Mark Miller markrmil...@gmail.com wrote: A few months ago, I filed INFRA JIRA issue to add the Lucene project to review board (https://reviews.apache.org) and it was just resolved ( https://issues.apache.org/jira/browse/INFRA-7630). Awesome. I’m not the

Re: Hints on constructing/running Solr analyzer chains standalone

2014-07-12 Thread david.w.smi...@gmail.com
That sounds like a wonderful project, Alexandre — I’ve always wanted such a capability! I suggest approaching this very pragmatically based on minimizing the time to get something useful, which means leveraging as much as is available already — that means solr’s existing analysis UI screen. I

Re: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 184 - Still Failing

2014-07-18 Thread david.w.smi...@gmail.com
On Fri, Jul 18, 2014 at 10:18 AM, Timothy Potter thelabd...@gmail.com wrote: sheisty class LOL that error is funny. Tim, Take a look at the smoke tester, line 268 or so. It already makes exceptions for certain Solr contrib modules, and apparently you need to augment it further. ~ David

Re: Distributed spellcheck

2014-07-29 Thread david.w.smi...@gmail.com
Steve, File a bug. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Tue, Jul 29, 2014 at 9:14 AM, Steve Molloy smol...@opentext.com wrote: Hi, I'm running into an issue with distributed spellcheck and was wondering if

Re: Welcome Tomás Fernández Löbbe as Lucene/Solr committer!

2014-07-31 Thread david.w.smi...@gmail.com
Welcome Tomas! Will you make it to Lucene/Solr Revolution in November? ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Thu, Jul 31, 2014 at 1:16 PM, Tomás Fernández Löbbe tomasflo...@gmail.com wrote: Thanks everyone, I’m

Re: [JENKINS] Lucene-Solr-Tests-5.x-Java7 - Build # 2267 - Failure

2014-12-04 Thread david.w.smi...@gmail.com
...@gmail.com david.w.smi...@gmail.com wrote: I’ll dig. On Tue, Dec 2, 2014 at 11:57 AM, Apache Jenkins Server jenk...@builds.apache.org wrote: Build: https://builds.apache.org/job/Lucene-Solr-Tests-5.x-Java7/2267/ 1 tests failed. FAILED

Re: solr client sdk's/libraries for native platforms

2014-12-04 Thread david.w.smi...@gmail.com
On 1 December 2014 at 20:04, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: I like the “last updated …” (rounded to the month) idea. It may be difficult to maintain a “last checked” distinction, and create somewhat more of a burden on maintaining the list. I think it’s useful

Re: Interesting blog on G1 GC improvements u25 - u60

2014-12-05 Thread david.w.smi...@gmail.com
Thanks for sharing; I’ll add that to my reading list. FWIW I don’t trust the G1 enough for production use. Some random Lucene/Solr build failures are blamed on it, even. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Fri, Dec

Re: Minimum test set for idempotent changes in schema.xml

2014-12-06 Thread david.w.smi...@gmail.com
Hi Alex, There isn’t a way to say “just schema-related ones”. But, if you suspect the tests you want to run will have a string in the class name then you can run tests for classes matching a supplied regexp. For example: ant test -Dtestcase=*High* And do that from within a specific module

Re: Minimum test set for idempotent changes in schema.xml

2014-12-06 Thread david.w.smi...@gmail.com
? Thanks Michael Am 07.12.14 um 04:29 schrieb david.w.smi...@gmail.com: Hi Alex, There isn’t a way to say “just schema-related ones”. But, if you suspect the tests you want to run will have a string in the class name then you can run tests for classes matching a supplied regexp

Re: what is the rule for updating CHANGES.txt

2014-12-07 Thread david.w.smi...@gmail.com
Michael, I recall you’re working on building a test tool that sees changes and runs applicable tests? If that’s the case, why would it matter if CHANGES.txt gets updated? The vast majority of the time there is a reference to a JIRA issue from the commit message, and most JIRA issues that have

Re: facet.mincount=0 returns facet values with 0 counts for q=* query

2014-12-09 Thread david.w.smi...@gmail.com
Abhishek, Please send such questions to the solr-user list, not the Lucene dev list. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Tue, Dec 9, 2014 at 9:52 AM, Abhishek Sharma abhishe...@unbxd.com wrote: Hi, Can any one

Re: Release 4.10.3

2014-12-10 Thread david.w.smi...@gmail.com
IMO there’s no such thing as having too many FSTs :-P ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Wed, Dec 10, 2014 at 6:08 AM, Michael McCandless luc...@mikemccandless.com wrote: Hi Mark, If it's not too late, I'd

Re: [VOTE] Release 4.10.3 RC1

2014-12-10 Thread david.w.smi...@gmail.com
On Wed, Dec 10, 2014 at 2:26 PM, Timothy Potter thelabd...@gmail.com wrote: Wow - that's a fast box Mike! My thoughts exactly. +1 SUCCESS! [0:58:16.394370]

Re: Determining NumericType for a field

2014-12-15 Thread david.w.smi...@gmail.com
Down to practicalities, we need Lucene 4.8 as our DocValues are Disk based and that support was removed in 4.9. I assume you’re referring to the “Disk” DV format/Codec? The standard format has the data on disk too, it’s just that there’s some “small” (relative to the disk data) lookup

Re: [JENKINS] Lucene-Solr-5.x-MacOSX (64bit/jdk1.7.0) - Build # 1959 - Still Failing!

2014-12-19 Thread david.w.smi...@gmail.com
Ouch; thanks! ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Fri, Dec 19, 2014 at 10:29 AM, Michael McCandless luc...@mikemccandless.com wrote: I committed a fix. Mike McCandless http://blog.mikemccandless.com On Fri,

Re: how to do auto suggestion using apache lucene?

2014-10-01 Thread david.w.smi...@gmail.com
On Wed, Oct 1, 2014 at 9:19 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: https://github.com/arafalov/Solr-Javadoc/tree/master/SearchServer Pretty cool, Alex! ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley

Highlighters, accurate highlighting, and the PostingsHighlighter

2014-10-09 Thread david.w.smi...@gmail.com
I’m working on making highlighting both accurate and fast. By “accurate”, I mean the highlights need to accurately reflect a match given the query and various possible query types (to include SpanQueries and MultiTermQueries and obviously phrase queries and the usual suspects). The fastest

Re: Highlighters, accurate highlighting, and the PostingsHighlighter

2014-10-10 Thread david.w.smi...@gmail.com
On Fri, Oct 10, 2014 at 6:39 AM, Michael McCandless luc...@mikemccandless.com wrote: +1 for a completely accurate (each snippet shown matches the query) and fast highlighter, but it's a real challenge because you need a clean way to recursively iterate all positions for any (even

Re: Highlighters, accurate highlighting, and the PostingsHighlighter

2014-10-10 Thread david.w.smi...@gmail.com
On Fri, Oct 10, 2014 at 7:13 AM, Robert Muir rcm...@gmail.com wrote: On Fri, Oct 10, 2014 at 12:38 AM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: The fastest highlighter we’ve got in Lucene is the PostingsHighlighter but it throws out any positional nature in the query

Re: Highlighters, accurate highlighting, and the PostingsHighlighter

2014-10-10 Thread david.w.smi...@gmail.com
://www.thetaphi.de eMail: u...@thetaphi.de *From:* david.w.smi...@gmail.com [mailto:david.w.smi...@gmail.com] *Sent:* Friday, October 10, 2014 4:46 PM *To:* dev@lucene.apache.org *Subject:* Re: Highlighters, accurate highlighting, and the PostingsHighlighter On Fri, Oct 10, 2014

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.8.0_11) - Build # 11036 - Still Failing!

2014-08-17 Thread david.w.smi...@gmail.com
I’ll look into it. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Sat, Aug 16, 2014 at 10:16 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/11036/ Java:

Re: Issue with bin/solr script, collection1, and cloud mode

2014-08-28 Thread david.w.smi...@gmail.com
Ok. I wish the router was an explicit option, separate from declaring numShards. And furthermore, that it would never be “implicit” unless you expressly told it to be. People sometimes get this router because they forget numShards, thinking “1 is fine anyway”. — for now. ~ David Smiley

Re: Question for D. Smiley

2014-08-05 Thread david.w.smi...@gmail.com
Hi Erick, The field type for LatLonType mandates a subFieldSuffix or subFieldType attribute, and so I think there’s clearly a problem if you don’t provide a field type that’s going to match it. The default schema even has a comment on the dynamicField definition for *_coordinate that it’s

Re: Can't assign jiras to myself

2014-08-06 Thread david.w.smi...@gmail.com
Tomás, I put you into the “Committers” role for Lucene Solr in JIRA just now. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Wed, Aug 6, 2014 at 9:51 PM, Tomás Fernández Löbbe tomasflo...@gmail.com wrote: May I be missing

Re: Welcome Gregory Chanan as Lucene/Solr committer

2014-09-20 Thread david.w.smi...@gmail.com
Welcome! On Friday, September 19, 2014, Steve Rowe sar...@gmail.com wrote: I'm pleased to announce that Gregory Chanan has accepted the PMC's invitation to become a committer. Gregory, it's tradition that you introduce yourself with a brief bio. Mark Miller, the Lucene PMC chair, has

Re: Lucene Benchmark

2014-09-24 Thread david.w.smi...@gmail.com
I use the benchmark module for spatial and I intend to for highlighting performance next month. On Wednesday, September 24, 2014, Mikhail Khludnev mkhlud...@griddynamics.com wrote: Hi John, It's obvious

Re: History question: contribution from Solr to Lucene

2014-11-02 Thread david.w.smi...@gmail.com
Alex, You should follow Yonik’s blog (Heliosearch), he has a post on this subject, more or less: http://heliosearch.org/lucene-solr-history/ ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Sun, Nov 2, 2014 at 8:36 PM, Alexandre

Re: An experience and some thoughts about solr/example - solr/server

2014-11-04 Thread david.w.smi...@gmail.com
+1 Yeah, this would be huge. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Tue, Nov 4, 2014 at 2:30 AM, Jan Høydahl jan@cominvent.com wrote: Also a crucial part here is to add a Collection tab in Admin GUI, and a more

Multi-valued fields and TokenStream

2014-11-05 Thread david.w.smi...@gmail.com
Several times now, I’ve had to come up with work-arounds for a TokenStream not knowing it’s processing the first value or a subsequent-value of a multi-valued field. Two of these times, the use-case was ensuring the first position of each value started at a multiple of 1000 (or some other
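
The position arithmetic behind that use case can be sketched in plain Java, independent of any TokenStream API. The names below are hypothetical, and the sketch assumes every value produces fewer positions than the chosen stride (1000 here), which is what makes the value index recoverable from a position.

```java
/** Hypothetical illustration of stride-aligned positions; not Lucene code. */
final class ValueAlignedPositions {

    /**
     * Returns the position at which the next value should start so that it
     * lands on a multiple of {@code stride}. {@code lastPosition} is the
     * position of the previous value's last token, or -1 if there is none.
     */
    static int nextValueStart(int lastPosition, int stride) {
        if (lastPosition < 0) {
            return 0; // the first value starts at position 0
        }
        return (lastPosition / stride + 1) * stride;
    }

    /** Which value (0-based) a position belongs to, given the stride. */
    static int valueIndexOf(int position, int stride) {
        return position / stride;
    }

    /** True when the position is the first position of some value. */
    static boolean isFirstPositionOfValue(int position, int stride) {
        return position % stride == 0;
    }

    public static void main(String[] args) {
        int stride = 1000;
        int[] tokensPerValue = {3, 1, 5}; // token counts of three values of one field
        int lastPosition = -1;
        for (int tokens : tokensPerValue) {
            int start = nextValueStart(lastPosition, stride);
            lastPosition = start + tokens - 1;
            System.out.println("value " + valueIndexOf(start, stride)
                    + " occupies positions " + start + ".." + lastPosition);
        }
    }
}
```

A query-time consumer could then use `valueIndexOf` to tell whether two matching positions fall within the same value.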

Re: Multi-valued fields and TokenStream

2014-11-06 Thread david.w.smi...@gmail.com
wrote: Maybe the position increment gap would be useful? If set to a value larger than likely max position for any individual value, it could be used to infer (non-)first-value-ness. On Nov 5, 2014, at 1:03 PM, david.w.smi...@gmail.com wrote: Several times now, I’ve had to come up

Re: Multi-valued fields and TokenStream

2014-11-06 Thread david.w.smi...@gmail.com
the other ideas better though. On Thu, Nov 6, 2014 at 3:13 PM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: Are you suggesting that DefaultIndexingChain.PerField.invert(boolean firstValue) would, prior to calling reset(), call setPositionIncrement(Integer.MAX_VALUE), but only when

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.8.0_40-ea-b09) - Build # 11586 - Failure!

2014-11-08 Thread david.w.smi...@gmail.com
Weird; I can’t reproduce this with the given Ant invocation. I used JDK 1.8.0_20. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Sat, Nov 8, 2014 at 4:50 PM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build:

Re: Solr: very slow custom Query/Weight/Scorer, post filtering vs sorting

2014-10-14 Thread david.w.smi...@gmail.com
On Mon, Oct 13, 2014 at 11:04 AM, Patrick Schemitz p...@solute.de wrote: This Query/Weight/Scorer construct is obviously very costly, so I don't want it to leapfrog with the other - much faster - filters in the query (especially when using a high threshold). It’s leap-frogging with filters?

Re: Change the name of the implicit router in SolrCloud?

2014-10-16 Thread david.w.smi...@gmail.com
+1 for “manual”. Furthermore, I think specifying the router should become mandatory or default to the hash-based router. For back-compat, we can keep current behavior but output a warning about what choice was made. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer

Re: Change the name of the implicit router in SolrCloud?

2014-10-16 Thread david.w.smi...@gmail.com
, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: +1 for “manual”. Furthermore, I think specifying the router should become mandatory or default to the hash-based router. That is the current default (compositeId) -Yonik http://heliosearch.org - native code faceting, facet

Re: Randomized testing talk (your favorite moment)

2015-01-02 Thread david.w.smi...@gmail.com
Some of the bugs it has helped me find that I am most appreciative of are in testing spatial code. One comes to mind when I developed the IsWithin predicate, and others in computing the bounding lat-lon box of a geodetic circle, and… and on and on… but the details are unimportant really and to

Re: StandardTokenizer, maxTokenLength behavior — likely bug

2015-01-26 Thread david.w.smi...@gmail.com
://issues.apache.org/jira/browse/LUCENE-5400 On Mon, Jan 26, 2015 at 11:17 AM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: On one of my other open-source projects (SolrTextTagger) I have a test that deliberately tests the effect of a very long token with the StandardTokenizer, and that project

StandardTokenizer, maxTokenLength behavior — likely bug

2015-01-26 Thread david.w.smi...@gmail.com
On one of my other open-source projects (SolrTextTagger) I have a test that deliberately tests the effect of a very long token with the StandardTokenizer, and that project is in turn tested against a wide matrix of Lucene/Solr versions. Before Lucene 4.9, if you had a token that exceeded

Re: Give Solr its own port number

2015-02-04 Thread david.w.smi...@gmail.com
-0 Hoss’s points are my view as well. 8983 is already pretty well known amongst Solr users. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Wed, Feb 4, 2015 at 12:37 PM, Chris Hostetter hossman_luc...@fucit.org wrote: : Until

Re: Interesting resource for Unix shell script cleanup

2015-02-05 Thread david.w.smi...@gmail.com
Cool! ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Thu, Feb 5, 2015 at 10:25 AM, Steve Rowe sar...@gmail.com wrote: On Feb 5, 2015, at 9:51 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Hi, Just saw a link to

Re: Lucene/Solr 5.0 release branch created

2015-01-14 Thread david.w.smi...@gmail.com
Anshum, are you going to create a 5.1 version in JIRA so we can appropriately assign issues? ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Wed, Jan 14, 2015 at 12:07 PM, Anshum Gupta ans...@anshumgupta.net wrote: Thanks for

Re: Lucene/Solr 5.0 release branch created

2015-01-14 Thread david.w.smi...@gmail.com
+1 to SOLR-6976 especially. I’m sad about SpanQueries not getting nuked yet :-( ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley

Re: Potential contribution: Geo 3d package

2015-01-22 Thread david.w.smi...@gmail.com
to be part of the search query. If you are in favor, I'll create a ticket and attach the library. Karl On Thu, Jan 22, 2015 at 4:01 PM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: Okay. Since this is not _already_ implementing a Spatial4j shape, I can only presume

Re: Potential contribution: Geo 3d package

2015-01-22 Thread david.w.smi...@gmail.com
Nice Karl! I’d love to learn more about this. Do the shapes here implement a Spatial4j Shape and thus would work with SpatialPrefixTree friends for index search? If not, what is the search side of the equation here? ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer

Re: Potential contribution: Geo 3d package

2015-01-22 Thread david.w.smi...@gmail.com
On Thu, Jan 22, 2015 at 3:27 PM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: Nice Karl! I’d love to learn more about this. Do the shapes here implement a Spatial4j Shape and thus would work with SpatialPrefixTree friends for index search? If not, what is the search side

Re: [VOTE] 5.0.0 RC2

2015-02-11 Thread david.w.smi...@gmail.com
stay java 7. On Wed, Feb 11, 2015 at 6:13 AM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote: I found two problems, and I’m not sure what to make of them. First, perhaps the simplest. I ran it with Java 8 with this at the command-line (copied from Uwe’s email, inserting my

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0_31) - Build # 11780 - Failure!

2015-02-11 Thread david.w.smi...@gmail.com
It reproduces; I’m on it. ~ David Smiley Freelance Apache Lucene/Solr Search Consultant/Developer http://www.linkedin.com/in/davidwsmiley On Wed, Feb 11, 2015 at 12:30 PM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/11780/

Re: [VOTE] 5.0.0 RC2

2015-02-11 Thread david.w.smi...@gmail.com
I found two problems, and I’m not sure what to make of them. First, perhaps the simplest. I ran it with Java 8 with this at the command-line (copied from Uwe’s email, inserting my environment variable): python3 -u dev-tools/scripts/smokeTestRelease.py --test-java8 $JAVA8_HOME
