Encoding data in terms; UTF8 concerns?

2014-05-10 Thread david.w.smi...@gmail.com
I’m working on an encoding of numbers / data into indexed terms. In the past I limited the encoding to ASCII but now I’m doing it at a more raw/byte level. Do I have to be aware of UTF8 / sorting issues when I do this? I noticed the following code in NumericUtils.java, line 186: while
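
To illustrate the sorting concern raised in this message: a minimal sketch, assuming the terms dictionary orders terms by unsigned byte comparison, of encoding a signed long as raw bytes whose byte order matches numeric order. This is not the code from NumericUtils.java; the class and method names (SortableLongBytes, encode, decode, compareUnsigned) are hypothetical and exist only for this example.

```java
import java.nio.ByteBuffer;

// Hypothetical helper for illustration only; not part of Lucene.
public final class SortableLongBytes {
    private SortableLongBytes() {}

    /** Encode a signed long as 8 big-endian bytes whose unsigned byte order matches numeric order. */
    public static byte[] encode(long value) {
        // Flipping the sign bit maps Long.MIN_VALUE..Long.MAX_VALUE onto 0x00..00 through 0xFF..FF.
        // ByteBuffer writes big-endian by default, so the most significant byte comes first.
        return ByteBuffer.allocate(Long.BYTES).putLong(value ^ Long.MIN_VALUE).array();
    }

    /** Reverse of encode: read 8 big-endian bytes and flip the sign bit back. */
    public static long decode(byte[] bytes) {
        return ByteBuffer.wrap(bytes).getLong() ^ Long.MIN_VALUE;
    }

    /** Compare two byte arrays as unsigned bytes, left to right; a shorter prefix sorts first. */
    public static int compareUnsigned(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            // Mask to 0..255 so that bytes >= 0x80 do not compare as negative.
            int diff = (a[i] & 0xFF) - (b[i] & 0xFF);
            if (diff != 0) {
                return diff;
            }
        }
        return a.length - b.length;
    }
}
```

Under that assumption, comparing encode(-1L) with encode(1L) via compareUnsigned yields a negative result, so negative values sort before positive ones. The bytes are not valid UTF-8 in general, so this sketch only applies where terms are handled as raw bytes rather than as strings.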

DocumentsWriterPerThread architecture

2014-04-30 Thread david.w.smi...@gmail.com
Is this still up to date?: https://blog.trifork.com/2011/04/01/gimme-all-resources-you-have-i-can-use-them/ I thought at some point subsequently, some significant work was done, and perhaps it was blogged. But I can’t find it. ~ David

Re: maximum number of shards per SolrCloud

2014-04-21 Thread david.w.smi...@gmail.com
Zhifeng, Please ask Solr questions on the solr-user list. Thanks. ~ David On Mon, Apr 21, 2014 at 9:54 PM, Zhifeng Wang zhifeng.wang...@gmail.com wrote: Hi, We are facing a high incoming rate of usually small documents (logs). The incoming rate is initially assumed at 2K/sec but could reach

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.7.0_55) - Build # 10106 - Still Failing!

2014-04-18 Thread david.w.smi...@gmail.com
This build started before I fixed the issue; it’s already fixed. On Fri, Apr 18, 2014 at 9:12 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/10106/ Java: 64bit/jdk1.7.0_55 -XX:+UseCompressedOops -XX:+UseConcMarkSweepGC

Re: Solr: Serving Javadoc from Jetty server

2014-04-17 Thread david.w.smi...@gmail.com
Alex, Yes it would be useful (of course)! In addition, the admin UI should have a link to it, in addition to the generic documentation link. Create an issue and I’ll commit it. ~ David On Thu, Apr 17, 2014 at 6:54 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Hello, The binary Solr

Blog post: Indexing Polygons In Lucene With Accuracy

2014-04-11 Thread david.w.smi...@gmail.com
FYI I published this blog post today: http://www.opensourceconnections.com/2014/04/11/indexing-polygons-in-lucene-with-accuracy/ There's a strong Spatial4j connection because the SerializedDVStrategy referenced uses the new BinaryCodec from Spatial4j 0.4. ~ David

Re: 4.7.2

2014-04-08 Thread david.w.smi...@gmail.com
LOL indeed ;-) But in all seriousness, that should have no bearing on this conversation. On Tue, Apr 8, 2014 at 3:00 AM, Alexandre Rafalovitch arafa...@gmail.com wrote: Let's hope nobody is trying to finish any books right now. :-) Personal website: http://www.outerthoughts.com/ Current

Re: Anticipating a benchmark for direct posting format

2014-04-07 Thread david.w.smi...@gmail.com
Benson, I like your idea. I think your idea can be achieved as a codec, one that wraps another codec that establishes the on-disk format. By default the wrapped codec can be Lucene's default codec. I think, if implemented, this would be a change to DPF instead of an additional DPF-variant

Re: Anticipating a benchmark for direct posting format

2014-04-07 Thread david.w.smi...@gmail.com
a DirectAtomicReader which does that? I believe it can share some code w/ DPF, as long as we don't make these APIs public, or make them @super.experimental and @super.expert. Just throwing some ideas... Shai On Mon, Apr 7, 2014 at 5:35 PM, david.w.smi...@gmail.com david.w.smi...@gmail.com wrote

Re: Welcome Tim Potter as Lucene/Solr committer

2014-04-07 Thread david.w.smi...@gmail.com
Welcome Tim! On Tue, Apr 8, 2014 at 12:40 AM, Steve Rowe sar...@gmail.com wrote: I'm pleased to announce that Tim Potter has accepted the PMC's invitation to become a committer. Tim, it's tradition that you introduce yourself with a brief bio. Once your account has been created - could

Re: Welcome Alan Woodward to the PMC

2014-04-02 Thread david.w.smi...@gmail.com
Welcome Alan! ~ David On Wed, Apr 2, 2014 at 8:23 AM, Steve Rowe sar...@gmail.com wrote: I'm pleased to announce that Alan Woodward has accepted the PMC's invitation to join. Welcome Alan! - Steve - To unsubscribe,

Re: [VOTE] Lucene / Solr 4.7.1 RC2

2014-03-31 Thread david.w.smi...@gmail.com
+1 SUCCESS! [1:51:37.952160] On Sat, Mar 29, 2014 at 4:46 AM, Steve Rowe sar...@gmail.com wrote: Please vote for the second Release Candidate for Lucene/Solr 4.7.1. Download it here: https://people.apache.org/~sarowe/staging_area/lucene-solr-4.7.1-RC2-rev1582953/ Smoke tester cmdline

Re: [VOTE] Lucene / Solr 4.7.1 RC1

2014-03-26 Thread david.w.smi...@gmail.com
+1 SUCCESS! [2:13:44.301402] On Tue, Mar 25, 2014 at 6:46 PM, Steve Rowe sar...@gmail.com wrote: Please vote for the first Release Candidate for Lucene/Solr 4.7.1. Download it here: http://people.apache.org/~sarowe/staging_area/lucene-solr-4.7.1-RC1-rev1581444/ Smoke tester cmdline:

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.7.0_60-ea-b10) - Build # 9882 - Still Failing!

2014-03-23 Thread david.w.smi...@gmail.com
I'm looking into this. On Sun, Mar 23, 2014 at 5:45 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-trunk-Linux/9882/ Java: 64bit/jdk1.7.0_60-ea-b10 -XX:-UseCompressedOops -XX:+UseSerialGC 1 tests failed. FAILED:

Re: [JENKINS] Lucene-Solr-trunk-Linux (64bit/jdk1.7.0_60-ea-b10) - Build # 9867 - Failure!

2014-03-21 Thread david.w.smi...@gmail.com
I'm definitely looking at it and I've found the problem. I'm working on a fix right now. On Fri, Mar 21, 2014 at 3:27 PM, Michael McCandless luc...@mikemccandless.com wrote: Is someone looking at this test failure? Should we @BadApple it, or revert recent spatial changes, or something?

Re: [JENKINS] Lucene-Solr-4.x-Linux (32bit/jdk1.7.0_51) - Build # 9725 - Still Failing!

2014-03-18 Thread david.w.smi...@gmail.com
I'll look into this one and get it fixed ASAP. On Tue, Mar 18, 2014 at 2:26 AM, Policeman Jenkins Server jenk...@thetaphi.de wrote: Build: http://jenkins.thetaphi.de/job/Lucene-Solr-4.x-Linux/9725/ Java: 32bit/jdk1.7.0_51 -server -XX:+UseSerialGC 2 tests failed. FAILED:

Re: Welcome back, Wolfgang Hoschek!

2013-09-26 Thread david.w.smi...@gmail.com
Nice! Welcome back Wolfgang! On Thu, Sep 26, 2013 at 6:21 AM, Uwe Schindler uschind...@apache.org wrote: Hi, I'm pleased to announce that after a long abstinence, Wolfgang Hoschek rejoined the Lucene/Solr committer team. He is working now at Cloudera and plans to help with the integration

Fwd: [JENKINS] Lucene-Solr-4.x-Linux (32bit/jdk1.6.0_45) - Build # 6066 - Still Failing!

2013-06-14 Thread david.w.smi...@gmail.com
Dawid, Could you please take a look at the reproducibility of this test failure in lucene/spatial? I tried to reproduce it but couldn't, and I thought perhaps you might have some insight because I'm using some RandomizedTesting features that aren't as often used, like @Repeat. For example, one

BooleanFilter MUST clauses and getDocIdSet(acceptDocs)

2012-11-07 Thread david.w.smi...@gmail.com
I am about to write a Filter that only operates on a set of documents that have already passed other filter(s). It's rather expensive, since it has to use DocValues to examine a value and then determine if it's a match. So it scales O(n) where n is the number of documents it must see. The 2nd

Changes as we approach v4

2012-09-21 Thread david.w.smi...@gmail.com
Rob, It appears you are in-effect the Release Manager for v4.0 so I'm asking you this question. Clearly v4 is going to be out soon and consequently we're not pushing new features to the v4 branch. Regarding the new spatial codebase, there isn't a backwards compatibility concern to changes until

Re: [JENKINS] Lucene-Solr-trunk-Linux-Java7-64 - Build # 438 - Failure!

2012-06-29 Thread david.w.smi...@gmail.com
I added the missing ASL header. On Thu, Jun 28, 2012 at 4:54 PM, Policeman Jenkins Server jenk...@sd-datasolutions.de wrote: Build: http://jenkins.sd-datasolutions.de/job/Lucene-Solr-trunk-Linux-Java7-64/438/ All tests passed Build Log: [...truncated 15182 lines...] BUILD FAILED
