Re: odd error; JCC bug?

2012-11-01 Thread Andi Vajda
Hi Bill, On Thu, 1 Nov 2012, Bill Janssen wrote: I've installed JCC-2.14 in Python 2.7.3 on OS X Leopard with gcc-4.2, and I'm trying to wrap a library with % python -m jcc --jar ../bin/GoodStuff.jar --jar ../bin/MoreGoodStuff.jar --include ../bin/Prereqs.jar --python goodstuff --files 10

Re: odd error; JCC bug?

2012-11-01 Thread Bill Janssen
Andi Vajda va...@apache.org wrote: Hi Bill, Hi, Andi. On Thu, 1 Nov 2012, Bill Janssen wrote: I've installed JCC-2.14 in Python 2.7.3 on OS X Leopard with gcc-4.2, and I'm trying to wrap a library with % python -m jcc --jar ../bin/GoodStuff.jar --jar ../bin/MoreGoodStuff.jar

[jira] [Commented] (SOLR-4020) SolrJ 4.0.0 not serializing multilple filter queries correctly

2012-11-01 Thread Indika Tantrigoda (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488502#comment-13488502 ] Indika Tantrigoda commented on SOLR-4020: - Hoss: Thanks for the response. I

[jira] [Commented] (SOLR-3876) Solr Admin UI is completely dysfunctional on IE 9

2012-11-01 Thread Daniel Collins (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488554#comment-13488554 ] Daniel Collins commented on SOLR-3876: -- For what it's worth, IE 8 is just as bad.

Re: [JENKINS] Lucene-Solr-trunk-Linux (32bit/jdk1.8.0-ea-b58) - Build # 2126 - Failure!

2012-11-01 Thread Simon Willnauer
That is a test bug; I committed a fix. On Thu, Nov 1, 2012 at 6:43 AM, Policeman Jenkins Server jenk...@sd-datasolutions.de wrote: Build: http://jenkins.sd-datasolutions.de/job/Lucene-Solr-trunk-Linux/2126/ Java: 32bit/jdk1.8.0-ea-b58 -client -XX:+UseG1GC 1 tests failed. REGRESSION:

[jira] [Resolved] (LUCENE-4520) ValueSource.getSortField shouldn't throw IOException

2012-11-01 Thread Alan Woodward (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alan Woodward resolved LUCENE-4520. --- Resolution: Fixed Assignee: Alan Woodward Fixed: trunk r1404523, 4x r1404537

[jira] [Commented] (SOLR-3993) SolrCloud leader election on single node stucks the initialization

2012-11-01 Thread Markus Jelsma (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488596#comment-13488596 ] Markus Jelsma commented on SOLR-3993: - We're seeing this too using a current trunk on a

[jira] [Created] (LUCENE-4522) CompressingStoredFieldsFormat: remove support for DISK_DOC

2012-11-01 Thread Adrien Grand (JIRA)
Adrien Grand created LUCENE-4522: Summary: CompressingStoredFieldsFormat: remove support for DISK_DOC Key: LUCENE-4522 URL: https://issues.apache.org/jira/browse/LUCENE-4522 Project: Lucene - Core

[jira] [Created] (LUCENE-4523) BYTES_*_DEREFshould expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Adrien Grand (JIRA)
Adrien Grand created LUCENE-4523: Summary: BYTES_*_DEREFshould expose getValueCount, getDocToOrd, getBytes(ord) Key: LUCENE-4523 URL: https://issues.apache.org/jira/browse/LUCENE-4523 Project: Lucene

[jira] [Updated] (LUCENE-4523) BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adrien Grand updated LUCENE-4523: - Summary: BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord) (was:

[jira] [Commented] (LUCENE-4523) BYTES_*_DEREFshould expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488622#comment-13488622 ] Adrien Grand commented on LUCENE-4523: -- Maybe we could just add a {{DerefSource}}

[jira] [Commented] (LUCENE-4523) BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488640#comment-13488640 ] Michael McCandless commented on LUCENE-4523: I think DerefSource could work

[jira] [Commented] (LUCENE-4523) BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488641#comment-13488641 ] Robert Muir commented on LUCENE-4523: - +1, let's pull these up from SortedSource since

[jira] [Updated] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-4515: Attachment: LUCENE-4515.patch here is a new patch removing all the nocommits (except of

[jira] [Commented] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488643#comment-13488643 ] Michael McCandless commented on LUCENE-4515: bq. the comment is on start but

[jira] [Commented] (SOLR-4026) Create a custom schema with creating collection

2012-11-01 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488645#comment-13488645 ] Erick Erickson commented on SOLR-4026: -- I don't quite understand what you mean here.

[jira] [Commented] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488655#comment-13488655 ] Robert Muir commented on LUCENE-4515: - {quote} we should mark that ctor experimental

[jira] [Commented] (LUCENE-4523) BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488657#comment-13488657 ] Simon Willnauer commented on LUCENE-4523: - I think that is good. we just need to

[jira] [Commented] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488660#comment-13488660 ] Simon Willnauer commented on LUCENE-4515: - bq. It should definitely be

[jira] [Commented] (SOLR-3876) Solr Admin UI is completely dysfunctional on IE 9

2012-11-01 Thread Erick Erickson (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3876?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488659#comment-13488659 ] Erick Erickson commented on SOLR-3876: -- FWIW, I don't think supporting all browsers

[jira] [Commented] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488673#comment-13488673 ] Michael McCandless commented on LUCENE-4515: What makes me nervous is the

Re: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Michael McCandless
I can't reproduce this ... I run ant nightly-smoke on a trunk checkout and it succeeds ... maybe it's the particular version of java 7? (I'm using 1.7.0_05-b06). Which java7 version is running on lucene.zones.apache.org...? Mike McCandless http://blog.mikemccandless.com On Thu, Nov 1, 2012

[jira] [Commented] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488684#comment-13488684 ] Robert Muir commented on LUCENE-4515: - I have two concerns: 1. the passing of

RE: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Uwe Schindler
ASF Jenkins runs a heavily patched OpenJDK. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Thursday, November 01, 2012 2:22 PM To:

Re: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Michael McCandless
Hmm ... I wonder why suddenly it started failing. Did we upgrade it recently? OK I ran /usr/local/openjdk7/bin/java -fullversion - 1.7.0_06-b24 I'll try upgrading my java7 and see if it causes these failures (which would be annoying!). Mike McCandless http://blog.mikemccandless.com On Thu,

[jira] [Commented] (SOLR-3915) Color Legend for Cloud UI

2012-11-01 Thread Kristopher Kane (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488707#comment-13488707 ] Kristopher Kane commented on SOLR-3915: --- Great Stefan. What about a core that is

Re: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Robert Muir
We should upgrade it. I got an email recently that it was updated, but I forgot to forward to Uwe as I usually do. On Thu, Nov 1, 2012 at 9:49 AM, Michael McCandless luc...@mikemccandless.com wrote: Hmm ... I wonder why suddenly it started failing. Did we upgrade it recently? OK I ran

RE: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Uwe Schindler
Will do it! I can also run the nightly smoker on SDDS Jenkins, maybe a Linux one is better for this test. - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent:

Re: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Robert Muir
This sounds like a good workaround if it's still failing after upgrade. Because if it is, it's possibly a bug in FreeBSD's JDK, which is good for us to try to boil down to a test case for them (e.g. the locale bug we found was serious), but time consuming! On Thu, Nov 1, 2012 at 10:15 AM, Uwe

[jira] [Commented] (SOLR-3816) Need a more granular nrt system that is close to a realtime system.

2012-11-01 Thread Nagendra Nagarajayya (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488720#comment-13488720 ] Nagendra Nagarajayya commented on SOLR-3816: Attached a patch for the 4.x

Re: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Michael McCandless
OK I ran ant nightly-smoke using 1.7.0_06 and it succeeded ... Mike McCandless http://blog.mikemccandless.com On Thu, Nov 1, 2012 at 10:17 AM, Robert Muir rcm...@gmail.com wrote: This sounds like a good workaround if it's still failing after upgrade. Because if it is, it's possibly a bug in

RE: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Uwe Schindler
I updated both ports on FreeBSD: [root@lucene /usr/ports/java/openjdk7]# /usr/local/openjdk7/bin/java -version openjdk version 1.7.0_09 OpenJDK Runtime Environment (build 1.7.0_09-b05) OpenJDK 64-Bit Server VM (build 23.5-b02, mixed mode) [root@lucene /usr/ports/java/openjdk6]#

[jira] [Updated] (SOLR-4019) Service Unavailable errors not being logged

2012-11-01 Thread JIRA
[ https://issues.apache.org/jira/browse/SOLR-4019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tomás Fernández Löbbe updated SOLR-4019: Attachment: SOLR-4019.patch Updated patch. In the ShowFileRequestHandler I also

[jira] [Commented] (LUCENE-4522) CompressingStoredFieldsFormat: remove support for DISK_DOC

2012-11-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488749#comment-13488749 ] Robert Muir commented on LUCENE-4522: - +1

Re: [JENKINS] Lucene-Solr-SmokeRelease-trunk - Build # 28 - Still Failing

2012-11-01 Thread Michael McCandless
Thanks Uwe. Let's see if nightly-smoke passes tonite... Mike McCandless http://blog.mikemccandless.com On Thu, Nov 1, 2012 at 10:56 AM, Uwe Schindler u...@thetaphi.de wrote: I updated both ports on FreeBSD: [root@lucene /usr/ports/java/openjdk7]# /usr/local/openjdk7/bin/java -version

[jira] [Commented] (SOLR-3816) Need a more granular nrt system that is close to a realtime system.

2012-11-01 Thread Nagendra Nagarajayya (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488752#comment-13488752 ] Nagendra Nagarajayya commented on SOLR-3816: @Otis: The 70,000 update

[jira] [Commented] (SOLR-3816) Need a more granular nrt system that is close to a realtime system.

2012-11-01 Thread Nagendra Nagarajayya (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488755#comment-13488755 ] Nagendra Nagarajayya commented on SOLR-3816: @David: You have to disable the

Data Import from database with expired certificate via SSL

2012-11-01 Thread Alexandru Toader
Dear all, I have just started working with Solr but I have stumbled upon a problem and I would really appreciate assistance. I want to use the DataImportHandler to get data from a remote database that requires Basic HTTP Authentication. I have tried the following configuration: dataConfig

[jira] [Comment Edited] (LUCENE-4523) BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488796#comment-13488796 ] Adrien Grand edited comment on LUCENE-4523 at 11/1/12 4:16 PM:

[jira] [Commented] (LUCENE-4523) BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Adrien Grand (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488796#comment-13488796 ] Adrien Grand commented on LUCENE-4523: -- Any idea on how we should fix VAR_DEREF to

[jira] [Commented] (LUCENE-4523) BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488827#comment-13488827 ] Robert Muir commented on LUCENE-4523: - Does this hurt anything (just pretend address

[jira] [Commented] (LUCENE-4523) BYTES_*_DEREF should expose getValueCount, getDocToOrd, getBytes(ord)

2012-11-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488842#comment-13488842 ] Robert Muir commented on LUCENE-4523: - Or just keep the ord in SortedSource? I'm

[jira] [Commented] (SOLR-3816) Need a more granular nrt system that is close to a realtime system.

2012-11-01 Thread Otis Gospodnetic (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488905#comment-13488905 ] Otis Gospodnetic commented on SOLR-3816: [~nnagarajayya] Right, that is what I

[jira] [Commented] (LUCENE-4518) Suggesters: highlighting (explicit markup of user-typed portions vs. generated portions in a suggestion)

2012-11-01 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4518?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488906#comment-13488906 ] Michael McCandless commented on LUCENE-4518: Hmm... it could be that if we

Re: [JENKINS] Lucene-Solr-NightlyTests-4.x - Build # 81 - Still Failing

2012-11-01 Thread Michael McCandless
I think Adrien fixed this in rev 1404456. Mike McCandless http://blog.mikemccandless.com On Wed, Oct 31, 2012 at 7:30 PM, Apache Jenkins Server jenk...@builds.apache.org wrote: Build: https://builds.apache.org/job/Lucene-Solr-NightlyTests-4.x/81/ 1 tests failed. FAILED:

[jira] [Commented] (SOLR-4020) SolrJ 4.0.0 not serializing multilple filter queries correctly

2012-11-01 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488919#comment-13488919 ] Hoss Man commented on SOLR-4020: {quote} I've used SolrParams solrParams =

[jira] [Resolved] (SOLR-4020) SolrJ 4.0.0 not serializing multilple filter queries correctly

2012-11-01 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man resolved SOLR-4020. Resolution: Duplicate SolrJ 4.0.0 not serializing multilple filter queries correctly

[jira] [Assigned] (SOLR-1666) SolrParams conversion to NamedList and back to SolrParams misses the Arrays with more than one value

2012-11-01 Thread Hoss Man (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-1666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hoss Man reassigned SOLR-1666: -- Assignee: Hoss Man SolrParams conversion to NamedList and back to SolrParams misses the Arrays

[jira] [Commented] (SOLR-3589) Edismax parser does not honor mm parameter if analyzer splits a token

2012-11-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488932#comment-13488932 ] Robert Muir commented on SOLR-3589: --- I pinged hossman on IRC for some feedback, I'll

Re: Posting updated ConcatFilter code, using 4.0.0 compatible classes

2012-11-01 Thread Otis Gospodnetic
Hi Mark, Out of curiosity, what was your use case? Thanks, Otis -- Search Analytics - http://sematext.com/search-analytics/index.html Performance Monitoring - http://sematext.com/spm/index.html On Wed, Oct 31, 2012 at 10:56 PM, Mark Bennett mbenn...@ideaeng.com wrote: This filter lets you

[jira] [Updated] (SOLR-3856) DIH: Better tests for SqlEntityProcessor

2012-11-01 Thread James Dyer (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Dyer updated SOLR-3856: - Affects Version/s: 3.6 4.0 Fix Version/s: 4.1 DIH: Better tests for

[jira] [Updated] (SOLR-3856) DIH: Better tests for SqlEntityProcessor

2012-11-01 Thread James Dyer (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] James Dyer updated SOLR-3856: - Attachment: SOLR-3856.patch This patch tests both full-import and delta-import with a fair amount of

[jira] [Commented] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13488947#comment-13488947 ] Michael McCandless commented on LUCENE-4515: {quote} 1. reduce the GC load on

RE: Posting updated ConcatFilter code, using 4.0.0 compatible classes

2012-11-01 Thread Uwe Schindler
Hi Otis, One use case I had for a similar filter for a customer was some ngramming approach. The tokenization before was there to create “normalized” tokens, which were then glued together (with or w/o whitespace) and ngrammed (meaning several ngram tokens created from the glued-together

Re: Posting updated ConcatFilter code, using 4.0.0 compatible classes

2012-11-01 Thread Simon Willnauer
I used combine filters before too. I think there is a use case for this stuff; we do similar things in suggesters with TokenStreamToAutomaton and finite strings. That is really the same kind of thing though. Maybe we can wrap it in a tokenstream and emit the finite path as synonyms, i.e. on the same

Re: Posting updated ConcatFilter code, using 4.0.0 compatible classes

2012-11-01 Thread Mark Bennett
Hi Otis, Forgive my vagueness, it's an NDA thing. Generally speaking you might want to do record matching based on a number of fields. But since text fields are input by humans, they can be a bit inconsistent about how values are entered. One answer is to remove things like stop words,

Re: Posting updated ConcatFilter code, using 4.0.0 compatible classes

2012-11-01 Thread Mark Bennett
Hi Simon, I'd love to see a ConcatFilter and factory find a permanent home as part of the stable of standard filters. But perhaps for the Automaton function it'd need to be packaged differently? -- Mark Bennett / New Idea Engineering, Inc. / mbenn...@ideaeng.com Direct: 408-733-0387 / Main:

Re: Posting updated ConcatFilter code, using 4.0.0 compatible classes

2012-11-01 Thread Otis Gospodnetic
Hi Mark, Thanks for the explanation - makes sense! Re ES - yes. But I pasted your Q in http://blog.sematext.com/2012/09/04/solr-vs-elasticsearch-part-2-data-handling/comments, too, so you should get a more thorough answer there soon. Otis -- Search Analytics -

[jira] [Updated] (SOLR-3589) Edismax parser does not honor mm parameter if analyzer splits a token

2012-11-01 Thread Robert Muir (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3589?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Muir updated SOLR-3589: -- Attachment: SOLR-3589.patch patch with the added synonyms test. Edismax parser does

[DISCUSS] Merge DocsEnum and DocsAndPositionsEnum into PostingsEnum

2012-11-01 Thread Simon Willnauer
hey folks, I have spent a hell of a lot of time on the positions branch to make positions and offsets work on all queries if needed. The one thing that bugged me the most is the distinction between DocsEnum and DocsAndPositionsEnum. Really when you look at it closer DocsEnum is a DocsAndFreqsEnum

Re: [DISCUSS] Merge DocsEnum and DocsAndPositionsEnum into PostingsEnum

2012-11-01 Thread Robert Muir
On Thu, Nov 1, 2012 at 4:26 PM, Simon Willnauer simon.willna...@gmail.com wrote: hey folks, I have spent a hell of a lot of time on the positions branch to make positions and offsets work on all queries if needed. The one thing that bugged me the most is the distinction between DocsEnum and

RE: [DISCUSS] Merge DocsEnum and DocsAndPositionsEnum into PostingsEnum

2012-11-01 Thread Uwe Schindler
+1, I think PostingsEnum is the much better idea! I was thinking about that several times. In fact DocsEnum is just a specialized DocIdSetIterator, so I never understood the difference in the early Lucene 4 days. Now we have some extra methods, but most of them are optional and a PostingsEnum

Re: [DISCUSS] Merge DocsEnum and DocsAndPositionsEnum into PostingsEnum

2012-11-01 Thread Robert Muir
On Thu, Nov 1, 2012 at 4:55 PM, Uwe Schindler u...@thetaphi.de wrote: +1, I think PostingsEnum is the much better idea! I was thinking about that several times. In fact DocsEnum is just a specialized DocIdSetIterator, so I never understood the difference in the early Lucene 4 days. Now we

[jira] [Commented] (SOLR-3936) QueryElevationComponent: Wrong order when result grouping is activated

2012-11-01 Thread Michael Berger (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-3936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13489039#comment-13489039 ] Michael Berger commented on SOLR-3936: -- I tried to fix by myself: debug:{

[jira] [Updated] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Simon Willnauer (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Simon Willnauer updated LUCENE-4515: Attachment: LUCENE-4515.patch alright guys. Here is a new patch that accepts a long

Re: [DISCUSS] Merge DocsEnum and DocsAndPositionsEnum into PostingsEnum

2012-11-01 Thread Michael McCandless
+1, this makes total sense! Mike McCandless http://blog.mikemccandless.com On Thu, Nov 1, 2012 at 5:04 PM, Robert Muir rcm...@gmail.com wrote: On Thu, Nov 1, 2012 at 4:55 PM, Uwe Schindler u...@thetaphi.de wrote: +1, I think PostingsEnum is the much better idea! I was thinking about that

[jira] [Created] (LUCENE-4524) Merge DocsEnum and DocsAndPositionsEnum into PostingsEnum

2012-11-01 Thread Simon Willnauer (JIRA)
Simon Willnauer created LUCENE-4524: --- Summary: Merge DocsEnum and DocsAndPositionsEnum into PostingsEnum Key: LUCENE-4524 URL: https://issues.apache.org/jira/browse/LUCENE-4524 Project: Lucene -

[jira] [Commented] (LUCENE-4515) Make MemoryIndex more memory efficient

2012-11-01 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/LUCENE-4515?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13489138#comment-13489138 ] Michael McCandless commented on LUCENE-4515: Maybe rename maxBufferedBytes to

[jira] [Commented] (SOLR-4026) Create a custom schema with creating collection

2012-11-01 Thread milesli (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13489204#comment-13489204 ] milesli commented on SOLR-4026: --- 1 Not modify an existing schema, but rather put a new

[jira] [Comment Edited] (SOLR-4026) Create a custom schema with creating collection

2012-11-01 Thread milesli (JIRA)
[ https://issues.apache.org/jira/browse/SOLR-4026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13489204#comment-13489204 ] milesli edited comment on SOLR-4026 at 11/2/12 2:23 AM: 1 Not

[jira] [Created] (SOLR-4027) Parse exception with geofilt in parens

2012-11-01 Thread Mauricio Scheffer (JIRA)
Mauricio Scheffer created SOLR-4027: --- Summary: Parse exception with geofilt in parens Key: SOLR-4027 URL: https://issues.apache.org/jira/browse/SOLR-4027 Project: Solr Issue Type: Bug