max across documents?

2014-09-22 Thread William Bell
Is there an easy way to get max() across documents? i.e. 1. Age = multivalued 56 67 89 2. Next document 5 43 23 How do I get this ? http://localhost:8983/solr/people/select?q=*:*fl=sum(max_across_document(age), 6.0, max(age)) I need the max value cross all docs and then add 6 and the

AW: AW: AW: slorj - httpclient 4, but we already have httpclient 3 in use

2014-09-22 Thread Clemens Wyss DEV
and hopefully there isn't a stick I'll let you know ... so hopefully no follower to this post ;) -Ursprüngliche Nachricht- Von: Guido Medina [mailto:guido.med...@temetra.com] Gesendet: Freitag, 19. September 2014 10:29 An: solr-user@lucene.apache.org Betreff: Re: AW: AW: slorj -

Re: Ping handler during initial wamup

2014-09-22 Thread William Bell
I put in a fix a long time ago, so it wouldn't be too hard to redo. 1. Send 500 error until core 0 numFound. 2. Give it a list of cores. localhost:8983/solr/ping?check=core1,core2,core3 Then the Amazon load balancer can just check this url with HTTP check On Thu, Sep 18, 2014 at 3:57 AM, Ere

Re: MergeReduceIndexerTool takes a lot of time for a limited number of documents

2014-09-22 Thread rulinma
-D 'mapred.child.java.opts=-Xmx500m' set this as your needs. I think it will work well. -- View this message in context: http://lucene.472066.n3.nabble.com/MergeReduceIndexerTool-takes-a-lot-of-time-for-a-limited-number-of-documents-tp4138163p4160362.html Sent from the Solr - User mailing

Solr upgrade to latest version

2014-09-22 Thread Danesh Kuruppu
Hi all, I currently working on upgrade sorl 1.4.1 to sorl latest stable release. What is the latest stable release I can use? Is there specfic things I need to look at when upgrade. Need help Thanks Danesh

[ANNOUNCE] Apache Solr 4.9.1 released

2014-09-22 Thread Michael McCandless
September 2014, Apache Solr™ 4.9.1 available The Lucene PMC is pleased to announce the release of Apache Solr 4.9.1 Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Lucene project. Its major features include powerful full-text search, hit highlighting, faceted

Search multiple cores, one result

2014-09-22 Thread Clemens Wyss DEV
As mentioned in antoher post we (already) have a (Lucene-based) generic indexing framework which allows any source/entity to provide indexable/searchable data. Sources may be: pages events products customers ... As their names imply they have nothing in common ;) Never the less we'd like to

Re: Solr upgrade to latest version

2014-09-22 Thread Alexandre Rafalovitch
4.10.1 out shortly is a good bet. No idea about the upgrade specifically, but I would probably do some reading of recent solrconfig.xml to get a hint of new features. Also, schema.xml has a version number at the top. The default changed which is controlled by that version number. So, it is

Re: copyfield with wildcard-source?

2014-09-22 Thread Alexandre Rafalovitch
On 22 September 2014 01:04, Clemens Wyss DEV clemens...@mysign.ch wrote: All I have at hand is Solr in Action which doesn't (didn't) mention the copyField-wildcards... Well, unless your implementation is also fully theoretical, you also have all the various examples in the Solr distribution.

Re: [ANNOUNCE] Apache Solr 4.9.1 released

2014-09-22 Thread Bernd Fehling
This confuses me a bit, aren't we already at 4.10.0? But CHANGES.txt of 4.10.0 doesn't know anything about 4.9.1. Is this an interim version or something about backward compatibility? Regards Am 22.09.2014 um 11:36 schrieb Michael McCandless: September 2014, Apache Solr™ 4.9.1 available

Re: [ANNOUNCE] Apache Solr 4.9.1 released

2014-09-22 Thread Shalin Shekhar Mangar
This is a bug fix release on top of 4.9. Only some important fixes from 4.10 and beyond were back-ported to the 4.9 branch. There may be a 4.10.1 release too very soon. On Mon, Sep 22, 2014 at 5:54 PM, Bernd Fehling bernd.fehl...@uni-bielefeld.de wrote: This confuses me a bit, aren't we

fuzzy terms, DirectSolrSpellChecker and alternativeTermCount

2014-09-22 Thread Nathaniel Rudavsky-Brody
Hello, I'm trying find the best way to fake the terms component for fuzzy queries. That is, I need the full set of index terms for each of the two queries quidam~1 and quidam~2. I tried defining two suggesters with FuzzyLookupFactory, with maxEdits=1 and 2 respectively, but the results for

Re: [ANNOUNCE] Apache Solr 4.9.1 released

2014-09-22 Thread Shawn Heisey
On 9/22/2014 6:24 AM, Bernd Fehling wrote: This confuses me a bit, aren't we already at 4.10.0? But CHANGES.txt of 4.10.0 doesn't know anything about 4.9.1. Is this an interim version or something about backward compatibility? It's a bugfix release, fixing some showstopper bugs in a recent

Re: [ANNOUNCE] Apache Solr 4.9.1 released

2014-09-22 Thread Michael McCandless
I'll merge back the 4.9.1 CHANGES entries so when we do a 4.10.1, they'll be there ... and I'll also make sure any fix we backported for 4.9.1, we also backport for 4.10.1. Mike McCandless http://blog.mikemccandless.com On Mon, Sep 22, 2014 at 9:11 AM, Shawn Heisey s...@elyograg.org wrote: On

Re: Issue Adding Filter Query

2014-09-22 Thread aaguilar
Hello Erick. Below is the information you requested. Thanks for your help! fieldType name=text_ws_finer class=solr.TextField positionIncrementGap= 100 analyzer type=index tokenizer class= solr.WhitespaceTokenizerFactory/ filter class= solr.WordDelimiterFilterFactory splitOnNumerics=0

[ANN] Lucidworks Fusion 1.0.0

2014-09-22 Thread Grant Ingersoll
Hi All, We at Lucidworks are pleased to announce the release of Lucidworks Fusion 1.0. Fusion is built to overlay on top of Solr (in fact, you can manage multiple Solr clusters -- think QA, staging and production -- all from our Admin).In other words, if you already have Solr, simply

RE: fuzzy terms, DirectSolrSpellChecker and alternativeTermCount

2014-09-22 Thread Dyer, James
Nathaniel, Can you show us all of the parameters you are sending to the spellchecker? When you specify alternativeTermCount with spellcheck.q=quidam, what are the terms you expect to get back? Also, are you getting any query results back? If you are using a q that returns results, or more

RE: fuzzy terms, DirectSolrSpellChecker and alternativeTermCount

2014-09-22 Thread Nathaniel Rudavsky-Brody
Hi James, The request /spellcheck?spellcheck=truespellcheck.q=quiamspellcheck.dictionary=fuzzy2 returns quidam, quam, quia, quoniam, quidem, quadam, quodam, quoad, quedam, quis, quae, quas, quem, quid, quin, qui, qua Replacing quiam (not in the index) by quidam (in the index) returns

RE: fuzzy terms, DirectSolrSpellChecker and alternativeTermCount

2014-09-22 Thread Dyer, James
Did you try spellcheck.alternativeTermCount with DirectSolrSpellChecker? You can set it to whatever low value you actually want it to return back to you (perhaps 20 suggestions max?). James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Nathaniel Rudavsky-Brody

RE: fuzzy terms, DirectSolrSpellChecker and alternativeTermCount

2014-09-22 Thread Nathaniel Rudavsky-Brody
Yep, I tried it both as a default param in the request handler (as in the config I sent), and in the request, but with no effect... That's what surprised me, since it seems it should work. On Mon, Sep 22, 2014 at 4:38 , Dyer, James james.d...@ingramcontent.com wrote: Did you try

Re: Problems for indexing large documents on SolrCloud

2014-09-22 Thread Olivier
Hi, First thanks for your advices. I did some several tests and finally I could index all the data on my SolrCloud cluster. The error was client side, it's documented in this post : http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201406.mbox/%3ccfc09ae1.94f8%25rebecca.t...@ucsf.edu%3E

Re: max across documents?

2014-09-22 Thread Shawn Heisey
On 9/22/2014 12:05 AM, William Bell wrote: Is there an easy way to get max() across documents? I think the stats component is probably what you want. That component seems to be enabled by default. http://wiki.apache.org/solr/StatsComponent Thanks, Shawn

RE: Help on custom sort

2014-09-22 Thread Scott Smith
I'll take a look at that. Thanks -Original Message- From: Apoorva Gaurav [mailto:apoorva.gau...@myntra.com] Sent: Sunday, September 21, 2014 11:32 PM To: solr-user Subject: Re: Help on custom sort Try using a custom value source parser and pass the formula of computing the price to

RE: fuzzy terms, DirectSolrSpellChecker and alternativeTermCount

2014-09-22 Thread Dyer, James
You cannot use 100% because, as you say, 1 is intepreted as 1 document. But you can do something like 99.9% . James Dyer Ingram Content Group (615) 213-4311 -Original Message- From: Nathaniel Rudavsky-Brody [mailto:nathaniel.rudav...@gmail.com] Sent: Monday, September 22, 2014

Solr Boosting Unique Values

2014-09-22 Thread O. Olson
I use Solr to index some products that have an ImageUrl field. Obviously some of the images are duplicates. I would like to boost the rankings of products that have unique images (i.e. more specifically, unique ImageUrl field values, because I don't deal with the image binary). By this I mean,

Re: Schema Parsing Failed: unknown field 'id' [Zookeeper, SolrCloud]

2014-09-22 Thread Chris Hostetter
: Thanks. There is definitely a field name=id .../ in each of the schemas. : : I am using 4.7.2. if this conig is working for you when you don't use zookeeper/hdfs then you must be using a newer version of Solr when you test w/ zk/hdfs 4.8.0 is when the fields and type section tags were

Re: Search multiple cores, one result

2014-09-22 Thread Erick Erickson
Depending on the size, I'd go for (a). IOW, I wouldn't change the sharding to use (a), but if you have the same shard setup in that case, it's easier. You'd index a type field with each doc indicating the source of your document. Then use the grouping feature to return the top N from each of the

Re: Solr Boosting Unique Values

2014-09-22 Thread Erick Erickson
This should be happening automatically by the tf/idf calculations, which weighs terms that are rare in the index more heavily than ones that are more common. That said, at very low numbers this may be invisibly, I'm not sure the relevance calculations for 3 as opposed to 1 are very consequential.

Re: Static Fields Performance vs Dynamic Fields Performance

2014-09-22 Thread Erick Erickson
The example schema and solrconfig are intended to show you a large number of possibilities, they are not necessarily intended to be best practices. I would argue that if you do _not_ want to have dynamic fields defined, you should take them all out. And you should take all of the other field

Re: Schema Parsing Failed: unknown field 'id' [Zookeeper, SolrCloud]

2014-09-22 Thread Erick Erickson
One other possibility in addition to Hoss' comments. Did you load a version of your configs to ZooKeeper sometime that didn't have these fields? I don't quite know where the schema and solrconfig files came from, but the fact that they're on a local disk says nothing about what's in ZooKeeper.

Re: Solr cloud setup question

2014-09-22 Thread Erick Erickson
That page is talking about leaders/followers coming up and going down, but pretty much after they've been assigned in the first place. Your problem is just the assigned in the first place bit. Since Solr 4.8, there's the addreplica collections API command that is what you want I think, see:

Re: Formatting dates

2014-09-22 Thread Erick Erickson
I think this'll help: http://wiki.apache.org/solr/ScriptUpdateProcessor Essentially, each time a document comes in to Solr, this will get invoked on it. You'll have to do some fiddling to get it right, you have to remove the field from the doc and transform it then put it back. None of this is

Re: Issue Adding Filter Query

2014-09-22 Thread aaguilar
Hello Erick, Thank you so much for your help. That makes perfect sense. I will do the changes you suggest and let you know how it goes. Thanks! On Mon, Sep 22, 2014 at 4:12 PM, Erick Erickson [via Lucene] ml-node+s472066n4160547...@n3.nabble.com wrote: You have your index and query time

Re: Formatting dates

2014-09-22 Thread Manohar Kanuri
Thanks Erick, I expected to hear the dreaded word programming at some point and I guess that point has arrived. Now that I know where and what to tinker with. And I should have said 4.10 below, not 5.0. On Sep 22, 2014, at 4:44 PM, Erick Erickson erickerick...@gmail.com wrote: I think

Re: Schema Parsing Failed: unknown field 'id' [Zookeeper, SolrCloud]

2014-09-22 Thread paulparsons
Thanks for the suggestions. I actually had both problems. I couldn't figure out how to remove the configs from zookeeper through the cloud scripts, so I just manually removed the files in the zookeeper data directory. -- View this message in context:

Re: Formatting dates

2014-09-22 Thread Alexandre Rafalovitch
You could try - for your ideal scenario - creating an UpdateRequestProcessor (URP) chain, that includes:ParseDateFieldUpdateProcessorFactory https://lucene.apache.org/solr/4_10_0/solr-core/org/apache/solr/update/processor/ParseDateFieldUpdateProcessorFactory.html Notice that it has been designed

Re: Schema Parsing Failed: unknown field 'id' [Zookeeper, SolrCloud]

2014-09-22 Thread Chris Hostetter
: out how to remove the configs from zookeeper through the cloud scripts, so I : just manually removed the files in the zookeeper data directory. https://cwiki.apache.org/confluence/display/solr/Using+ZooKeeper+to+Manage+Configuration+Files

Performance with fast vector highlighter in solr 4.x

2014-09-22 Thread lei
Hi there, I'm using Solr 4.7 and find the fast vector highlighter is not as fast as it used to be in solr 3.x. It seems the results are not cached, even after several hits of the same query, it still takes dozens of milliseconds to return. Any idea or solution is appreciated. Thanks.

Re: Formatting dates

2014-09-22 Thread Erick Erickson
Alexandre: Honest, I looked for that but was in a rush and couldn't find it and thought I was remembering something _else_. That's definitely a better approach, thanks! Perhaps this time I'll remember Erick On Mon, Sep 22, 2014 at 3:23 PM, Alexandre Rafalovitch arafa...@gmail.com wrote:

Re: Solr upgrade to latest version

2014-09-22 Thread Danesh Kuruppu
Thanks Alex and Erick for quick response, This is really helpful. On Tue, Sep 23, 2014 at 1:19 AM, Erick Erickson erickerick...@gmail.com wrote: Probably go for 4.9.1. There'll be a 4.10.1 out in the not-too-distant future that you can upgrade to if you wish. 4.9.1 - 4.10.1 should be quite