Re: How does text-rev work?

2015-01-09 Thread Alexandre Rafalovitch
So, Query Parser does some sort of magic and looks for the field with the same name and _rev suffix? But what populates that field? In the example schema, it seems to be standalone and empty. Is there a copyField missing? Regards, Alex. Sign up for my Solr resources newsletter at http://w

Re: Garbage Collection tuning - G1 is now a good option

2015-01-09 Thread William Bell
It looks like 32 bit is affected. > On 2013-08-14 08:27, Dawid Weiss wrote: >> >> Hi everyone, >> >> I am a committer to the Lucene/Solr project. We've recently hit what >> we believe is a JIT/GC bug -- it manifests itself only when G1GC is >> used, on a 32-bit VM: On Fri, Jan 9, 2015 at 7:10 PM,

Re: Garbage Collection tuning - G1 is now a good option

2015-01-09 Thread Shawn Heisey
On 1/1/2015 12:10 PM, Shawn Heisey wrote: > I've been working with Oracle employees to find better GC tuning > options. The results are good enough to share with the community: > > https://wiki.apache.org/solr/ShawnHeisey#GC_Tuning > > With the latest Java 7 or Java 8 version, and a couple of tu

Re: Unexplained leader initiated recovery after updates

2015-01-09 Thread Shawn Heisey
On 1/9/2015 4:54 PM, Lindsay Martin wrote: > I am experiencing a problem where Solr nodes go into recovery following an > update cycle. > For background, here are some details about our configuration: > * Solr 4.10.2 (problem also observed with Solr 4.6.1) > * 12 shards with 2 nodes per shard >

Unexplained leader initiated recovery after updates

2015-01-09 Thread Lindsay Martin
Hi all, I am experiencing a problem where Solr nodes go into recovery following an update cycle. Examination of the logs indicates that the recovery is initiated by the shard master while processing regular update events, because the replica is unreachable. For example, the following is recor

Re: How does text-rev work?

2015-01-09 Thread Jack Krupansky
Or a Jira to document it. The basic idea is that if a normal leading wildcard is too slow, the user can index a copy of their text fields using the text_rev type, which indexes terms with their characters reversed and with a special marker. Then the query parser detects a leading wildcard and that

Re: Request two databases at the same time ?

2015-01-09 Thread Bruno Mannina
Dear Erick, thank you for your answer. My answers are below. Le 09/01/2015 20:43, Erick Erickson a écrit : bq: I don't want to modify my BigDB1 to update documents with abstract because BigDB1 is always updated twice by week. Why not? Solr/Lucene handle updating docs, if a doc in the index ha

Re: How does text-rev work?

2015-01-09 Thread Alexandre Rafalovitch
Anybody? Otherwise, I guess it is a JIRA to delete the unused field? Regards, Alex. Sign up for my Solr resources newsletter at http://www.solr-start.com/ On 28 December 2014 at 13:16, Alexandre Rafalovitch wrote: > I am looking at the collection1/techproducts schema and I can't figure

RE: can't make sense of spellchecker results when using techproducts example

2015-01-09 Thread Dyer, James
Chris, - DirectSpellChecker has a setting for "minPrefix" which the techproducts example sets to 1 (also the default). So it will never try to correct the first character. I think this is both a performance optimization and is based on the assumption that we rarely misspell the first characte

Re: Best way to implement Spotlight of certain results

2015-01-09 Thread Michał B . .
Maybe I understand you badly but I thing that you could use grouping to achieve such effect. If you could prepare two group queries one with exact match and other, let's say, default than you will be able to extract matches from grouping results. i.e (using default solr example collection) http://

Re: filter on solr pivot data

2015-01-09 Thread Erick Erickson
Why not just add an fq clause like &fq=-mappings_iphoto_exist:[* TO *]? note the "-" sign. On Fri, Jan 9, 2015 at 11:14 AM, Darniz wrote: > Hello > > i need to know how can i filter on solr pivot data. > > For exampel we have a dealer which might have many cars in his lot and car > has photo

Re: Best way to implement Spotlight of certain results

2015-01-09 Thread Erick Erickson
Hmm, I wonder if the RerankingQueryParser might help here? See: https://cwiki.apache.org/confluence/display/solr/Query+Re-Ranking Best, Erick On Fri, Jan 9, 2015 at 10:35 AM, Dan Davis wrote: > I have a requirement to spotlight certain results if the query text exactly > matches the title or see

Re: Request two databases at the same time ?

2015-01-09 Thread Erick Erickson
bq: I don't want to modify my BigDB1 to update documents with abstract because BigDB1 is always updated twice by week. Why not? Solr/Lucene handle updating docs, if a doc in the index has the same , the old doc is deleted and the new one takes its place. So why not just put the new abstracts into

Re: GC tuning question - can improving GC pauses cause indexing to slow down?

2015-01-09 Thread Walter Underwood
For throughput with G1, get rid of the pause time goal (-XX:MaxGCPauseMillis), so the GC can pause as long as it wants. Beyond that, use a non-concurrent collector and make sure that everything is OK with pauses that last a few seconds. This is a pretty detailed paper about balancing throughput

filter on solr pivot data

2015-01-09 Thread Darniz
Hello i need to know how can i filter on solr pivot data. For exampel we have a dealer which might have many cars in his lot and car has photos, i need to find out a dealer which has cars which has no photos so i have dealer1 -> has 20 cars -> all of them has photos dealer2 -> has 20 cars ->

Best way to implement Spotlight of certain results

2015-01-09 Thread Dan Davis
I have a requirement to spotlight certain results if the query text exactly matches the title or see reference (indexed by me as alttitle_t). What that means is that these matching results are shown above the top-10/20 list with different CSS and fields. Its like feeling lucky on google :) I hav

Request two databases at the same time ?

2015-01-09 Thread Bruno Mannina
Dear All, I use Apache-SOLR3.6, on Ubuntu (newbie user). I have a big database named BigDB1 with 90M documents, each document contains several fields (docid, title, author, date, etc...) I received today from another source, abstract of some documents (there are also the same docid field in thi

Re: Tokenizer or Filter ?

2015-01-09 Thread Jack Krupansky
Consider an update processor - it can take any input, break it up any way you want, and then output multiple field values. You can even us the stateless script update processor to write the logic in JavaScript. -- Jack Krupansky On Fri, Jan 9, 2015 at 6:47 AM, tomas.kalas wrote: > Hello, i hav

Re: Tokenizer or Filter ?

2015-01-09 Thread tomas.kalas
I'm used the same regex and it doesn't work unfortunately. Or should I somehow change the regex? Thanks. -- View this message in context: http://lucene.472066.n3.nabble.com/Tokenizer-or-Filter-tp4178346p4178389.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: How to return child documents with parent

2015-01-09 Thread yliu
Thanks. That solved my problem. Y -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-return-child-documents-with-parent-tp4178081p4178378.html Sent from the Solr - User mailing list archive at Nabble.com.

Re: Tokenizer or Filter ?

2015-01-09 Thread Ahmet Arslan
Can't you use solr.PatternTokenizerFactory for this task? On Friday, January 9, 2015 1:48 PM, tomas.kalas wrote: Hello, i have a question what i have to use tokenizer or filter ? I need separate 2 chanels. I wrote this here earlier, but realize it with solr basic tools it is not probably possib

Tokenizer or Filter ?

2015-01-09 Thread tomas.kalas
Hello, i have a question what i have to use tokenizer or filter ? I need separate 2 chanels. I wrote this here earlier, but realize it with solr basic tools it is not probably possible. And i',m trying to write own tool for this task. I have this input HelloHelloHow are you ?Fine and you're? d

Re: Determining the Number of Solr Shards

2015-01-09 Thread Toke Eskildsen
On Thu, 2015-01-08 at 22:55 +0100, Nishanth S wrote: > Thanks guys for your inputs I would be looking at around 100 Tb of total > index size with 5100 million documents [...] That is a large corpus when coupled with your high indexing & QPS requirements. Are the queries complex too? Will you be