Re: SolrCloud setup - any advice?

2013-09-20 Thread Neil Prosser
Sorry, my bad. For SolrCloud soft commits are enabled (every 15 seconds). I do a hard commit from an external cron task via curl every 15 minutes. The version I'm using for the SolrCloud setup is 4.4.0. Document cache warm-up times are 0ms. Filter cache warm-up times are between 3 and 7 seconds.

Spellchecking

2013-09-20 Thread Gastone Penzo
Hi, i'd like to know if is it possibile to have suggests only of a part of indexes. for example: an ecommerce: there are a lot of typologies of products (book, dvd, cd..) if i search inside books, i want only suggests of books products, not cds but the spellchecking indexs are all together. is

Hash range to shard assignment

2013-09-20 Thread lochri
Hello folks, we would like to have control of where certain hash values or ranges are being located. The reason is that we want to shard per user but we know ahead that one or more specific users could grow way faster than others. Therefore we would like to locate them on separate shards (which

Re: Hash range to shard assignment

2013-09-20 Thread Noble Paul നോബിള്‍ नोब्ळ्
This would need you to plug your own router . It is not yet possible But , you can split that shard repeatedly and keep the no:of users in that shard limited On Fri, Sep 20, 2013 at 3:52 PM, lochri loc...@web.de wrote: Hello folks, we would like to have control of where certain hash values

RE: Spellchecking

2013-09-20 Thread Dyer, James
If you're using spellcheck.collate you can also set spellcheck.maxCollationTries to validate each collation against the index before suggesting it. This validation takes into account any fq parameters on your query, so if your original query has fq=Product:Book, then the collations returned

Re: Will Solr work with a mapped drive?

2013-09-20 Thread Aloke Ghoshal
Hi, Try the UNC path instead: http://wiki.apache.org/tomcat/FAQ/Windows#Q6 Regards, Aloke On 9/20/13, johnmu...@aol.com johnmu...@aol.com wrote: Hi, I'm having this same problem as described here:

Re: check which file/document cause solr to work hard

2013-09-20 Thread Erick Erickson
you can always commit them one at a time to the ExtractingRequestHandler http://wiki.apache.org/solr/ExtractingRequestHandler Best, Erick On Tue, Sep 17, 2013 at 6:47 AM, Yossi Nachum nachum...@gmail.com wrote: Hi, I am trying to index my windows pc files with manifoldcf version 1.3 and

Re: Solr node goes down while trying to index records

2013-09-20 Thread Erick Erickson
What happens if you bump up you zookeeper timeout? This has been an issue at times in the past. Best, Erick On Tue, Sep 17, 2013 at 1:48 PM, Furkan KAMACI furkankam...@gmail.comwrote: Could you give some information about your jetty.xml and give more info about your index rate and RAM usage

Re: Limits of Document Size at SolrCloud and Faced Problems with Large Size of Documents

2013-09-20 Thread Erick Erickson
You're probably exceeding the size that your servlet container allows. This assumes you're using curl or some such. You can change it. How big is the document and how are you sending it to Solr? Best, Erick On Tue, Sep 17, 2013 at 2:24 PM, Furkan KAMACI furkankam...@gmail.comwrote: Currently

Need help understanding the use cases behind core auto-discovery

2013-09-20 Thread Timothy Potter
Trying to add some information about core.properties and auto-discovery in Solr in Action and am at a loss for what to tell the reader is the purpose of this feature. Can anyone point me to any background information about core auto-discovery? I'm not interested in the technical implementation

Problem running EmbeddedSolr (spring data)

2013-09-20 Thread JMill
What is the cause of this Stactrace? Working with the following solr maven dependancies solr-core-version4.4.0/ solr-core-version spring-data-solr-version1.0.0.RC1/spring-data-solr-version Stacktrace SEVERE: Exception sending context initialized event to listener instance of class

Re: Need help understanding the use cases behind core auto-discovery

2013-09-20 Thread Yonik Seeley
On Fri, Sep 20, 2013 at 11:56 AM, Timothy Potter thelabd...@gmail.com wrote: Trying to add some information about core.properties and auto-discovery in Solr in Action and am at a loss for what to tell the reader is the purpose of this feature. IMO, it was more a removal of unnecessary central

Re: Need help understanding the use cases behind core auto-discovery

2013-09-20 Thread Timothy Potter
Exactly the insight I was looking for! Thanks Yonik ;-) On Fri, Sep 20, 2013 at 10:37 AM, Yonik Seeley yo...@lucidworks.com wrote: On Fri, Sep 20, 2013 at 11:56 AM, Timothy Potter thelabd...@gmail.com wrote: Trying to add some information about core.properties and auto-discovery in Solr

Re: Solr node goes down while trying to index records

2013-09-20 Thread Erick Erickson
What happens if you bump up you zookeeper timeout? This has been an issue at times in the past. Best, Erick On Tue, Sep 17, 2013 at 1:48 PM, Furkan KAMACI furkankam...@gmail.com wrote: Could you give some information about your jetty.xml and give more info about your index rate and RAM usage

Re: check which file/document cause solr to work hard

2013-09-20 Thread Erick Erickson
you can always commit them one at a time to the ExtractingRequestHandler http://wiki.apache.org/solr/ExtractingRequestHandler Best, Erick On Tue, Sep 17, 2013 at 6:47 AM, Yossi Nachum nachum...@gmail.com wrote: Hi, I am trying to index my windows pc files with manifoldcf version 1.3 and

Re: Migrating from Endeca

2013-09-20 Thread Shawn Heisey
On 9/19/2013 5:50 AM, Gareth Poulton wrote: A customer wants us to move their entire enterprise platform - of which one of the many components is Oracle Endeca - to open source. However, customers being the way they are, they don't want to have to give up any of the features they currently

Cause of NullPointer Exception? (Solr with Spring Data)

2013-09-20 Thread JMill
I am unsure about the cause of the following NullPointer Exception. Any Ideas? Thanks Exception in thread main org.springframework.beans.factory.BeanCreationException: Error creating bean with name 'aDocumentService': Injection of autowired dependencies failed; nested exception is

Re: SolrCloud setup - any advice?

2013-09-20 Thread Shawn Heisey
On 9/19/2013 9:20 AM, Neil Prosser wrote: Apologies for the giant email. Hopefully it makes sense. Because of its size, I'm going to reply inline like this and I'm going to trim out portions of your original message. I hope that's not horribly confusing to you! Looking through my archive of

Re: JVM Crash using solr 4.4 on Centos

2013-09-20 Thread Oak McIlwain
Thanks Michael, I thought I had the latest but it turned out to be from July 2011. Working Fine with the latest build :-) On Thu, Sep 19, 2013 at 7:29 PM, Michael Ryan mr...@moreover.com wrote: This is a known bug in that JDK version. Upgrade to a newer version of JDK 7 (any build within the

java.lang.LinkageError when using custom filters in multiple cores

2013-09-20 Thread Hayden Muhl
I have two cores favorite and user running in the same Tomcat instance. In each of these cores I have identical field types text_en, text_de, text_fr, and text_ja. These fields use some custom token filters I've written. Everything was going smoothly when I only had the favorite core. When I added

Re: Limits of Document Size at SolrCloud and Faced Problems with Large Size of Documents

2013-09-20 Thread Erick Erickson
A, good to know Shawn... Erick On Fri, Sep 20, 2013 at 1:04 PM, Shawn Heisey s...@elyograg.org wrote: On 9/20/2013 12:34 PM, Erick Erickson wrote: You're probably exceeding the size that your servlet container allows. This assumes you're using curl or some such. You can change it.

Re: Limits of Document Size at SolrCloud and Faced Problems with Large Size of Documents

2013-09-20 Thread Shawn Heisey
On 9/20/2013 12:34 PM, Erick Erickson wrote: You're probably exceeding the size that your servlet container allows. This assumes you're using curl or some such. You can change it. How big is the document and how are you sending it to Solr? The maximum form size is configurable in Solr, not

Getting term offsets from Solr

2013-09-20 Thread Nalini Kartha
Hi, We're looking at implementing highlighting for some fields which may be too large to store in the index. As an alternative to using the Solr Highlighter (which needs fields to be stored), I was wondering if a) the offsets of terms are stored BY DEFAULT in the index (even if we're not using

Re: Getting term offsets from Solr

2013-09-20 Thread Jack Krupansky
Set: termVectors=true termPositions=true termOffsets=true And use the fast vector highlighter. -- Jack Krupansky -Original Message- From: Nalini Kartha Sent: Friday, September 20, 2013 7:34 PM To: solr-user@lucene.apache.org Subject: Getting term offsets from Solr Hi, We're

Re: Limits of Document Size at SolrCloud and Faced Problems with Large Size of Documents

2013-09-20 Thread Erick Erickson
You're probably exceeding the size that your servlet container allows. This assumes you're using curl or some such. You can change it. How big is the document and how are you sending it to Solr? Best, Erick On Tue, Sep 17, 2013 at 4:28 PM, Otis Gospodnetic otis.gospodne...@gmail.com wrote: Hi

Re: Getting term offsets from Solr

2013-09-20 Thread Nalini Kartha
Thanks for the reply. We tried enabling these options but that's also causing too much index bloat so I was wondering if there's a way to get at the offset information more cheaply? Thanks, Nalini On Fri, Sep 20, 2013 at 4:41 PM, Jack Krupansky j...@basetechnology.comwrote: Set:

Re: Getting term offsets from Solr

2013-09-20 Thread Nalini Kartha
I'm wondering if storing just the offset as a payload would be cheaper from storage perspective than enabling termOffsets, termVectors and termPositions? Maybe we could get the offset info to return with results from there then? Thanks, Nalini On Fri, Sep 20, 2013 at 5:02 PM, Nalini Kartha