Re: Slow Highlighter Performance Even Using FastVectorHighlighter

2013-06-15 Thread Michael Sokolov
If you have very large documents (many MB), that can lead to slow highlighting, even with FVH. See https://issues.apache.org/jira/browse/LUCENE-3234 and try setting phraseLimit=1 (or some bigger number, but not infinite, which is the default). -Mike On 6/14/13 4:52 PM, Andy Brown wrote:
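
A minimal sketch of the suggestion above, assuming the highlighter settings are passed as Solr request parameters. `hl.phraseLimit` is assumed here to be the request-level counterpart of the phraseLimit setting from LUCENE-3234; the field name `content` and the query text are hypothetical placeholders.

```python
from urllib.parse import urlencode


def build_highlight_params(query, field="content", phrase_limit=1):
    """Build request parameters that enable FastVectorHighlighter
    with a bounded phrase limit instead of the unbounded default."""
    return {
        "q": query,
        "hl": "true",
        "hl.fl": field,  # hypothetical field name
        "hl.useFastVectorHighlighter": "true",
        "hl.phraseLimit": str(phrase_limit),  # assumed parameter name
    }


params = build_highlight_params("lucene highlighter")
query_string = urlencode(params)  # append to the select URL of your core
```

Starting at 1 and raising the limit only if the highlights look incomplete keeps the cost on multi-megabyte documents bounded.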

Re: yet another optimize question

2013-06-15 Thread Otis Gospodnetic
Hi Robi, I'm going to guess you are seeing a smaller heap also simply because you restarted the JVM recently (hm, you don't say you restarted, maybe I'm making this up). If you are indeed indexing continuously, then you shouldn't optimize. Lucene will merge segments itself. Lower mergeFactor will

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Grant Ingersoll
On Jun 13, 2013, at 11:24 AM, Walter Underwood wun...@wunderwood.org wrote: That was my thought exactly. Contribute a REST request handler. --wunder +1. The bits are already in place for a lot of it now that RESTlet is in. That being said, it truly amazes me that people were ever able to

Re: Slow Highlighter Performance Even Using FastVectorHighlighter

2013-06-15 Thread Michael McCandless
You could also try the new[ish] PostingsHighlighter: http://blog.mikemccandless.com/2012/12/a-new-lucene-highlighter-is-born.html Mike McCandless http://blog.mikemccandless.com On Sat, Jun 15, 2013 at 8:50 AM, Michael Sokolov msoko...@safaribooksonline.com wrote: If you have very large

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Alexandre Rafalovitch
On Sat, Jun 15, 2013 at 10:35 AM, Grant Ingersoll gsing...@apache.org wrote: That being said, it truly amazes me that people were ever able to implement Solr, given some of the FUD in this thread. I guess those tens of thousands of deployments out there were all done by above average devs...

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Grant Ingersoll
On Jun 15, 2013, at 12:54 PM, Alexandre Rafalovitch arafa...@gmail.com wrote: On Sat, Jun 15, 2013 at 10:35 AM, Grant Ingersoll gsing...@apache.org wrote: That being said, it truly amazes me that people were ever able to implement Solr, given some of the FUD in this thread. I guess those

Managing SolrCloud

2013-06-15 Thread Furkan KAMACI
I want to design a control mechanism for my SolrCloud. I have two choices. The first is to control every Solr node from a single point: when I want to start and stop Jetty remotely, I will connect to my nodes via an SSH library in Java. I will send backup command and recovery process

Re: Adding pdf/word file using JSON/XML

2013-06-15 Thread Jack Krupansky
[My apologies to Roland for hijacking his original thread for this rant! Look what you started!!] And I will stand by my statement: Solr is too much of a beast for average app developers to master. And the key word there, in case a too-casual reader missed it, is "master" - not "use" in the

Re: Suggest and Filtering

2013-06-15 Thread Brendan Grainger
Hi Otis and Jorge, I probably wasn't phrasing my question too well, but I think I was looking for FuzzySuggest. Messing around with the configs found here seems to be doing what I want:

Solr large boolean filter

2013-06-15 Thread Igor Kustov
I know I'm not the first one with this problem. I'm currently using Solr 4.2.1 with approximately 10 million documents in the index. The index is updated frequently. The filter query is just one big boolean OR query by id: fq=id:(1 2 3 4 ... 50950). The id list is always different and not
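
For illustration only, a sketch of how a filter of the shape described above might be assembled and form-encoded, assuming the ids are plain integers. The helper name `build_id_filter` and the `*:*` main query are hypothetical and not taken from the original post.

```python
from urllib.parse import urlencode


def build_id_filter(ids):
    """Return a filter query of the form id:(1 2 3 ...) for the given ids."""
    id_list = [str(i) for i in ids]
    if not id_list:
        raise ValueError("at least one id is required")
    return "id:(" + " ".join(id_list) + ")"


fq = build_id_filter([1, 2, 3, 4])
# Form-encode the parameters; with tens of thousands of ids the result
# is long, so it is better suited to a request body than to a URL.
body = urlencode({"q": "*:*", "fq": fq})
```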