Nutch + Solr - Indexer causes java.lang.OutOfMemoryError: Java heap space

2014-09-07 Thread glumet
Hello everyone, I have configured my 2 servers to run in distributed mode (with Hadoop) and my configuration for crawling process is Nutch 2.2.1 - HBase (as a storage) and Solr. Solr is run by Tomcat. The problem is everytime I try to do the last step - I mean when I want to index data from

Re: SolrCloud : node recovery fails with No registered leader was found

2014-09-07 Thread heaven
Seeing the same thing after a crash of one ZK node (from 5): {code} org.apache.solr.common.SolrException: No registered leader was found after waiting for 4000ms , collection: crm-prod slice: shard1 at org.apache.solr.common.cloud.ZkStateReader.getLeaderRetry(ZkStateReader.java:545)

Re: Query ReRanking question

2014-09-07 Thread Erick Erickson
Joel: I find that whenever I say something totally wrong publicly, I remember the correction really really well... Thanks for straightening that out! Erick On Sat, Sep 6, 2014 at 12:58 PM, Joel Bernstein joels...@gmail.com wrote: This folllowing query:

New cloud - replica in recovering state?

2014-09-07 Thread Jakov Sosic
Hi guys, I'm trying to set up new solr cloud, with two core's, each with two shards and two replicas. This is my solr.xml: ?xml version=1.0 encoding=UTF-8 ? solr persistent=true zkHost=10.200.1.104:2181,10.200.1.105:2181,10.200.1.106:2181 cores adminPath=/admin/cores

ANNOUNCE: Solr Reference Guide for Solr 4.10

2014-09-07 Thread Chris Hostetter
The Lucene PMC is pleased to announce that there is a new version of the Solr Reference Guide for Solr 4.10. The 511 page PDF serves as the definitive user's manual for Solr 4.10. It can be downloaded from the Apache mirror network:

Re: Query ReRanking question

2014-09-07 Thread Joel Bernstein
Ok, just reviewed the code. The ReRankingQParserPlugin always tracks the scores from the main query. So this explains things. Speaking of explaining things, the ReRankingParserPlugin also works with Lucene's explain. So if you use debugQuery=true we should see that the score from the initial query

Re: Query ReRanking question

2014-09-07 Thread Joel Bernstein
Oops wrong usage pattern. It should be: 1) Main query is sorted by a field (scores tracked silently in the background). 2) Reranker is reRanking docs based on the score from the main query. Joel Bernstein Search Engineer at Heliosearch On Sun, Sep 7, 2014 at 4:43 PM, Joel Bernstein

Re: statuscode list

2014-09-07 Thread Koji Sekiguchi
Hi Jan, (2014/09/05 21:01), Jan Verweij - Reeleez wrote: Hi, If I'm correct you will get a statuscode=0 in the response if you use XML messages for updating the solr index. I think you mean by statuscode=0 is status=0 here. ?xml version=1.0 encoding=UTF-8? response lst

Re: New cloud - replica in recovering state?

2014-09-07 Thread Erick Erickson
I really recommend you use the new-style core discovery, if for no other reason than this style is deprecated in 5.0. See: https://wiki.apache.org/solr/Solr.xml%204.4%20and%20beyond FWIW, Erick On Sun, Sep 7, 2014 at 8:51 AM, Jakov Sosic jso...@gmail.com wrote: Hi guys, I'm trying to set up

[ANN] Heliosearch 0.07 released

2014-09-07 Thread Yonik Seeley
http://heliosearch.org/download Heliosearch v0.07 Features o Heliosearch v0.07 is based on (and contains all features of) Lucene/Solr 4.10.0 o An optimized Terms Query with native code performance enhancements for efficiently matching multiple terms in a field.

Re: Performance of Boolean query with hundreds of OR clauses.

2014-09-07 Thread Yonik Seeley
Solr 4.10 has added a {!terms} query that should speed up these cases. Benchmarks here: http://heliosearch.org/solr-terms-query/ -Yonik http://heliosearch.org - native code faceting, facet functions, sub-facets, off-heap data On Tue, Aug 19, 2014 at 2:57 PM, SolrUser1543 osta...@gmail.com

Re: How to implement multilingual word components fields schema?

2014-09-07 Thread Ilia Sretenskii
Thank you for the replies, guys! Using field-per-language approach for multilingual content is the last thing I would try since my actual task is to implement a search functionality which would implement relatively the same possibilities for every known world language. The closest references are