Re: Apache solr sink issue

2014-08-19 Thread Gopal Patwa
Do you have this tag uniqueKeyid/uniqueKey define in your schema , it is not mandatory to have unique field but if you need it then u have to provide it else you can remove it, see below wiki page for more details http://wiki.apache.org/solr/SchemaXml#The_Unique_Key_Field Some options to

Any recommendation for Solr Cloud version.

2014-08-19 Thread Lee Chunki
Hi, I am trying to build a new Solr Cloud which will replace sold cluster ( 2 indexers + 2 searchers ). the version what I am using is 4.1. the sooner the better? i.e. version 4.9.0. Please give any suggestion for me. Thanks, Chunki.

Exact match?

2014-08-19 Thread William Bell
If I have a long string, how do I match on 90% of the terms to see if there is a duplicate? If I add the field and index it, what is the best way to return 90%? # terms match # of terms in the field? -- Bill Bell billnb...@gmail.com cell 720-256-8076

Re: Syntax unavailable for parameter substitution Solr 3.5

2014-08-19 Thread deepaksshettigar
Thanks Chris, Yes I am comfortable writing Java code will try to give it a shot. Thanks Deepak -- View this message in context: http://lucene.472066.n3.nabble.com/Syntax-unavailable-for-parameter-substitution-Solr-3-5-tp4153197p4153722.html Sent from the Solr - User mailing list archive at

Solr, weblogic managed server and log4j logging

2014-08-19 Thread Croci Francesco Luigi (ID SWS)
Maybe some of you uses Solr with Weblogic and can help me... I have weblogic 12.1.3 and would like to deploy/run solr on a managed server. I started the node manager, created a server named server-solr and deployed solr(4.7.9). In the server start tab of the server configuration I added

Re: solr cloud going down repeatedly

2014-08-19 Thread Jakov Sosic
On 08/18/2014 08:38 PM, Shawn Heisey wrote: With an 8GB heap and UseConcMarkSweepGC as your only GC tuning, I can pretty much guarantee that you'll see occasional GC pauses of 10-15 seconds, because I saw exactly that happening with my own setup. This is what I use now:

sample Cell schema question

2014-08-19 Thread jmlucjav
In the sample schema.xml I can see this: !-- Main body of document extracted by SolrCell. NOTE: This field is not indexed by default, since it is also copied to text using copyField below. This is to save space. Use this field for returning and highlighting

Re: sample Cell schema question

2014-08-19 Thread jmlucjav
ok, I had not noticed text contains also the other metadata like keywords, description etc, nevermind! On Tue, Aug 19, 2014 at 11:28 AM, jmlucjav jmluc...@gmail.com wrote: In the sample schema.xml I can see this: !-- Main body of document extracted by SolrCell. NOTE: This

Re: BlendedInfixSuggester index write.lock failures on core reload

2014-08-19 Thread Varun Thacker
Hi, Yes this indeed is a bug. I am currently trying to get a patch for it. This is the Jira issue - https://issues.apache.org/jira/browse/SOLR-6246 On Thu, Aug 14, 2014 at 7:52 PM, Zisis Tachtsidis zist...@runbox.com wrote: Hi all, I'm using Solr 4.9.0 and have setup a spellcheck component

Re: sample Cell schema question

2014-08-19 Thread Aman Tandon
I have a question, does storing the data in copyfields save space? With Regards Aman Tandon On Tue, Aug 19, 2014 at 3:02 PM, jmlucjav jmluc...@gmail.com wrote: ok, I had not noticed text contains also the other metadata like keywords, description etc, nevermind! On Tue, Aug 19, 2014 at

Re: sample Cell schema question

2014-08-19 Thread jmlucjav
no it does not. Here the intent, I think, is not to duplicate stored info, as other metadata fields like author, keywords etc already are stored, if 'text' was stored (text is where all fields: content, author etc are copyed), then it would contain some duplicate info. On Tue, Aug 19, 2014 at

Re: sample Cell schema question

2014-08-19 Thread Aurélien MAZOYER
indexed means you can search it, stored means you can return the value to the user or highlight it. Both consum disk space. A copyfield is not a kind of special field : it is a directive that copies one field values to another field. They are many use cases for using copy fields. In the

Re: Exact match?

2014-08-19 Thread Erik Hatcher
Maybe use dismax for this? Something like q={!dismax qf=field_name mm=90%}query_string, or more verbosely and separately, q=query_stringdefType=dismaxmm=90% Erik On Aug 19, 2014, at 2:43 AM, William Bell billnb...@gmail.com wrote: If I have a long string, how do I match on 90% of

Re: faceted query with stats not working in solrj

2014-08-19 Thread tedsolr
That's a good suggestion, I hadn't checked that log file. What I found that works, is hitting these methods on the SolrQuery object: query.setGetFieldStatistics(true); query.setParam(stats.field, MyStatsFieldName); query.setParam(stats.facet, MyFacetFieldName); Now I see the stats in the

Re: Any recommendation for Solr Cloud version.

2014-08-19 Thread Mark Miller
On August 19, 2014 at 2:39:32 AM, Lee Chunki (lck7...@coupang.com) wrote: the sooner the better? i.e. version 4.9.0. Yes, certainly.  --  Mark Miller about.me/markrmiller

Re: Apache Solr Wiki

2014-08-19 Thread Julie . Voss
Can I also have access to the wiki? We are at the outset of a Solr/Hybris implementation. From: Mark Sun mark...@motionelements.com To: solr-user@lucene.apache.org Date: 08/18/2014 08:06 PM Subject:Apache Solr Wiki Dear Solr Wiki admin, We are using Solr for our

Re: solr cloud going down repeatedly

2014-08-19 Thread Shawn Heisey
On 8/19/2014 3:12 AM, Jakov Sosic wrote: Thank you for your comment. How did you test these settings? I mean, that's a lot of tuning and I would like to set up some test environment to be certain this is what I want... I included a section on tools when I wrote this page:

Near Realtime get

2014-08-19 Thread Philippe Soares
Hi, I tried the realtime get today in a solrcloud setup, and it's returning only a subset of my stored fields. Did I miss any parameter that would return all the fields ? Thanks for your help ! Philippe This email message and any attachments are confidential and may be privileged. If you are

Indexing and Querying MS SQL Server 2012 Spatial

2014-08-19 Thread Bostic, Alex
Hello I'm new to Solr: I have a SQL Server 2012 database with spatial columns (points/lines/polys) Do you have any resources to point to for the following Creating a Solr index of a sql server spatial table Bounding Box query (intersect) example, possibly with front-end from GMaps or OpenLayers

Re: Apache solr sink issue

2014-08-19 Thread Erick Erickson
While Gopal is correct, having a uniqueKey is not mandatory, if you're using SolrCloud, it _is_ necessary. And I don't know the internals of the Flume Solr Sink, but if it uses CloudSolrServer under the covers I'd be surprised if it worked without a uniqueKey defined. And I'd guess it does use

Re: Apache Solr Wiki

2014-08-19 Thread Erick Erickson
Julie: bq: Can I also have access to the wiki? Sure. Sou need to create a Wiki logon and let us know what that is before we can add you to the list. Best, Erick On Tue, Aug 19, 2014 at 6:54 AM, julie.v...@anixter.com wrote: Can I also have access to the wiki? We are at the outset of a

Substring and Case In sensitive Search

2014-08-19 Thread Nishanth S
Hi, I am very new to solr.How can I allow solr search on a string field case insensitive and substring?. Thanks, Nishanth

Re: Substring and Case In sensitive Search

2014-08-19 Thread Jack Krupansky
Substring search a string field using wildcard, *, at beginning and end of query term. Case-insensitive match on string field is not supported. Instead, copy the string field to a text field, use the keyword tokenizer, and then apply the lower case filter. But... review your use case to

Re: Apache Solr Wiki

2014-08-19 Thread Julie . Voss
user name: julievoss From: Erick Erickson erickerick...@gmail.com To: solr-user@lucene.apache.org Date: 08/19/2014 10:34 AM Subject:Re: Apache Solr Wiki Julie: bq: Can I also have access to the wiki? Sure. Sou need to create a Wiki logon and let us know what that is before

Replication of full index to replica after merge index into leader not working

2014-08-19 Thread Timothy Potter
Hi, Using the coreAdmin mergeindexes command to merge an index into a leader (SolrCloud mode on 4.9.0) and the replica does not do a snap pull from the leader as I would have expected. The merge into the leader worked like a charm except I had to send a hard commit after that (which makes sense).

Re: Replication of full index to replica after merge index into leader not working

2014-08-19 Thread Mark Miller
I’d just file a JIRA. Merge, like optimize and a few other things, were never tested or considered in early SolrCloud days. It’s used in the HDFS stuff, but in that case, the index is merged to all replicas and no recovery is necessary. If you want to make the local filesystem merge work well

Re: Replication of full index to replica after merge index into leader not working

2014-08-19 Thread Mark Miller
On August 19, 2014 at 1:33:10 PM, Mark Miller (markrmil...@gmail.com) wrote: sounds like we should write a test and make it work. Keeping in mind that when using a shared filesystem like HDFS or especially if using the MapReduce contrib, you probably won’t want this new behavior. --  Mark

Help with StopFilterFactory

2014-08-19 Thread heaven
Hi, I have the next text field: fieldType name=words_ngram class=solr.TextField omitNorms=false analyzer tokenizer class=solr.PatternTokenizerFactory pattern=[^\w]+ / filter class=solr.StopFilterFactory words=url_stopwords.txt ignoreCase=true / filter

Performance of Boolean query with hundreds of OR clauses.

2014-08-19 Thread SolrUser1543
I am using Solr to perform search for finding similar pictures. For this purpose, every image indexed as a set of descriptors ( descriptor is a string of 6 chars ) . Number of descriptors for every image may vary ( from few to many thousands) When I want to search for a similar image , I am

Re: Replication of full index to replica after merge index into leader not working

2014-08-19 Thread Timothy Potter
Was able to get around it for now sending the REQUESTRECOVERY command to the replica. Will open an improvement JIRA but not sure if it's worth it as the work-around is pretty clean (IMO). Tim On Tue, Aug 19, 2014 at 5:33 PM, Mark Miller markrmil...@gmail.com wrote: I’d just file a JIRA. Merge,

Re: Replicating Between Solr Clouds

2014-08-19 Thread reparker23
Are there any more OOB solutions for inter-SolrCloud replication now? Our indexing is so slow that we cannot rely on a complete re-index of data from our DB of record (SQL) to recover data in the Solr indices. -- View this message in context:

Index not respecting Omit Norms

2014-08-19 Thread Tim.Cardwell
Please reference the below images: http://lucene.472066.n3.nabble.com/file/n4153863/Schema.png http://lucene.472066.n3.nabble.com/file/n4153863/SolrDescriptionSchemaBrowser.png http://lucene.472066.n3.nabble.com/file/n4153863/SolrDescriptionDebugResults.png As you can see from the first

Question on Solr Relevancy using Okapi BM25F

2014-08-19 Thread rks_lucene
I am trying to get OkapiBM25F working over some press release articles I am indexing. The data has text portions spread across 3 fields - Title, Summary and Full Article. I would like to influence the standard BM25 by giving more weight to words in title and summary of the article than the full

Re: Performance of Boolean query with hundreds of OR clauses.

2014-08-19 Thread Jack Krupansky
A large number of query terms is definitely an anti-pattern and not a recommended use case for Solr, but I'm a little surprised that it takes minutes, as opposed to 10 to 20 seconds. Does your index fit entirely in the OS system memory available for file caching? IOW, are those few minutes

Re: Index not respecting Omit Norms

2014-08-19 Thread Chris Hostetter
: As you can see from the first image, the text field-type doesn't define the : omitNorms flag, meaning it is set to false. Also on the first image you can : see that the description field doesn't define the omitNorms flag, again : meaning it is set to false. (Default for omitNorms is false).

Re: Replicating Between Solr Clouds

2014-08-19 Thread Jeff Wartes
I¹ve been working on this tool, which wraps the collections API to do more advanced cluster-management operations: https://github.com/whitepages/solrcloud_manager One of the operations I¹ve added (copy) is a deployment mechanism that uses the replication handler¹s snap puller to hot-load a

Re: sample Cell schema question

2014-08-19 Thread Aman Tandon
Thanks Aurélien On Aug 19, 2014 3:00 PM, jmlucjav jmluc...@gmail.com wrote: In the sample schema.xml I can see this: !-- Main body of document extracted by SolrCell. NOTE: This field is not indexed by default, since it is also copied to text using copyField

Re: logging in solr

2014-08-19 Thread Aman Tandon
As you are using tomcat you can configure the log file name, folder,etc. by configuring the server.xml present in the Conf directory of tomcat. On Aug 19, 2014 4:17 AM, Shawn Heisey s...@elyograg.org wrote: On 8/18/2014 2:43 PM, M, Arjun (NSN - IN/Bangalore) wrote: Currently in my

Re: Apache Solr Wiki

2014-08-19 Thread Erick Erickson
Done, have fun! On Tue, Aug 19, 2014 at 10:07 AM, julie.v...@anixter.com wrote: user name: julievoss From: Erick Erickson erickerick...@gmail.com To: solr-user@lucene.apache.org Date: 08/19/2014 10:34 AM Subject:Re: Apache Solr Wiki Julie: bq: Can I also have access

Re: Help with StopFilterFactory

2014-08-19 Thread Jack Krupansky
What release of Solr? Do you have autoGeneratePhraseQueries=true on the field? And when you said But any of these does, did you mean But NONE of these does? -- Jack Krupansky -Original Message- From: heaven Sent: Tuesday, August 19, 2014 2:34 PM To: solr-user@lucene.apache.org

Inconsistent Solr Index Behavior

2014-08-19 Thread Ethan
A while back we added a span support for multi-value fields and did a full re-index for data spanning over 4 years. It worked perfectly for a month, and then suddenly results are not reliable anymore. We are noticing that the span is not working on most of the data and is returning wrong

Intermittent error indexing SolrCloud 4.7.0

2014-08-19 Thread S.L
Hi All, I get No Live SolrServers available to handle this request error intermittently while indexing in a SolrCloud cluster with 3 shards and replication factor of 2. I am using Solr 4.7.0. Please see the stack trace below. org.apache.solr.client.solrj.SolrServerException: No live

Re: Integrating Solr with HBase Using Lily Project

2014-08-19 Thread rulinma
try cloudear search. -- View this message in context: http://lucene.472066.n3.nabble.com/Integrating-Solr-with-HBase-Using-Lily-Project-tp4147868p4153906.html Sent from the Solr - User mailing list archive at Nabble.com.