Re: Highlighting tag problem

2015-12-03 Thread Zheng Lin Edwin Yeo
Hi Scott, No, what's describe in SOLR-8334 is the tag appearing at the result, but at the wrong position. For this problem, the situation is that when I do a highlight query, some of the results in the resultset does not contain the search word in title, content_type, last_modified and url, as

Use multiple istance simultaneously

2015-12-03 Thread Gian Maria Ricci - aka Alkampfer
Suppose that for some reason you are not able to use SolrCloud and you are forced to use the old Master-Slave approach to guarantee High Availability. In such a scenario, if the master failed, application are still able to search with slaves, but clearly, no more data can be indexed until the

Re: Protect against duplicates with the Migrate statement

2015-12-03 Thread Shalin Shekhar Mangar
Hi Philippa, The migrate command actually splits the lucene index from the source and merges it into the target collection. Whereas, the de-duplication is applied only to incoming updates. So you see migrate is lower level than de-duplication and therefore they cannot work together. If you want

Re: Use multiple istance simultaneously

2015-12-03 Thread Shawn Heisey
On 12/3/2015 1:25 AM, Gian Maria Ricci - aka Alkampfer wrote: > In such a scenario could it be feasible to simply configure 2 or 3 > identical instance of Solr and configure the application that transfer > data to solr to all the instances simultaneously (the approach will be a > DIH incremental

schema fileds and Typefield in solr-5.3.1

2015-12-03 Thread kostali hassan
I start working in solr 5x by extract solr in D://solr and run solr server with : D:\solr\solr-5.3.1\bin>solr start ; Then I create a core in standalone mode : D:\solr\solr-5.3.1\bin>solr create -c mycore I need indexing from system files (word and pdf) and the schema API don’t have a field

Re: curl adapter in solarium 3x

2015-12-03 Thread kostali hassan
Thank you Gora , in fact Curl is default adapter for solarium-3x and I am not using zend framwork. 2015-12-03 11:05 GMT+00:00 Gora Mohanty : > On 3 December 2015 at 16:20, kostali hassan > wrote: > > How to force the connection to explicitly close

Re: curl adapter in solarium 3x

2015-12-03 Thread Gora Mohanty
On 3 December 2015 at 16:20, kostali hassan wrote: > How to force the connection to explicitly close when it has finished > processing, and not be pooled for reuse. > they are a way to tell to server may send a keep-alive timeout (with > default Apache install, it is

(sem assunto)

2015-12-03 Thread sabrina_rodrigues
Hello! I would like to stop receiving emails from lucene apache. Please remove my email from every list. Thanks!

Highlighting tag problem

2015-12-03 Thread Zheng Lin Edwin Yeo
Hi, I'm using Solr 5.3.0. Would like to find out, during a search, sometimes there is a match in content, but it is not highlighted (the word is not in the stopword list)? Did I make any mistakes in my configuration? This is my highlighting request handler from solrconfig.xml.

Solr 4.8 Overseer/Queue Processing

2015-12-03 Thread Durham, Russell
Hello, So I’m running SolrCloud 4.8 on a two node cluster and 3 zookeeper instances on a separate set of machines. The cluster has roughly 150 collections, each running as 1 shard with a replication factor of 2. There is also a management tool we have running that will periodically request

curl adapter in solarium 3x

2015-12-03 Thread kostali hassan
How to force the connection to explicitly close when it has finished processing, and not be pooled for reuse. they are a way to tell to server may send a keep-alive timeout (with default Apache install, it is 15 seconds or 100 requests, whichever comes first) - but cURL will just open another

Re: Solr Auto-Complete

2015-12-03 Thread Alessandro Benedetti
"Sounds good but I heard "/suggest" component is the recommended way of doing auto-complete" This sounds fantastic :) We "heard" that as well, we know what the suggest component does. The point is that you would like to retrieve the suggestions + some consistent payload in different fields.

Re: Grouping by simhash signature

2015-12-03 Thread Nikola Smolenski
On Wed, Dec 2, 2015 at 9:00 PM, Nickolay41189 wrote: > I try to implement NearDup detection by SimHash > algorithm in Solr. > Let's say: > 1) each document has a field /simhash_signature/ that stores a sequence of >

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-03 Thread Shawn Heisey
On 12/3/2015 8:09 AM, Kelly, Frank wrote: > Just wondering if folks have any suggestions on using Schema.xml vs. Managed > Schema going forward. > > Our deployment will be >> 3 Zk, 3 Shards, 3 replicas >> Copies of each collection in 5 AWS regions (EBS-backed EC2 instances) >> Planning at least

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-03 Thread Erick Erickson
Shawn: Managed schema is _used_ by "schemaless", but not the same thing at all. For "schemaless" (i.e. "data driven"), you need to include the update processor chains that do the guessing for you and makes use of the managed veatures to add fields to your schema. You can also use a managed

Re: Nested Docs issue

2015-12-03 Thread Bogdan Marinescu
Hi Mikhail, I would expect the same behaviour as for a database. Meaning if I have a field declared as an uniqueKey, then there should only be one document with that key, regardless if it has a child or not. If you add the childless document first and afterwards the child, then sol'r should

Re: Stop adding content in Solr through /update URL

2015-12-03 Thread Alexandre Rafalovitch
You could add 'enable' flag in the solrconfig.xml and then enable/disable it differently on different servers: https://wiki.apache.org/solr/SolrConfigXml#Enable.2Fdisable_components Example:

Re: Failed to create collection in Solrcloud

2015-12-03 Thread Mugeesh Husain
Thanks you Zheng, I ahve found the issue, there was server IP, when i check, one of my live_collection was pointed to localhost. So i did mention hostname into solr.xml. -- View this message in context:

Stop adding content in Solr through /update URL

2015-12-03 Thread pradeepandey24
We have master/slave architecture of Solr and we are updating index on slave server through ReplicationHandler. We want that no body can directly update data into slave server using /update from url. Can we do it? If yes please tell how. Thanks in advance Pradeep -- View this message in

Re: (sem assunto)

2015-12-03 Thread Ahmet Arslan
Hi Sabrina, Please send a blank e-mail to solr-user-unsubscr...@lucene.apache.org if you haven't already. If you still have problems, see : https://wiki.apache.org/solr/Unsubscribing%20from%20mailing%20lists Ahmet On Thursday, December 3, 2015 11:37 AM, "sabrina_rodrig...@iol.pt"

Re: Highlighting tag problem

2015-12-03 Thread Scott Stults
Edwin, Is this related to what's described in SOLR-8334? k/r, Scott On Thu, Dec 3, 2015 at 5:07 AM, Zheng Lin Edwin Yeo wrote: > Hi, > > I'm using Solr 5.3.0. > Would like to find out, during a search, sometimes there is a match in > content, but it is not highlighted

How to list all collections in solr-4.7.2

2015-12-03 Thread rashi gandhi
Hi all, I have setup two solr-4.7.2 server instances on two diff machines with 3 zookeeper severs in solrcloud mode. Now, I want to retrieve list of all the collections that I have created in solrcloud mode. I tried LIST command of collections api, but its not working with solr-4.7.2. Error:

Re: AW: Is it possible to sort on a BooleanField?

2015-12-03 Thread Chris Hostetter
: Guess then I must set indexed="true" ;) Is it true the BooleanField may not have docValues? Yeah ... open jira to add this still: SOLR-7264 FWIW: you could also use an EnumField (which does support docvalues) with 2 values ("true" and "false") ... that should be just as efficient as

Spellcheck error

2015-12-03 Thread Matt Pearce
Hi, We're using Solr 5.3.1, and we're getting a StringIndexOutOfBoundsException from the SpellCheckCollator. I've done some investigation, and it looks like the problem is that the corrected string is shorter than the original query. For example, the search term is "theatre", the suggested

Re: Nested Docs issue

2015-12-03 Thread Mikhail Khludnev
Hello Bogdan, You described how it works now. That's how it was implemented. And I can explain why it was done so. Could you please describe the expected behavior for you? Notice, I want to enforce nested (block) behavior always in scope of https://issues.apache.org/jira/browse/SOLR-5211. So,

Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-03 Thread Kelly, Frank
Just wondering if folks have any suggestions on using Schema.xml vs. Managed Schema going forward. Our deployment will be > 3 Zk, 3 Shards, 3 replicas > Copies of each collection in 5 AWS regions (EBS-backed EC2 instances) > Planning at least 1 Billion objects indexed (currently < 100 million)

Nested Docs issue

2015-12-03 Thread Bogdan Marinescu
Hi, I have a problem with nested docs. If I create a document with id: 1 and fieldA:sometext and then add it to sol'r, I get one doc in sol'r. Afterwards if I add a child/nested doc to this document I additionally get a _root_:1 to the document but the problem is I now have two documents

Re: How to list all collections in solr-4.7.2

2015-12-03 Thread Pushkar Raste
Will 'wget http://host;port//solr/admin/collections?action=LIST' help? On 3 December 2015 at 12:12, rashi gandhi wrote: > Hi all, > > I have setup two solr-4.7.2 server instances on two diff machines with 3 > zookeeper severs in solrcloud mode. > > Now, I want to

Using properties placeholder ${someProperty} for xml node attribute in solrconfig

2015-12-03 Thread Pushkar Raste
Hi, I want to make turning filter cache on/off configurable (I really have a use case to turn off filter cache), can I use properties placeholders like ${someProperty} in the filter cache config. i.e. In short, can I use properties placeholders for attributes for xml node in solrconfig. Follow

Can someone put up a guide to integrate uima with solr

2015-12-03 Thread vaibhavlella
-- View this message in context: http://lucene.472066.n3.nabble.com/Can-someone-put-up-a-guide-to-integrate-uima-with-solr-tp4243464.html Sent from the Solr - User mailing list archive at Nabble.com.

RE: Spellcheck error

2015-12-03 Thread Dyer, James
Matt, Can you give some information about how your spellcheck field is analyzed and also if you're using a custom query converter. Also, try and place the bare terms you want checked in spellcheck.q (ex, if your query is q=+movie +theatre, then spellcheck.q=movie theatre). Does it work in

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-03 Thread Jeff Wartes
I’ve never used the managed schema, so I’m probably biased, but I’ve never seen much of a point to the Schema API. I need to make changes sometimes to solrconfig.xml, in addition to schema.xml and other config files, and there’s no API for those, so my process has been like: 1. Put the entire

solr with Isilon HDFS

2015-12-03 Thread Gaurav Patel
Hi We are facing below challenge: Product Use Case: Analytics Hardware: 3 Physical Machines with 60 cpu cores and 512 GB RAM each. EMC Isilon Appliance with PB storage. It can be accessed via HDFS or NFS. Questions: Can we use solr cloud for this setup? How many instances of SOLR are

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-03 Thread Erick Erickson
It Depends (tm). Managed Schema is way cool if you have a front end that lets you manipulate the schema via a browser or other program. There's really no other way to deal with changing the schema from a browser without allowing uploading xml files, which is a security problem. Trust me on this

Re: Collection Name is different than what i specify in API

2015-12-03 Thread Chris Hostetter
: I am using solr 4.10 in cloud mode. I am able to create collection using : : ./zkcli.sh -cmd upconfig -confdir $CONF_DIR -confname techproducts : -collection techproducts -z $ZOOKEEPER_URL && curl : "http://$SOLR_URL/solr/admin/collections?action=CREATE=techproducts=compositeId=1=1; : :

Re: Spellcheck error

2015-12-03 Thread Matt Pearce
Hi James, Thanks for responding. The query we were testing looks like this: http://localhost:8983/solr/testdata/select?q=theatre=theatre I did some further investigation, after discovering that omitting the spellcheck.q parameter stops the error appearing, and it looks like synonym expansion

Wildcard searches - field:aaaa* works but field:a*a does not

2015-12-03 Thread Kelly, Frank
Hello Lucene Folks, Newbie here - I've found how Solr does Wildcard searches of the form field:a* using the EdgeNGramFilterFactory https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.EdgeNGramFilterFactory but I can't seem to dig up how to support wildcards in the

Collection Name is different than what i specify in API

2015-12-03 Thread abhayd
hi I am using solr 4.10 in cloud mode. I am able to create collection using ./zkcli.sh -cmd upconfig -confdir $CONF_DIR -confname techproducts -collection techproducts -z $ZOOKEEPER_URL && curl "http://$SOLR_URL/solr/admin/collections?action=CREATE=techproducts=compositeId=1=1; Instead of

Re: Grouping by simhash signature

2015-12-03 Thread Chris Hostetter
: I try to implement NearDup detection by SimHash I'm not really familiar with simhash, but based on your description of it, i'm not sure that any of Solr's deduplication, grouping, or collapsing features will really help you here... : 1) each document has a field /simhash_signature/ that

Re: Using properties placeholder ${someProperty} for xml node attribute in solrconfig

2015-12-03 Thread Erick Erickson
Hmmm, never tried it. You can check by looking at the admin UI>>plugins/stats>>cahces>>filterCache with a property defined like you want. And assuming that works, yes. the filterCache is turned off if its size is zero. Another option might be to add {!cache=false} to your fq clauses on the

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-03 Thread Upayavira
They are different beasts, but I bet on the managed schema winning in the long run. With the bulk API, you can post a heap of fields/etc in one go, so basically, rather than pushing the schema to Zookeeper, you push it to Solr. Look at Solr 5.4 when it comes out shortly. It'll change the way

Re: Solr 5: Schema.xml vs. Managed Schema - which is advisable?

2015-12-03 Thread Don Bosco Durai
My experience is, once managed-schema is created, then schema.xml even if present is ignored. When both are present, you will get a warning in the Solr log. I have stopped using schema.xml. Actually, I use it once, start Solr and after it generates managed-schema, I export it and pretty much

Re: How to list all collections in solr-4.7.2

2015-12-03 Thread Jeff Wartes
Looks like LIST was added in 4.8, so I guess you’re stuck looking at ZK, or finding some tool that looks in ZK for you. The zkCli.sh that ships with zookeeper would probably suffice for a one-off manual inspection: https://zookeeper.apache.org/doc/trunk/zookeeperStarted.html#sc_ConnectingT

Re: schema fileds and Typefield in solr-5.3.1

2015-12-03 Thread Erick Erickson
Have you looked at Solr Cell? See: https://cwiki.apache.org/confluence/display/solr/Uploading+Data+with+Solr+Cell+using+Apache+Tika When working with things like MS word, there are a couple of things to be aware of: 1> there has to be a mapping between the meta-data (last_edited, author,

Re: Wildcard searches - field:aaaa* works but field:a*a does not

2015-12-03 Thread Erik Hatcher
You don't need to ngram at all if your queries themselves are going to be wildcarded. Erik > On Dec 3, 2015, at 17:21, Kelly, Frank wrote: > > Hello Lucene Folks, > > Newbie here - I've found how Solr does Wildcard searches of the form > field:a* using

Re: Collection Name is different than what i specify in API

2015-12-03 Thread Upayavira
On Thu, Dec 3, 2015, at 10:49 PM, Chris Hostetter wrote: > > : I am using solr 4.10 in cloud mode. I am able to create collection > using > : > : ./zkcli.sh -cmd upconfig -confdir $CONF_DIR -confname techproducts > : -collection techproducts -z $ZOOKEEPER_URL && curl > : >

RE: Help With Phrase Highlighting

2015-12-03 Thread Teague James
Thanks everyone who replied! The FastVectorHighlighter did the trick. Here is how I configured it: In solrconfig.xml: In the requestHandler I added: on text true 100 In schema.xml: I modified the text field: I restarted Solr, re-indexed the documents and tested. All phrases are correctly