Re: Changing Leadership in SolrCloud

2018-03-02 Thread Zahra Aminolroaya
Dear Mr. Shalin, Yes. I mean "state" in the Cluster State API and UI. Let me explain in detail what happened over the previous days: Suppose I have Collection A distributed across node1 (the leader), node2 and node3. I used the following command to block the Solr and ZooKeeper ports on node1 from being listened on:

Re: Rename solr to another name

2018-03-02 Thread Zheng Lin Edwin Yeo
Hi Shawn, Thanks for the info. I have managed to change the one that starts Solr, and it's working so far. Now I'm working on changing things like solr.xml and the JAR file in the dist directory, like solr-cell-6.5.1.jar to names like my-cell-6.5.1.jar. Can we change the name of those JAR file

Re: Filesystems supported by Solr

2018-03-02 Thread Shawn Heisey
On 3/2/2018 2:30 PM, Ritesh Chaman wrote: > I am trying to deploy solr on my ADLS subscription. can you tell me if that > is tested and is compatible. Walter says that this is storage related to Azure. https://azure.microsoft.com/en-us/services/data-lake-store/ If this is what you are talking

Re: Performance Implications of Different Routing Schemes

2018-03-02 Thread Shawn Heisey
On 3/2/2018 11:43 AM, Stephen Lewis wrote: > I'm wondering what information you may be able to provide on performance > implications of implicit routing VS composite ID routing. In particular, > I'm curious what the horizontal scaling behavior may be of implicit routing > or composite ID routing

Re: Updating documents and commit/rollback

2018-03-02 Thread Shawn Heisey
On 3/2/2018 10:39 AM, Christopher Schultz wrote: > The problem is that I'm updating the index after my SQL UPDATE(s) have > run, but before my SQL COMMIT occurs. I have had a problem where the SQL > fails and rolls-back, but the solrClient is not rolled-back. > > I'm a little wary of rolling-back

Re: solr url control

2018-03-02 Thread Shawn Heisey
On 3/2/2018 10:29 AM, Becky Bonner wrote: > We are trying to set up one Solr server for several applications, each with a > different collection. Is there a way to have 2 collections under one > folder and the URL be something like this: > http://mysolrinstance.com/solr/myParent1/collection1

Re: Rename solr to another name

2018-03-02 Thread Shawn Heisey
On 3/2/2018 4:07 PM, Zheng Lin Edwin Yeo wrote: > Does this mean that we have to recompile some of the JAR files that come > with Solr in order for it to work? As they have been hard-coded with things > like "solr-webapp"? I don't see it in any of the java source code, so a recompile wouldn't

Re: Filesystems supported by Solr

2018-03-02 Thread Walter Underwood
From a quick google search, ADLS seems like the Azure version of S3. Putting Solr indexes on S3 would be unbelievably slow, if it worked at all. Years ago, I accidentally put indexes on NFS and it was 100X slower. Tell us more about what you are trying to do. It is unusual to put Solr indexes

Re: Filesystems supported by Solr

2018-03-02 Thread Ritesh Chaman
Hi team I am trying to deploy solr on my ADLS subscription. can you tell me if that is tested and is compatible. Regards On Tue, Feb 20, 2018 at 2:22 PM, Ritesh Chaman wrote: > Hi team > > May I know what all filesystems are supported by Solr. For eg ADLS,WASB, > S3

Re: Rename solr to another name

2018-03-02 Thread Zheng Lin Edwin Yeo
Hi Shawn, Thanks for the reply. Regarding this: > The scripts included with Solr have this path hardcoded, and for that reason, Solr probably won't even start without manual script edits if the webapp directory is changed. Does this mean that we have to recompile some of the JAR files that

Re: Solr 7.2.0 CDCR Issue with TLOG collections

2018-03-02 Thread Webster Homer
It looks like the data is getting to the target servers. I see tlog files with the right timestamps. Looking at the timestamps on the documents in the collection none of the data appears to have been loaded. In the solr.log I see lots of /cdcr messages action=LASTPROCESSEDVERSION,

Re: Shard replica labels in Solr Admin graph?

2018-03-02 Thread Shawn Heisey
On 3/2/2018 12:51 PM, Scott Prentice wrote: > I made the adjustment to /etc/hosts, and now all's well. This also > fixed an underlying problem that I hadn't noticed at the time I sent > my query .. that only one Solr server was actually running. Turns out > that Zookeeper saw them all as 127.0.1.1

Re: Shard replica labels in Solr Admin graph?

2018-03-02 Thread Scott Prentice
Thanks Shawn! I made the adjustment to /etc/hosts, and now all's well. This also fixed an underlying problem that I hadn't noticed at the time I sent my query .. that only one Solr server was actually running. Turns out that Zookeeper saw them all as 127.0.1.1 and didn't let the other

Solr 7.2.0 CDCR Issue with TLOG collections

2018-03-02 Thread Webster Homer
We have been having strange behavior with CDCR on Solr 7.2.0. We have a number of replicas which have identical schemas. We found that TLOG replicas give much more consistent search results. We created a collection using TLOG replicas in our QA clouds. We have a locally hosted solrcloud with 2

RE: solr url control

2018-03-02 Thread Becky Bonner
So the thing is ... these collections all have very unique schemas and the data are unrelated to each other. And we do a lot of field queries on the content. We would not want the data from one collection to ever show up in another collection query. They are used by different audiences and

RE: solr url control

2018-03-02 Thread Becky Bonner
Sorry Webster - I meant to make this a new question ... but accidentally sent it. You wrote From: Webster Homer [mailto:webster.ho...@sial.com] Sent: Friday, March 2, 2018 12:20 PM To: solr-user@lucene.apache.org Subject: Re: NRT replicas miss hits and return duplicate hits when paging

Performance Implications of Different Routing Schemes

2018-03-02 Thread Stephen Lewis
Hello! I'm wondering what information you may be able to provide on performance implications of implicit routing VS composite ID routing. In particular, I'm curious what the horizontal scaling behavior may be of implicit routing or composite ID routing with and without the "/" param appended on.

Re: NRT replicas miss hits and return duplicate hits when paging solrcloud searches

2018-03-02 Thread Webster Homer
Becky, This should have been its own question. SolrCloud is different from standalone Solr: the configurations live in ZooKeeper and the index is created under SOLR_HOME. You might want to rethink your solution. What problem are you trying to solve with that layout? Would it be solved by creating

Re: NRT replicas miss hits and return duplicate hits when paging solrcloud searches

2018-03-02 Thread Webster Homer
Thanks Shawn. Commenting it out works to remove it. If I change the values e.g. change the 512 to 0, it does require a restart to take effect. Tested using statsCache set to org.apache.solr.search.stats.ExactSharedStatsCache, with the queryResultCache disabled, and I still see the problem with

Updating documents and commit/rollback

2018-03-02 Thread Christopher Schultz
Hey, folks. I've been a long-time Lucene user (running a hilariously-old 1.9.1 version forever), but I'm only just now getting into using Solr. My particular use-case is storing information about web-application users so they can be found more quickly than our current RDBMS-based search (SELECT

solr url control

2018-03-02 Thread Becky Bonner
We are trying to set up one Solr server for several applications, each with a different collection. Is there a way to have 2 collections under one folder and the URL be something like this: http://mysolrinstance.com/solr/myParent1/collection1

RE: NRT replicas miss hits and return duplicate hits when paging solrcloud searches

2018-03-02 Thread Becky Bonner
We are trying to set up one Solr server for several applications, each with a different collection. Is there a way to have 2 collections under one folder and the URL be something like this: http://mysolrinstance.com/solr/myParent1/collection1

Re: [poll] which loadbalancer are you using for SolrCloud

2018-03-02 Thread Shawn Heisey
On 3/2/2018 9:11 AM, David Hastings wrote: I'll have to take a look at HAProxy. How much faster than nginx is it? I know very little about nginx. Here's some information about haproxy performance. It's information they provide themselves, so configure your grain of salt accordingly. :)

Re: NRT replicas miss hits and return duplicate hits when paging solrcloud searches

2018-03-02 Thread Shawn Heisey
On 3/2/2018 9:28 AM, Webster Homer wrote: I've never disabled this before. I edited the solrconfig.xml setting the sizes to 0. I'm not sure if this is how to disable the cache or not. To completely disable a cache, either comment it out or remove it from the config.  I do not know

Re: Rename solr to another name

2018-03-02 Thread Shawn Heisey
On 3/2/2018 8:27 AM, Zheng Lin Edwin Yeo wrote: Are we able to rename the folder name like solr-webapp or the names like solr-jetty-context.xml to the customised name like my-webapp and my-jetty-context.xml? I'm currently using Solr 6.5.1, and will upgrade to Solr 7.2.1 soon. When people

Re: NRT replicas miss hits and return duplicate hits when paging solrcloud searches

2018-03-02 Thread Webster Homer
I am trying to test if enabling stats cache as suggested by Eric would also address this issue. I added this to my solrconfig.xml I executed queries and saw no differences. Then I re-indexed the data, again I saw no differences in behavior. Then I found this, SOLR-10952. It seems we need to

Re: [poll] which loadbalancer are you using for SolrCloud

2018-03-02 Thread Daniel Carrasco
I use HAProxy because it is much more configurable than Nginx, and I can send commands to a Solr collection and search the response for text to check whether the node is healthy. Nginx is very fast too, but its health checks are worse than HAProxy's. Greetings!! 2018-03-02 17:11 GMT+01:00 David Hastings

Re: [poll] which loadbalancer are you using for SolrCloud

2018-03-02 Thread David Hastings
I'll have to take a look at HAProxy. How much faster than nginx is it? To answer the question, I personally use nginx for load balancing/failovers and it's been good; I use the same nginx servers to load balance a Galera cluster as well. On Fri, Mar 2, 2018 at 11:09 AM, Shawn Heisey

Re: solo source build in local error

2018-03-02 Thread Shawn Heisey
On 3/2/2018 7:42 AM, ramyogi wrote: solr-repo/lucene-solr/build.xml:21: The following error occurred while executing this line:/solr-repo/lucene-solr/lucene/common-build.xml:623: java.lang.NullPointerException at java.util.Arrays.stream(Arrays.java:5004) at

Re: solo source build in local error

2018-03-02 Thread Erick Erickson
Ant 1.10.2 has a bug, are you using that version? 1.10.1 works fine. On Fri, Mar 2, 2018 at 6:42 AM, ramyogi wrote: > solr-repo/lucene-solr/build.xml:21: The following error occurred while > executing this line:/solr-repo/lucene-solr/lucene/common-build.xml:623: >

Re: [poll] which loadbalancer are you using for SolrCloud

2018-03-02 Thread Shawn Heisey
On 3/2/2018 6:13 AM, Bernd Fehling wrote: I would like to poll for the loadbalancer you are using for SolrCloud. Are you using a loadbalancer for SolrCloud? If yes, which one (SolrJ, HAProxy, Varnish, Nginx,...) and why? I use haproxy for Solr -- not SolrCloud.  It is an amazing and FAST

dataimporthandler ignoring configured timezone for indexStartTime?

2018-03-02 Thread Elizabeth Haubert
I'm getting incorrect reported time deltas on the admin console for "indexing since" and "started". It looks like DIH is converting the last start time to UTC: Last Update: 09:57:15 Indexing completed. Added/Updated: 94078 documents. Deleted 0 documents. (Duration: 06s) Requests: 1 ,

Re: Configuring Solr Data and Index directories

2018-03-02 Thread Shawn Heisey
On 3/2/2018 2:15 AM, YELESWARAPU, VENKATA BHAN wrote: While deploying Solr I just see one parameter where we provide solr_home path. For ex: -Dsolr.solr.home=/usr/local/clo/ven/solr_home 1) Is there any path where we can configure data and index directories. 2) Can we separate data

Re: SolrCloud 7.2.1 - UnsupportedOperationException thrown after query on specific environments

2018-03-02 Thread Andy Jolly
Erick Erickson wrote > Maybe your remote job server is using a different set of jars than > your local one? How does the remote job server work? The remote job server and our local are running the same code, and both our local and remote job server are making queries against the same

Is there a way to sort by conditional function in the Solr 7.2 JSON API?

2018-03-02 Thread Tom Van Cuyck
Hi, In the Solr 7.2 JSON API, when faceting over terms, I would like to sort the buckets over the average of a numerical property, as shown below curl http://localhost:8983/solr/core/select -d ' q=*:*& rows=0& wt=json& json.facet={ "field" : { "type" : "terms",

Rename solr to another name

2018-03-02 Thread Zheng Lin Edwin Yeo
Hi, Are we able to rename the folder name like solr-webapp or the names like solr-jetty-context.xml to the customised name like my-webapp and my-jetty-context.xml? I'm currently using Solr 6.5.1, and will upgrade to Solr 7.2.1 soon. Regards, Edwin

Re: 7.2.1 ExactStatsCache seems no longer functioning

2018-03-02 Thread Webster Homer
Your problem seems a lot like an issue I see with Near Real Time (NRT) replicas. I posted about it in this forum. I was told that a possible solution was to use the Global Stats feature. I am looking at testing that now. Have you tried using Tlog replicas? That fixed my issues with relevancy

solo source build in local error

2018-03-02 Thread ramyogi
solr-repo/lucene-solr/build.xml:21: The following error occurred while executing this line:/solr-repo/lucene-solr/lucene/common-build.xml:623: java.lang.NullPointerException at java.util.Arrays.stream(Arrays.java:5004) at java.util.stream.Stream.of(Stream.java:1000) at

[poll] which loadbalancer are you using for SolrCloud

2018-03-02 Thread Bernd Fehling
Dear list, I would like to poll for the loadbalancer you are using for SolrCloud. Are you using a loadbalancer for SolrCloud? If yes, which one (SolrJ, HAProxy, Varnish, Nginx,...) and why? If not, why not? Regards, Bernd

index mail with MailEntityProcessor

2018-03-02 Thread Dimitris Kardarakos
Hello everyone. I have created a collection and indexed mails from a Gmail mailbox. However, only plain text is indexed; neither HTML-formatted mail nor attachments are indexed. To index mails, I have included the libs below in solrconfig: regex=".*\.jar" /> regex="solr-cell-\d.*\.jar"

Re: Word / PDF document snippet rendering in search

2018-03-02 Thread Charlie Hull
On 02/03/2018 00:15, T Wild wrote: I'm interested in building a software system which will connect to various document sources, extract the content from the documents contained within each source, and make the extracted content available to a search engine such as Solr. This search engine will

Re: Solr thread problems

2018-03-02 Thread 苗海泉
Thank you for reading my question in detail. Let me explain. There are 1169 threads, while we have only 937 collections, and that is the number of threads in a single Solr node, not the total number of threads in the cluster. With more than a thousand collections, SolrCloud is in a poor condition, so we've

Configuring Solr Data and Index directories

2018-03-02 Thread YELESWARAPU, VENKATA BHAN
Information Classification: ** Limited Access Dear Team, While deploying Solr I just see one parameter where we provide solr_home path. For ex: -Dsolr.solr.home=/usr/local/clo/ven/solr_home 1) Is there any path where we can configure data and index directories. 2) Can we separate data