Difference in scoring for Solr 4.8.x vs 8.6.2 (BoostQuery vs FunctionScoreQuery)

2020-10-24 Thread Amrit Sarkar
and the scoring implementations between these two major versions (4 versions apart), and it's taking a while for us to figure out. Kindly requesting help on this. Anything helps! Explanation of the scoring or point me to a direction to understand and read about it? Amrit Sarkar Engineer | Search and Kubernetes https://seamadic.com/ Twitter https://twitter.com/sarkaramrit2 LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2

Multi-synonyms with sow=false, and Minimum match

2020-07-26 Thread Amrit Sarkar
together, with sow=false? Or the recommendation by Doug T. still holds? Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2

Prefix for IndexBasedSpellChecker in Solr

2020-06-06 Thread Amrit Sarkar
check of incoming queries with a (hardened) minimum prefix chars, e.g. *mushroom *doesn't spell check with *washroom *with *IndexBasedSpellChecker*? Is there any? If not, what is the recommended way apart from using DirectSolrSpellChecker itself? Thanks in advance. Amrit Sarkar Search Engineer

Re: [CDCR]Unable to locate core

2019-05-19 Thread Amrit Sarkar
Sounds legit to me. Can you create a Jira and list down the problem statement and design solution there. I am confident it will attract committers' attention and they can review the design and provide feedback. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter

Re: CDCR one source multiple targets

2019-05-19 Thread Amrit Sarkar
Thanks, Arnold, Is the documentation not clear with the manner multiple CDCR targets can be configured? Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https

Re: CDCR - shards not in sync

2019-05-19 Thread Amrit Sarkar
Hi Jay, Can you look at the logs and identify if there are any exceptions occurring at particular Solr nodes the lagging shard is hosted? Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com

[CDCR]Unable to locate core

2019-05-19 Thread Amrit Sarkar
> > Thanks Natrajan, > > Solid analysis and I saw the issue being reported by multiple users in > past few months and unfortunately I baked an incomplete code. > > I think the correct way of solving this issue is to identify the correct > base-url for the respective core we need to trigger

Re: Bidirectional CDCR not working

2019-03-14 Thread Amrit Sarkar
Hi Arnold, You need "cdcr-processor-chain" definitions in solrconfig.xml on both clusters' collections. Both clusters need to act as source and target. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks Linke

Re: Negative CDCR Queue Size?

2018-11-09 Thread Amrit Sarkar
osts.get(state.getZkHost()) == null) { > hosts.add(state.getZkHost(), new NamedList()); > } > ((NamedList) hosts.get(state.getZkHost())).add(state.getTargetCollection(), > queueStats); > } > rsp.add(CdcrParams.QUEUES, hosts); > > Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9

Re: partial update in solr

2018-10-29 Thread Amrit Sarkar
/AtomicUpdateProcessorFactory.html> are broken and I am working on fixing it. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2 On Mo

Non-Solr-related | Reporting abuse | Harshit Arora

2018-09-27 Thread Amrit Sarkar
of Technology Jaipur, India Apologies in advance and kindly ignore if this doesn't concern you. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com

Re: SolrCloud CDCR with 3+ DCs

2018-09-07 Thread Amrit Sarkar
Yeah, I am not sure about how the Authentication band aid feature will work, the mentioned stackoverflow link. It is about time we include basic authentication support in CDCR. On Thu, 6 Sep 2018, 8:41 pm cdatta, wrote: > Hi Amrit, Thanks for your response. > > We wiped out our complete

Solr CDCR replication not working

2018-09-07 Thread Amrit Sarkar
Basic Authentication in clusters is not supported as of today in CDCR. On Fri, 7 Sep 2018, 4:53 pm Mrityunjaya Pathak, wrote: > I have setup two solr cloud instances in two different Datacenters Target > solr cloud machine is copy of source machine with basicAuth enabled on > them. I am unable

Re: SolrCloud CDCR with 3+ DCs

2018-08-20 Thread Amrit Sarkar
cluster server logs? Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2 On Fri, Aug 17, 2018 at 11:49 PM cdatta wrote: > Any pointer wo

Re: SolrCloud CDCR issue

2018-08-14 Thread Amrit Sarkar
Hi, Yeah if you look above I have stated the same jira. I see your question on 3DCs with Active-Active scenario, will respond there. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com

Re: SolrCloud CDCR issue

2018-08-13 Thread Amrit Sarkar
To the concerned, I am afraid in informing, Authentication are not supported between Solr clusters: https://issues.apache.org/jira/browse/SOLR-11959. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https

Re: SolrCloud CDCR issue

2018-08-10 Thread Amrit Sarkar
. Start CDCR on cluster-1 Go to logs and verify "forwarding has been started" 8. Start CDCR on cluster-2 Do the same sanity check. I understand this is unnecessarily complex but it is the manner CDCR was designed in the beginning. Please give it a shot and let us know. Amrit Sarkar Searc

Re: SolrCloud CDCR issue

2018-08-10 Thread Amrit Sarkar
was started first and then target. You need to shut down all the nodes both at source and target. Get the targe nodes up, all of them before starting the source ones. Logs will be initialized positively. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http:/

Anthill Inside and The Fifth Elephant Bengaluru India 2018 Edition

2018-07-23 Thread Amrit Sarkar
ensorFlow (27 July).The Fifth Elephant: 26 and 27 July Registration link with 10% discount on conference: https://fifthelephant.in/2018/?code=SG65IC <https://fifthelephant.in/2018/?code=SG65IC>For more details about any of these, write to i...@hasgeek.com or call 7676332020.* Amrit Sarkar Search

Re: CDCR traffic

2018-07-10 Thread Amrit Sarkar
rect. SSL and Kerberized cluster will have the payload/updates encrypted. Thank you for pointing it out. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit

Re: CDCR Custom Document Routing

2018-07-02 Thread Amrit Sarkar
Jay, Can you sample delete command you are firing at the source to understand the issue with Cdcr. On Tue, 3 Jul 2018, 4:22 am Jay Potharaju, wrote: > Hi > The current cdcr setup does not work if my collection uses implicit > routing. > In my testing i found that adding documents works without

Re: tlogs not deleting

2018-06-25 Thread Amrit Sarkar
Brian, If you are still facing the issue after disabling buffer, kindly shut down all the nodes at source and then start them again, stale tlogs will start purging themselves. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks

Re: CDCR traffic

2018-06-25 Thread Amrit Sarkar
Hi Rajeswari, No it is not. Source forwards the update to the Target in classic manner. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2

Re: CDCR broken for Mixed Replica Collections

2018-04-25 Thread Amrit Sarkar
Pardon, * I have added extensive tests for both the use-cases. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2 On Thu, Apr 26, 2018 at 3

Re: CDCR broken for Mixed Replica Collections

2018-04-25 Thread Amrit Sarkar
for pull type replicas can be designed better, apart from that, if this is urgent need for you, please apply the patches for your packages and probably give a shot. I will added extensive tests for both the use-cases. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter

Re: Weird transaction log behavior with CDCR

2018-04-17 Thread Amrit Sarkar
Chris, Try to index few dummy documents and analyse if the tlogs are getting cleared or not. Ideally on the restart, it clears everything and keeps max 2 tlog per data folder. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks

Re: CdcrReplicator Forwarder not working on some shards

2018-04-17 Thread Amrit Sarkar
Susheel, At the time of core reload, logs must be complaining or atleast pointing to some direction. Each leader of shard is responsible to spawn a threadpool for cdcr replicator to get the data over. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http

Re: Weird transaction log behavior with CDCR

2018-04-17 Thread Amrit Sarkar
Chris, After disabling the buffer on source, kind shut down all the nodes of source cluster first and then start them again. The tlogs will be removed accordingly. BTW CDCR doesn't abide by 100 numRecordsToKeep or 10 numTlogs. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269

Re: Does CDCR Bootstrap sync leaves replica's out of sync

2018-04-16 Thread Amrit Sarkar
Hi Susheel, Pretty sure you are talking about this: https://issues.apache.org/jira/browse/SOLR-11724 Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com

Re: solrcloud Auto-commit doesn't seem reliable

2018-03-23 Thread Amrit Sarkar
Elaino, When you say commits not working, the solr logs not printing "commit" messages? or documents are not appearing when we search. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.li

Re: CDCR performance issues

2018-03-23 Thread Amrit Sarkar
Susheel, That is the correct behavior, "commit" operation is not propagated to target and the documents will be visible in the target as per commit strategy devised there. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidwork

Re: CDCR performance issues

2018-03-23 Thread Amrit Sarkar
ly batched in transit from the > source to the target and instead each document is posted separately? The batchsize and schedule regulate how many docs are sent across target. This has more details: https://lucene.apache.org/solr/guide/7_2/cdcr-config.html#the-replicator-element Amrit Sarkar Se

Re: CDCR Invalid Number on deletes

2018-03-20 Thread Amrit Sarkar
Hi Chris, Sorry I was off work for few days and didn't follow the conversation. The link is directing me to https://issues.apache.org/jira/projects/SOLR/issues/SOLR-12063. I think we have fixed the issue stated by you in the jira, though the symptoms were different than yours. Amrit Sarkar

Re: Solr 7.2.0 CDCR Issue with TLOG collections

2018-03-07 Thread Amrit Sarkar
a very dirty patch which fixes the problem with basic tests to prove it works. I will try to polish and finish this as soon as possible. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkar

Re: CDCR Invalid Number on deletes

2018-03-07 Thread Amrit Sarkar
nwhile if it is possible for you to apply the patch, build the jar and try it out, please do and let us know. For, *SOLR-9394* <https://issues.apache.org/jira/browse/SOLR-9394>, if you can comment on the JIRA and post the sample docs, solr logs, relevant information, I can give it a thoro

Re: Solr CDCR doesn't work if the authentication is enabled

2018-03-05 Thread Amrit Sarkar
Nice. Can you please post the details on the JIRA too if possible: https://issues.apache.org/jira/browse/SOLR-11959 and we can probably put up a small patch of adding this bit of information in official documentation. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com

Re: Solr 6.6.0 - Error: can not use FieldCache on multivalued field: categoryLevels

2018-02-26 Thread Amrit Sarkar
re memory? Not sure, someone else can weigh in. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2 On Mon, Feb 26, 2018 at 7:37 PM, Vincenzo

Re: Identify Reference Leak in Custom Code related to Solr

2017-12-18 Thread Amrit Sarkar
Emir, Solr version: 6.6, SolrCloud We followed the instructions on README.md on the github project. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com

Identify Reference Leak in Custom Code related to Solr

2017-12-18 Thread Amrit Sarkar
% of the total heap memory allocated (16GB). Looking forward to positive responses. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2

Re: Issue with CDCR bootstrapping in Solr 7.1

2017-12-05 Thread Amrit Sarkar
Tom, Thank you for trying out bunch of things with CDCR setup. I am successfully able to replicate the exact issue on my setup, this is a problem. I have opened a JIRA for the same: https://issues.apache.org/jira/browse/SOLR-11724. Feel free to add any relevant details as you like. Amrit Sarkar

Re: Issue with CDCR bootstrapping in Solr 7.1

2017-11-30 Thread Amrit Sarkar
Tom, (and take care not to restart the leader node otherwise it will replicate > from one of the replicas which is missing the index). How is this possible? Ok I will look more into it. Appreciate if someone else also chimes in if they have similar issue. Amrit Sarkar Search Engineer Lucidwo

Re: Issue with CDCR bootstrapping in Solr 7.1

2017-11-30 Thread Amrit Sarkar
get after the bootstrapping is done. Reloading makes the core opening a new searcher. While explicit commit is issued at target leader after the BS is done, follower are left unattended though the docs are copied over. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twi

Re: Issue with CDCR bootstrapping in Solr 7.1

2017-11-30 Thread Amrit Sarkar
the entire index of Leader unless the difference in versions in docs are more than "numRecordsToKeep", which is default 100, unless you have modified in solrconfig.xml. Looking forward to your analysis. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Tw

Re: Leading wildcard searches very slow

2017-11-17 Thread Amrit Sarkar
Sundeep, You would like to explore http://lucene.apache.org/solr/6_6_1/solr-core/org/apache/solr/analysis/ReversedWildcardFilterFactory.html here probably. Thanks Amrit Sarkar On 18 Nov 2017 6:06 a.m., "Sundeep T" <sundeep@gmail.com> wrote: > Hi, > > We have seve

Re: Index time boosting

2017-11-14 Thread Amrit Sarkar
Hi Venkat, FYI: Index time boosting has been deprecated from latest versions of Solr: https://issues.apache.org/jira/browse/LUCENE-6819. Not sure which version you are on, but best consider the comments on the JIRA before using it. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269

Re: SOLR not deleting records

2017-11-14 Thread Amrit Sarkar
A little more information would be beneficial; COLO1 and COLO2 are collections? if yes, both have same configurations and you are positively issuing deletes to the IDs already present in index etc. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http

Re: How to routing document for send to particular shard range

2017-11-13 Thread Amrit Sarkar
t a shard, which will divide the index and hence the hash range. I will strongly recommend you to reconsider your SolrCloud design technique for your use-case. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://

Re: Nested facet complete wrong counts

2017-11-10 Thread Amrit Sarkar
Kenny, This is a known behavior in multi-sharded collection where the field values belonging to same facet doesn't reside in same shard. Yonik Seeley has improved the Json Facet feature by introducing "overrequest" and "refine" parameters. Kindly checkout Jira:

Re: How to routing document for send to particular shard range

2017-11-10 Thread Amrit Sarkar
oute.field" is collection-specific property maintained at zookeeper (state.json / clusterstate.json). https://lucene.apache.org/solr/guide/6_6/collections-api.html#CollectionsAPI-create I highly recommend not to alter core.properties manually when dealing with SolrCloud and instead relying o

Re: Make search on the particular field to be case sensitive

2017-11-09 Thread Amrit Sarkar
tokenizer that breaks up the input, WhitespaceTokenizer for > instance. > > Best, > Erick > > On Thu, Nov 9, 2017 at 3:24 AM, Amrit Sarkar <sarkaramr...@gmail.com> > wrote: > > Behavior of the field values is defined by fieldType analyzer > declaration. >

Re: Make search on the particular field to be case sensitive

2017-11-09 Thread Amrit Sarkar
is "string". *You need to write analyzer chain for the same fieldType and don't include:* LowerCaseFilterFactory is responsible lowercase the token coming in query and while indexing. Something like this will work for you: I listed "KeywordTokenizerFactory" considering this is st

Re: Atomic Updates with SolrJ

2017-11-09 Thread Amrit Sarkar
Hi Martin, I tested the same application SolrJ code on my system, it worked just fine on Solr 6.6.x. My Solrclient is "CloudSolrJClient", which I think doesn't make any difference. Can you show the response and field declarations if you are continuously facing the issue. Amrit Sar

Re: How to routing document for send to particular shard range

2017-11-07 Thread Amrit Sarkar
d from each document to identify a shard where the document belongs. If > the field specified is missing in the document, however, the document will > be rejected. You could also use the _route_ parameter to name a specific > shard. Amrit Sarkar Search Engineer Lucidworks, Inc. 415

Re: Streaming Expression usage

2017-11-07 Thread Amrit Sarkar
ng expression: expr=rollup( > > search(collection1, > > zkHost="localhost:9983", > > qt="/export", > > q="*:*", > > fq=a_s:filter_a > > fl="id,a_s,a_i,a_f", > >

Re: Long blocking during indexing + deleteByQuery

2017-11-07 Thread Amrit Sarkar
Maybe not a relevant fact on this, but: "addAndDelete" is triggered by "*Reordering of DBQs'; *that means there are non-executed DBQs present in the updateLog and an add operation is also received. Solr makes sure DBQs are executed first and than add operation is executed. Amri

Re: Incorrect ngroup count

2017-11-07 Thread Amrit Sarkar
values in same shard. Can you share what is the architecture of the setup? Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 Medium: https://medium.com/@sarkaramrit2 On Tue, Nov 7, 201

Re: SolrClould 6.6 stability challenges

2017-11-04 Thread Amrit Sarkar
ommit strategy in indexing. With auto-commit so highly set, are you committing after batch, if yes, what's the number. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Sat, No

Re: Streaming Expression - cartesianProduct

2017-11-01 Thread Amrit Sarkar
Following Pratik's spot-on comment and not really related to your question, Even the "partitionKeys" parameter needs to be specified the "over" field while using "parallel" streaming. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.

Solr require both hl.fl and df same for correct highlighting.

2017-10-26 Thread Amrit Sarkar
tokenStream, has analysed tokens: "vacat" which obviously doesn't match with extracted term. Why the df, qf, values concern with what we pass in "hl.fl"? Isn't the query which is to be highlighted be analysed by field passed in "hl.fl", but then multiple fields can be passed in "hl.fl". Just wondering how it is suppose to be done. Any explanation will be fine. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2

Re: Merging is not taking place with tiered merge policy

2017-10-23 Thread Amrit Sarkar
Chandru, Didn't try the above config bu whyt have you defined both "mergePolicy" and "mergePolicyFactory"? and pass different values for same parameters? > 10 > 1 > > > 10 > 10 > > Amrit Sarkar Search Engineer Lucidworks, Inc.

Re: Using pint field as uniqueKey

2017-10-17 Thread Amrit Sarkar
https://issues.apache.org/jira/browse/SOLR-10829: IndexSchema should enforce that uniqueKey field must not be points based The description tells the real reason. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https

Re: solr 7.0: What causes the segment to flush

2017-10-17 Thread Amrit Sarkar
ing observation, Nawab, with ramBufferSizeMB=20G, you are getting 20GB segments on 6.5 or less? a GB? Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Tue, Oct 17, 2017 at 12:48 PM,

Re: Howto verify that update is "in-place"

2017-10-17 Thread Amrit Sarkar
James, @Amrit: Are you saying that the _version_ field should not change when > performing an atomic update operation? It should change. a new version will be allotted to the document. I am not that sure about in-place updates, probably a test run will verify that. Amrit Sarkar Search Engin

Re: Using pint field as uniqueKey

2017-10-17 Thread Amrit Sarkar
ow new SolrException(ErrorCode.SERVER_ERROR, msg); } Not sure the reason behind; someone else can weigh in here, but PointFields are not allowed to be unique keys, probably because how they are structures and stored on disk. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitte

Re: Howto verify that update is "in-place"

2017-10-17 Thread Amrit Sarkar
Hi James, As for each update you are doing via atomic operation contains the "id" / "uniqueKey". Comparing the "_version_" field value for one of them would be fine for a batch. Rest, Emir has list them out. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9

Re: HOW DO I UNSUBSCRIBE FROM GROUP?

2017-10-16 Thread Amrit Sarkar
Hi, If you wish the emails to "stop", kindly "UNSUBSCRIBE" by following the instructions on the http://lucene.apache.org/solr/community.html. Hope this helps. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidw

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
ent-not-allowed-in-prolog-error https://stackoverflow.com/questions/3030903/content-is-not-allowed-in-prolog-when-parsing-perfectly-valid-xml-on-gae Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
"text/html", try with both. If you get past this hurdle this hurdle, let me know. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Fri, Oct 13, 2017 at 8:22 PM, Kevin La

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
pardon: [solr-home]/server/log/solr.log Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Fri, Oct 13, 2017 at 8:10 PM, Amrit Sarkar <sarkaramr...@gmail.com> wrote: &

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
ah oh, dockers. They are placed under [solr-home]/server/log/solr/log in the machine. I haven't played much with docker, any way you can get that file from that location. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
Hi Kevin, Can you post the solr log in the mail thread. I don't think it handled the .md by itself by first glance at code. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
; String type = rawContentType.split(";")[0]; if(typeSupported(type) || "*".equals(fileTypes)) { String encoding = conn.getContentEncoding(); Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks Link

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
pendocument.text"); mimeMap.put("ott", "application/vnd.oasis.opendocument.text"); mimeMap.put("odp", "application/vnd.oasis.opendocument.presentation"); mimeMap.put("otp", "application/vnd.oasis.opendocument.presentation"); mimeMap.put("o

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
his is not the issue. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Fri, Oct 13, 2017 at 7:04 PM, Amrit Sarkar <sarkaramr...@gmail.com> wrote: > Kevin, > > Jus

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
Strange, Can you add: "text/html;charset=utf-8". This is wiki.apache.org page's Content-Type. Let's see what it says now. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramri

Re: solr 7.0.1: exception running post to crawl simple website

2017-10-13 Thread Amrit Sarkar
= (HttpURLConnection) u.openConnection(); Can you check at your webpage level headers are properly set and it has key "content-type". Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/i

Re: Solr related questions

2017-10-13 Thread Amrit Sarkar
rtHandler), depends on the file format, its csv, xml, json, but mind it is single threaded. Hope this clarifies some of it. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Fri, O

Re: Getting user-level KeeperException

2017-10-12 Thread Amrit Sarkar
Gunalan, Zookeeper throws KeeperException at /overseer for most of the solr issues, namely indexing. Sync the timestamp of zookeeper error with solr log; the problem lies there most probably. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com

Re: Very high number of deleted docs

2017-10-04 Thread Amrit Sarkar
Hi Markus, Emir already mentioned tuning *reclaimDeletesWeight which *affects segments about to merge priority. Optimising index time by time, preferably scheduling weekly / fortnight / ..., at low traffic period to never be in such odd position of 80% deleted docs in total index. Amrit Sarkar

Re: CDCR does not work

2017-09-28 Thread Amrit Sarkar
Pretty much what Webster and Erick mentioned, else please try the pdf I attached. I followed the official documentation doing that. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com

Re: Highlighting Performance improvement suggestions required - Solr 6.5.1

2017-08-09 Thread Amrit Sarkar
://www.youtube.com/watch?v=tv5qKDKW8kk Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Wed, Aug 9, 2017 at 7:45 PM, sasarun <sasa...@gmail.com> wrote: > Hi All, > > I fo

Re: SOLR Metric Reporting to graphite

2017-08-06 Thread Amrit Sarkar
Hi, I didn't had a chance to go through the steps you are doing, but I followed the one written by Varun Thacker via influxdb: https://github.com/vthacker/solr-metrics-influxdb, and it works fine. Maybe it can be of some help. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269

Re: Sum of double fields in JSON Facet

2017-07-25 Thread Amrit Sarkar
Zheng, You may want to check https://issues.apache.org/jira/browse/SOLR-7452. I don't know whether they are absolutely related but I am sure I have seen complaints and enquiries regarding not precise statistics with JSON Facets. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269

Re: atomic updates in conjunction with optimistic concurrency

2017-07-21 Thread Amrit Sarkar
ersion_field_s", map); > document.addField("_version_", > response.getResults().get(0).get("_version_")); > docs.add(document); > updateRequest = new UpdateRequest(); > updateRequest.add(docs); > client.request(updateRequest, collection); >

Re: atomic updates in conjunction with optimistic concurrency

2017-07-21 Thread Amrit Sarkar
Hendrik, Can you list down the error snippet so that we can refer the code where exactly that is happening. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Fri, Jul 21

Re: CDCR - how to deal with the transaction log files

2017-07-21 Thread Amrit Sarkar
nodes, the leaders of shard will try to create the same COLLECTIONCHECKPOINT, which may or may not be successful. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Fri, Jul 21

Re: multiValued=false is not working in Solr 6.4 in RHEL/CentOS

2017-07-20 Thread Amrit Sarkar
By saying: I am just adding multiValued=false in the managed-schema file. Are you modifying in the local filesystem "conf" or going into the core conf directory and changing there? If you are SolrCloud, you should change the same on Zookeeper.

Re: Parent child documents partial update

2017-07-18 Thread Amrit Sarkar
Sujay, Lucene index is in flat-object document style, so I really not think nested documents at index / storage will ever be supported unless someone change the very intricacy of the index. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com

Re: CloudSolrClient preferred over LBHttpSolrClient

2017-07-17 Thread Amrit Sarkar
of servers. Hope this helps. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Mon, Jul 17, 2017 at 11:38 PM, S G <sg.online.em...@gmail.com> wrote: > Hi, &

Re: Help with updateHandler commit stats

2017-07-17 Thread Amrit Sarkar
+Reference for more details. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Fri, Jul 7, 2017 at 4:15 PM, Antonio De Miguel <deveto...@gmail.com> wrote: > Hi, >

Re: Parent child documents partial update

2017-07-17 Thread Amrit Sarkar
ments along with it. I am not sure whether this can be done with current code or it will be fixed / improved in the future. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Mon, Jul 1

Re: Solr Subfaceting

2017-07-17 Thread Amrit Sarkar
ad? In my opinion, you should use the better feature. Though you may hit some limitations of json faceting and their respective would be jiras opened too. Rest Mr. Seeley would be the the best person the 2nd. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Tw

Re: Solr Subfaceting

2017-07-17 Thread Amrit Sarkar
= client.query(params); Setting key and value via SolrParams is available. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Mon, Jul 17, 2017 at 8:48 PM, Ponnuswamy, Poornima (

Re: TransactionLog doesn't know how to serialize class java.util.UUID; try implementing ObjectResolver?

2017-07-17 Thread Amrit Sarkar
alize " + o.getClass() + "; try implementing ObjectResolver?"); } }; While UUID implements serializable, so should be BytesRef instance to?? :: public final class UUID implements java.io.Serializable, Comparable Can you share the payload with you are trying to update? Amrit Sarkar Search

Re: dynamic datasource password in db_data_config file

2017-07-17 Thread Amrit Sarkar
Javed, Can you let us know if you are running in standalone or cloud mode? Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Mon, Jul 17, 2017 at 11:54 AM, javeed &l

Re: async backup

2017-06-27 Thread Amrit Sarkar
Damien, then I poll with REQUESTSTATUS REQUESTSTATUS is an API which provided you the status of the any API (including other heavy duty apis like SPLITSHARD or CREATECOLLECTION) associated with async_id at that current timestamp / moment. Does that give you "state"="completed&

Re: com.ibm.icu dependency errors when building solr source code

2017-06-22 Thread Amrit Sarkar
Running "ant eclipse" or "ant test" in verbose mode will provide you the exact lib in ivy2 cache which is corrupt. Delete that particular lib and run "ant" again. Also don't try to get out / exit "ant" commands via Ctrl+C or Ctrl+V while it is downloading the libraries to ivy2 folder.

Re: I want "john smi" to find "john smith" in my custom "fullname_s" field

2017-06-06 Thread Amrit Sarkar
Erik, Thank you for correcting. Things I miss out on daily bases: _text_ :) Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https://www.linkedin.com/in/sarkaramrit2 On Tue, Jun 6, 2017 at 5:12 PM, Nick Way &l

Re: I want "john smi" to find "john smith" in my custom "fullname_s" field

2017-06-06 Thread Amrit Sarkar
quot;:"on", "wt":"json"}}, "response":{"numFound":1,"start":0,"maxScore":1.0,"docs":[ { "id":"1", "fullname_s":"john smith", "_version_":15694

Re: Number of requests spike up, when i do the delta Import.

2017-06-01 Thread Amrit Sarkar
. Ideally then, a full-import or the delta-import should take similar time to build the docs (fetch next row). I may very well be going entirely wrong here. Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269 www.lucidworks.com Twitter http://twitter.com/lucidworks LinkedIn: https

  1   2   >