Re: Grouping and group.facet performance disaster

2017-05-31 Thread Ere Maijala
While I can't say whether it affects you in this case, Solr 6.4.1 has serious performance issues. I'd suggest upgrading to at least 6.4.2. --Ere On 31.5.2017 at 14.16, Marek Tichy wrote: Hi, I'm getting very slow response times on grouping, especially on facet grouping. Without grouping,

Re: [EXTERNAL] Grouping and group.facet performance disaster

2017-05-31 Thread Sunil . Srinivasan
Use group.cache.percent – for your index size, it might work well. Thanks, On 5/31/17, 4:16 AM, "Marek Tichy" wrote: Hi, I'm getting very slow response times on grouping, especially on facet grouping. Without grouping, the query takes 14ms,

Re: SQL-like queries (with percent character) - matching an exact substring, with parts of words

2017-05-31 Thread Erick Erickson
Your searches against the ascii_ignorecase_string field will suffer performance-wise: SQL-like %whatever% queries essentially have to do a table scan and assemble (conceptually) a huge OR clause consisting of all the terms (in this case strings) that match. Shawn's comment on using NGrams is the
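The NGram idea mentioned in this reply can be illustrated with a minimal sketch. It is a conceptual model only, not Solr's implementation: the function names, the sample values, and the gram sizes are assumptions made for the example. The point is that substrings are computed once at index time, so a %fragment% style search becomes a single term lookup instead of a scan over every stored string.

```python
def ngrams(term, min_size=2, max_size=4):
    """Return every substring of `term` with a length from min_size to max_size."""
    term = term.lower()
    grams = set()
    for size in range(min_size, max_size + 1):
        for start in range(len(term) - size + 1):
            grams.add(term[start:start + size])
    return grams


def build_index(values, min_size=2, max_size=4):
    """Map each gram to the set of stored values that contain it (index-time work)."""
    index = {}
    for value in values:
        for gram in ngrams(value, min_size, max_size):
            index.setdefault(gram, set()).add(value)
    return index


def substring_lookup(index, fragment):
    """Query-time work: one dictionary lookup, no scan over all stored values."""
    return index.get(fragment.lower(), set())


# Hypothetical sample data, chosen only for illustration.
index = build_index(["Warszawa", "Poznan", "Krakow"])
matches = substring_lookup(index, "zna")
```

In this sketch a fragment shorter than min_size or longer than max_size finds nothing, which mirrors the usual trade-off of n-gram indexing: larger gram ranges match more queries but make the index bigger.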

Re: Number of requests spike up, when i do the delta Import.

2017-05-31 Thread Erick Erickson
This is often the delta query configuration, where sub-entities may execute a DB request for each row. Is that possible? Best, Erick On Wed, May 31, 2017 at 2:58 AM, vrindavda wrote: > Exactly, Delta import is taking More than Delta > > Here are the details required. > >
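The per-row effect described in this reply can be sketched as simple arithmetic. This is a deliberately simplified model, not a description of how the Data Import Handler is implemented: the function name and the number of sub-entities are assumptions, and only the figure of 600 changed documents comes from the thread.

```python
def count_db_requests(changed_rows, sub_entities_per_row):
    """Simplified model: one parent delta query, plus one DB request
    per sub-entity for every changed parent row."""
    parent_queries = 1
    child_queries = changed_rows * sub_entities_per_row
    return parent_queries + child_queries


# 600 changed rows is taken from the thread; 2 sub-entities is a hypothetical value.
requests_for_600_rows = count_db_requests(600, 2)
```

Under these assumptions the request count grows linearly with the number of changed rows, which is one way a small delta can still produce a spike in requests.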

Re: SOLR query validation

2017-05-31 Thread Erick Erickson
About adding fields, consider adding a custom DocumentTransformer instead, that's much less invasive. Best, Erick On Wed, May 31, 2017 at 5:36 AM, Susheel Kumar wrote: > Some of these like restricting user to not query some fields (based on > their authorization) etc. we

Re: Solr installdir deleted after set up solr cloud

2017-05-31 Thread Erick Erickson
First your Solr install dir being removed is very odd. Solr doesn't do that so I think there's something else going on here. This part of the message: "/usr/local/Cellar/solr/6.5.0/index/write.lock" indicates that both your running Solr instances point to the same place somehow. See the section

Re: Upgrading config from 4.5.0 to 6.5.1

2017-05-31 Thread Rick Leir
Hi Nawab, The recommended way is to use the new version of solrconfig.xml and apply your modifications to it. You will want to go through it looking for developments that would affect you. Cheers, Rick On May 31, 2017 3:45:58 PM EDT, Nawab Zada Asad Iqbal wrote: >Hi, > >I am

Solr installdir deleted after set up solr cloud

2017-05-31 Thread tstusr
Hi, there. There is some strange behavior I'm unable to trace when setting up Solr in cloud mode. I'm able to start Solr in cloud mode following this tutorial. https://cwiki.apache.org/confluence/display/solr/Getting+Started+with+SolrCloud Just following instructions. We are trying to set up

Re: _version_ / Versioning using timespan

2017-05-31 Thread Susheel Kumar
"Document Centric Versioning Constraints" is what you are looking for if you want this to be handled in Solr https://cwiki.apache.org/confluence/display/solr/Updating+Parts+of+Documents -- Susheel On Wed, May 31, 2017 at 6:46 AM, marotosg wrote: > Hi all. > > I need to
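The rule behind document-centric versioning can be sketched in a few lines. This is an illustration of the concept only, not Solr code: the class and method names are hypothetical, and the choice to reject an equal version as well as an older one is an assumption of the sketch. An update is accepted only when its application-supplied version is newer than the one already stored for the same id.

```python
class VersionedStore:
    """Toy in-memory store that ignores updates carrying a stale version."""

    def __init__(self):
        self._docs = {}

    def add(self, doc_id, version, fields):
        """Store the document unless an equal or newer version already exists.

        Returns True when the update was applied, False when it was ignored.
        """
        current = self._docs.get(doc_id)
        if current is not None and version <= current["version"]:
            return False
        self._docs[doc_id] = {"version": version, "fields": fields}
        return True

    def get(self, doc_id):
        return self._docs.get(doc_id)


store = VersionedStore()
applied_new = store.add("entity-1", 2, {"name": "newer"})
applied_old = store.add("entity-1", 1, {"name": "older"})  # arrives late, ignored
```

With this rule the order in which a client sends versions no longer matters: the late-arriving older version leaves the stored document untouched.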

Upgrading config from 4.5.0 to 6.5.1

2017-05-31 Thread Nawab Zada Asad Iqbal
Hi, I am upgrading 4.5.0 to latest stable bits and wondering what will be the quickest way to find out any obsolete or deprecated settings in config files? If I run the latest server with my old config (solr.xml, solrconfig.xml, schema.xml) files, will it warn for deprecated/less-optimal values?

Re: SOLR | De-Duplication | Remove duplicate records based on their status

2017-05-31 Thread simon
Your updateRequestProcessorChain config snippet specifies the "id" field to generate a signature, but the sample data doesn't contain an "id" field ... check that out first. -Simon On Wed, May 31, 2017 at 12:06 PM, Lebin Sebastian wrote: > Hello, > > I am indexing two

SOLR | De-Duplication | Remove duplicate records based on their status

2017-05-31 Thread Lebin Sebastian
Hello, I am indexing two different models with the same data but different statuses. Eg: *Scenario -1* {Model: "", name: "abc", status: "T"} {Model: "", name: "abc", status: "A"} Expected Output {Model: "", name: "abc", status: "A"} *Scenario -2 * {Model: "", name: "abc", status: "A"}

Re: Grouping and group.facet performance disaster

2017-05-31 Thread Susheel Kumar
Did you try sub-facets ( http://yonik.com/json-facet-api/ ) to see whether they meet your facet grouping requirements? Or try Collapse/Expand Results. https://cwiki.apache.org/confluence/display/solr/Collapse+and+Expand+Results Thanks On Wed, May 31, 2017 at 7:16 AM, Marek Tichy wrote: >
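As a rough sketch of the Collapse/Expand suggestion, the request below uses the collapse filter query and the expand parameter in place of the grouping parameters. The host, collection name, and field names (group_id, category) are hypothetical placeholders. Note also that facet counts are then computed over the collapsed documents, which may not match group.facet semantics exactly, so the results should be checked against the real requirement.

```python
from urllib.parse import urlencode, parse_qs, urlsplit


def collapse_query_url(base_url, query, group_field, facet_field):
    """Build a select URL that collapses results on group_field and facets on facet_field."""
    params = [
        ("q", query),
        ("fq", "{!collapse field=%s}" % group_field),  # keep one document per group
        ("expand", "true"),                            # return the other group members separately
        ("facet", "true"),
        ("facet.field", facet_field),
    ]
    return base_url + "?" + urlencode(params)


# Hypothetical host, collection, and field names.
url = collapse_query_url(
    "http://localhost:8983/solr/mycollection/select", "*:*", "group_id", "category"
)
parsed = parse_qs(urlsplit(url).query)
```

Building the URL with urlencode keeps the braces and the exclamation mark of the local-params syntax correctly escaped.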

Re: Facet ranges and stats

2017-05-31 Thread Susheel Kumar
Hi, You may want to explore the JSON facets. The closest I can get to the above requirement is the query below (replace inStock with your rank field and price with total). Null handling is something you will also have to look at. -- Susheel curl http://localhost:8983/solr/techproducts/query -d

Re: can't create collection using solrcloud

2017-05-31 Thread BrianMaltzan
Hi, I disabled SSL/TLS; now creating a collection works. To re-enable SSL, I've tried upgrading ZooKeeper's Java from 7 to 8. Solr was already on Java 8. Now I'm thinking there's a solr.in.sh parameter that's new, since Solr 5, that I'm still missing, or the certificate has some issue, maybe the

Re: Rule-based Replica Placement not working with Solr 6.5.1

2017-05-31 Thread Bernd Fehling
I created a Jira issue, SOLR-10733, for this problem with a new test for RuleEngine. I have now also added a patch to SOLR-10733 that fixes the problem. How to proceed? Just set it to resolved and close it? Regards Bernd On 23.05.2017 at 14:09, Noble Paul wrote: > did you try the rule >

How to achieve ‘splitBy’ in SolrJ?

2017-05-31 Thread prabaharanr
Hi, I am trying to index records into solr-5.3.1 using SolrJ ‘addBeans’. Is there a way to specify ‘*splitBy*’ in our POJO? Here is my data-config.xml field definition (which helps with the DB-to-search mappings). How can I achieve this ‘splitBy’ in my POJO? @Field(value=" search-field") private DataType

Re: XLSB files not indexed

2017-05-31 Thread Allison, Timothy B.
Apache Tika version 1.15 now handles XLSB files. The behavior described below is the expected behavior if a file type is identified but there is no parser to handle that file type. A little late to the game, I admit... :) Cheers, Tim From: Roland Everaert

RE: Solr 6.4. Can't index MS Visio vsdx files

2017-05-31 Thread Allison, Timothy B.
Apache Tika 1.15 is now available. -Original Message- From: Allison, Timothy B. [mailto:talli...@mitre.org] Sent: Tuesday, May 9, 2017 7:45 AM To: solr-user@lucene.apache.org Subject: RE: Solr 6.4. Can't index MS Visio vsdx files Probably better to ask on the Tika list. We'll push the

Re: can't create collection using solrcloud

2017-05-31 Thread Susheel Kumar
Try the Collections API URL directly to create the collection, as Mangesh suggested, and see how it goes... On Tue, May 30, 2017 at 11:02 PM, mganeshs wrote: > Couple of times I faced this issue when firewall "Endpoint security" was > on. > Once I disabled it then it started working.

Re: SOLR query validation

2017-05-31 Thread Susheel Kumar
Some of these, like restricting users from querying some fields (based on their authorization), we do in our service layer. The service layer is what is exposed to consumers, and this service connects to Solr using SolrJ to execute queries etc. and get back results (in binary format). This is one

SOLR query validation

2017-05-31 Thread mganeshs
Hi, In my use case, we need to validate, in the Solr layer, the query that is fired at Solr. For example, we want a few fields to always be passed in the query, and we want a few other fields never to be passed in the query. Where is the right place to do this in Solr? Currently we are

Facet ranges and stats

2017-05-31 Thread Per Newgro
Hello, I would like to generate some stats on my facets. This is working so far. My problem is that I don't know how to generate ranges on my facets and calculate the stats for them. I have two fields in my schema -> rank(string) and total(float, nullable) Rank can be A or B or C. In case my

Grouping and group.facet performance disaster

2017-05-31 Thread Marek Tichy
Hi, I'm getting very slow response times on grouping, especially on facet grouping. Without grouping, the query takes 14ms, faceting 57ms. With grouping, the query time goes up to 1131ms; with facet grouping, the faceting goes up to an unbearable 12103 ms. Single Solr instance, 927086 docs,

Re: Solr error: org.apache.solr.common.SolrException: Exception writing document id files_21122 to the index; possible analysis error.

2017-05-31 Thread Lars Müller
I think the issue was the person between keyboard and chair. I deleted Solr and the setup and ran it again with this tutorial. Now it works. https://github.com/nextcloud/nextant/wiki/Setup-your-local-Solr-as-a-Service On 31 May 2017 at 11:37:02 a.m., Rick Leir wrote:

_version_ / Versioning using timespan

2017-05-31 Thread marotosg
Hi all. I need to implement an indexing solution in which my Solr index never ends up with the wrong version. Because I have many versions of the same entity, in some cases my client may end up indexing an earlier version of my entity after a newer one. I was wondering if I can use the _version_

Re: Number of requests spike up, when i do the delta Import.

2017-05-31 Thread vrindavda
Exactly, Delta import is taking More than Delta Here are the details required. When I do the delta import for 600 (of a total of 291,633) documents I get this: Indexing completed. Added/Updated: 360,000 documents. Deleted 0 documents. (Duration: 6m 58s) For Full import : Indexing completed.

Re: Number of requests spike up, when i do the delta Import.

2017-05-31 Thread Amrit Sarkar
I am facing kinda similar issue lately where full-import is taking seconds while delta-import is taking hours. Can you share some more metrics/numbers related to full-import and delta-import requested, rows fetched and time? Amrit Sarkar Search Engineer Lucidworks, Inc. 415-589-9269

Re: Solr error: org.apache.solr.common.SolrException: Exception writing document id files_21122 to the index; possible analysis error.

2017-05-31 Thread Rick Leir
Lars, More info is needed! Were you able to index _any_ documents before this happened? Are you POSTing via curl or something else? What is your config? Did you change your config just before this? Is the error repeatable? Any idea why the IndexWriter would be closed? "Caused by:

Re: SQL-like queries (with percent character) - matching an exact substring, with parts of words

2017-05-31 Thread Maciej Ł. PCSS
Shawn, thank you for your response. Finally, my search is based on two kinds of fields (strings and text, both ignoring case and special characters) that potentially can contain any language but mainly Polish or English. This is because the two main requirements were: 1) Google-like search

Number of requests spike up, when i do the delta Import.

2017-05-31 Thread vrindavda
Hello, The number of requests spikes whenever I do the delta import in Solr. Please help me understand this.

Solr error: org.apache.solr.common.SolrException: Exception writing document id files_21122 to the index; possible analysis error.

2017-05-31 Thread Lars Müller
Hello, I installed Solr 6.5.1 on Ubuntu. Using it with Nextcloud 12. I get this error Message: ERROR true RequestHandlerBase org.apache.solr.common.SolrException: Exception writing document id files_21122 to the index; possible analysis error.

Re: update please

2017-05-31 Thread Mikhail Khludnev
Sam, I believe you can search for q=*olomo*, and then request highlighting with hl=true&hl.fl=content. Probably it needs a tweak to return all fragments, one per occurrence. A slightly different idea is to request (TermsComponent) /terms for the given regexp, get all terms and then, request tf per every of