Re: SQL rpt_location question

2017-03-24 Thread Joel Bernstein
You can use the _query_ field to support any Solr query in the where clause: select a, b from c where _query_='(any solr query)' This is definitely supported in the 6.5 release which is the first release with Apache Calcite as the SQL engine. But I believe it's also supported in older versions

Re: Difference between hashJoin and innerJoin in Streaming Expression

2017-03-24 Thread Joel Bernstein
The innerJoin is a merge join and the hashJoin is a hash join. The merge join can support joins of unlimited size and never runs out of memory. But it requires that both sides of the join are sorted on the join keys. The hash join reads one side of the join into a hash map keyed on the join

Re: JSON Facet API Virtual Field Support

2017-03-24 Thread Yonik Seeley
On Fri, Mar 24, 2017 at 7:52 PM, Furkan KAMACI wrote: > Hi, > > I test JSON Facet API of Solr. Is it possible to create a virtual field > which is generated by using existing fields at response and supports > elementary arithmetic operations? > > Example: > > Schema

JSON Facet API Virtual Field Support

2017-03-24 Thread Furkan KAMACI
Hi, I test JSON Facet API of Solr. Is it possible to create a virtual field which is generated by using existing fields at response and supports elementary arithmetic operations? Example: Schema fields: products, sold_products, date I want to run a date range facet and add another field to

Re: Licensing issue advice for Solr.

2017-03-24 Thread Chris Hostetter
: I know that the product in general is licensed as Apache 2.0, but unfortunately there are packages : included in the build that are considered "non-permissive" by my company and as such, means that ... : It appears that the vast majority of the licensing issues are within the

Shingles from WDFF

2017-03-24 Thread Ken Krugler
Hi all, I’ve got some ancient Lucene tokenizer code from 2006 that I’m trying to avoid forward-porting, but I don’t think there’s an equivalent in Solr 5/6. Specifically it’s applying shingles to the output of something like the WordDelimiterFilter - e.g. MySuperSink gets split into “My”

Re: Licensing issue advice for Solr.

2017-03-24 Thread Alexandre Rafalovitch
There is no official build with minimal Solr configuration. Some downstream projects may do so, but we don't keep track of their installation specifics. If it is an issue with contrib directory I would think you should be able to just not use it or even delete it. As to the searching, you've

Re: unable to get more throughput with more threads

2017-03-24 Thread Suresh Pendap
Hi Shawn, It looks like you probably have pointed to the root cause of the issue. I am using a java client and using HttpClient library directly to fire the Http get queries. I am not using SolrJ client for firing the queries. The following is my code CloseableHttpClient httpclient =

Re: Licensing issue advice for Solr.

2017-03-24 Thread Pablo Pita Leira
No answer from my side, but if you like to search the mailing list, you can try this: http://markmail.org/search/?q=license+list%3Aorg.apache.lucene.solr-user On 24.03.2017 18:53, russell.lemas...@comcast.net wrote: Hi all, I'm just getting started with Solr (6.4.2) and am trying to get

Re: unable to get more throughput with more threads

2017-03-24 Thread Suresh Pendap
Erick, - I think I checked that my QueryResultsCache and DocumentCache ratios were close to 1. I will double check that by repeating my test. - I think checking the Qtimes in the log is a very good suggestion, I will also check that the next time I run my test - It is not possible as the client is

Re: Architecture suggestions

2017-03-24 Thread Shawn Heisey
On 3/24/2017 7:47 AM, vrindavda wrote: > In my case query rate will be average or say low, 100-120 concorrent > requests. That is not a low query rate. A low query rate would be X queries per second, where X is a small single-digit number. If there are 100 *simultaneous* requests, then the

Licensing issue advice for Solr.

2017-03-24 Thread russell . lemaster
Hi all, I'm just getting started with Solr (6.4.2) and am trying to get approval for usage in my workplace. I know that the product in general is licensed as Apache 2.0, but unfortunately there are packages included in the build that are considered "non-permissive" by my company and as

Re: to handle expired documents: collection alias or delete by id query

2017-03-24 Thread Tom Evans
On Thu, Mar 23, 2017 at 6:10 AM, Derek Poh wrote: > Hi > > I have collections of products. I am doing indexing 3-4 times daily. > Every day there are products that expired and I need to remove them from > these collectionsdaily. > > Ican think of 2 ways to do this. > 1.

SQL rpt_location question

2017-03-24 Thread GW
Dear reader, I've found that using the distinct clause gives me the list I want. I also have a multivalued rpt_location in the collection that I'd like to use in the filter. Is this possible in any way shape of form? Many thanks in advance, Greg

Re: Architecture suggestions

2017-03-24 Thread vrindavda
Thanks Shawn, In my case query rate will be average or say low, 100-120 concorrent requests. As per my understanding replica too aid shards in getting result documents, correct if I am wrong. Moreover, I intend to have fault tolerant architecture, hence opting for shards/replicas on different

Re: Architecture suggestions

2017-03-24 Thread Shawn Heisey
On 3/24/2017 1:15 AM, vrindavda wrote: > Thanks Erick and Emir , for your prompt reply. > > We are expecting around 50M documents to sit on 80GB . I understand that > there is no equation to predict the number/size of server. But considering > to have minimal fault tolerant architecture, Will 2

Re: unable to get more throughput with more threads

2017-03-24 Thread Shawn Heisey
On 3/23/2017 6:10 PM, Suresh Pendap wrote: > I performed the test with 1 thread, 10 client threads and 50 client > threads. I noticed that as I increased the number of threads, the > query latency kept increasing drastically which I was not expecting. What language and Solr library was the

Difference between hashJoin and innerJoin in Streaming Expression

2017-03-24 Thread Zheng Lin Edwin Yeo
Hi, What is the main difference between hashJoin and innerJoin in Solr Streaming Expression? I understand that both will emit a tuple containing the fields of both tuples. When I tried both hashJoin and innerJoin with the same query, I get exactly the same results, and there is no difference in

Re: Architecture suggestions

2017-03-24 Thread vrindavda
Thanks Erick and Emir , for your prompt reply. We are expecting around 50M documents to sit on 80GB . I understand that there is no equation to predict the number/size of server. But considering to have minimal fault tolerant architecture, Will 2 shards and 2 replicas with 128GB RAM, 4 core solr