Re: streaming expressions substring-evaluator

2018-10-31 Thread Joel Bernstein
I don't think there is a substring or similar function. This would be quite nice to add along with other string manipulations. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Oct 31, 2018 at 2:37 AM Aroop Ganguly wrote: > Hey Team > > > Is there a way to extract a part of a

Re: Solr Stream vs Export Request Handlers

2018-10-25 Thread Joel Bernstein
I'm not sure why /stream is exporting faster then /export. It may be that the different approaches in the client are the reason for the difference. But the /export handler would be used in both scenarios if you specify qt=/export in the search() Streaming Expression. Joel Bernstein http

Re: Streaming rollUp vs Streaming facet

2018-10-25 Thread Joel Bernstein
Your use case is somewhat special in that it involves 10 fields. With that many nested facets the JSON facet API may or may not outperform streaming rollups. For most other cases JSON facet API will outperform rollups. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Oct 17, 2018 at 11:21

Re: Dealing with null values in streaming rollup

2018-10-22 Thread Joel Bernstein
This sounds like a bug, please log a ticket. I think the workaround you suggest is the only way to solve this problem currently. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Oct 22, 2018 at 11:28 AM Kojo wrote: > I think that you can use stream evaluators in your expressi

Re: SQL Query with NOT (V 7.2.1)

2018-10-17 Thread Joel Bernstein
I believe the NOT should be working. But the joins won't work correctly. Apache Calcite, which is parsing the query, is attempting to do the joins but there are no test cases with joins at all. The official docs should say that joins are not supported. Joel Bernstein http://joelsolr.blogspot.com

Re: Trying to retrieve two values from two different collections by sql (V 7.2.1)

2018-10-17 Thread Joel Bernstein
Joins are not currently supported with Solr SQL. We should create a ticket through a proper exception in this scenario. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Oct 16, 2018 at 10:56 PM deniz wrote: > found out sth strange regarding this case. If i change one of the val

Re: Streaming rollUp vs Streaming facet

2018-10-17 Thread Joel Bernstein
high I mean millions of facet values are being returned in the same query. 2) Rollups following any kind of relational algebra. For example a rollup on top of a hashJoin. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Oct 16, 2018 at 8:54 AM RAUNAK AGRAWAL wrote: > Hi Guys, > >

Re: Solr Stream vs Export Request Handlers

2018-10-15 Thread Joel Bernstein
not the best choice. You'll want to read through the various streaming expressions to see if they might be more efficient for your use case. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Oct 15, 2018 at 12:05 PM Kamal Kishore Aggarwal < kkroyal@gmail.com> wrote: > Hi, &g

Re: solr and diversification

2018-09-28 Thread Joel Bernstein
Interesting, I had not heard of MMR. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Sep 28, 2018 at 10:43 AM Tim Allison wrote: > If you haven’t already, might want to check out maximal marginal > relevance...original paper: Carbonell and Goldstein. > > On Thu, Sep 27, 201

Re: Solr Streaming Queries Performance Issues [v7.2.1]

2018-09-28 Thread Joel Bernstein
parallelism and scaling. We hope to have this ready for Solr 8, which is just around the corner. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Sep 28, 2018 at 2:52 PM RAUNAK AGRAWAL wrote: > Thanks a lot Toki. I will get back to you soon regarding patch update after >

Re: solr and diversification

2018-09-27 Thread Joel Bernstein
Yeah, I think your plan sounds fine. Do you have a specific use case for diversity of results. I've been wondering if diversity of results would provide better perceived relevance. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 27, 2018 at 1:39 PM Diego Ceccarelli (BLOOMBERG/ LONDON

Re: Solr Streaming Queries Performance Issues [v7.2.1]

2018-09-27 Thread Joel Bernstein
Please post the Streaming Expression that you are using. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 27, 2018 at 6:52 PM RAUNAK AGRAWAL wrote: > Hi Guys, > > Just to give you context, we were using JSON Facets for doing analytical > queries in solr but they were slow

Re: Exporting data using solr streaming expressions is randomly not working

2018-09-27 Thread Joel Bernstein
If you have: true Then you will get errors with the export handler. After you set this to false do the errors go away? Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 27, 2018 at 6:40 PM Gaini Rajeshwar < raja.rajeshwar2...@gmail.com> wrote: > Is it anyway related to the

Re: Exporting data using solr streaming expressions is randomly not working

2018-09-27 Thread Joel Bernstein
Ok, that stack trace shows where the problem. I'll investigate and report back. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 27, 2018 at 4:01 PM Gaini Rajeshwar < raja.rajeshwar2...@gmail.com> wrote: > @Joel: On some of the shards, i am seeing the following erro

Re: Solr Stream vs Export Request Handlers

2018-09-27 Thread Joel Bernstein
The export handler does not do distributed search. So if you have a multi-shard collection you may have to use Streaming Expressions to get exports from all shards. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 27, 2018 at 4:32 AM Jan Høydahl wrote: > Hi, > > Yes, you c

Re: Exporting data using solr streaming expressions is randomly not working

2018-09-27 Thread Joel Bernstein
if there are errors on the shards? Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 27, 2018 at 12:10 PM Gaini Rajeshwar < raja.rajeshwar2...@gmail.com> wrote: > Also, to mention, if i give q="*:*" it is working fine (which is kind of > weird) > > On Thu, Sep 27, 201

Re: Empty XML output from SOLR streaming expression

2018-09-27 Thread Joel Bernstein
Streaming expression only returns JSON. That simplified many aspects of the implementation. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 27, 2018 at 12:05 PM Dariusz Wojtas wrote: > Hi, > > I am working with SOLR 7.4.0 and use streaming expressions. > This

Re: solr and diversification

2018-09-27 Thread Joel Bernstein
I've thought about this problem a little bit. What I was considering was using Kmeans clustering to cluster the top 50 docs, then pulling the top scoring doc form each cluster as the top documents. This should be fast and effective at getting diversity. Joel Bernstein http

Re: Streaming Expressions - gatherNodes

2018-09-13 Thread Joel Bernstein
I'll have to take a look and see if I can reproduce this exact behavior. Let's create a jira ticket so we can track the discussion. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 13, 2018 at 1:03 PM Kojo wrote: > Same query feeding 25000 tuples to gatherNodes: > > ga

Re: Streaming Expressions - gatherNodes

2018-09-13 Thread Joel Bernstein
I see that the hits=0 in this log request. Are there log requests that show results found for one of these queries? Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 13, 2018 at 10:15 AM Kojo wrote: > I have just run this expression: > > > gatherNodes(graph_auxilios

Re: Streaming Expressions - gatherNodes

2018-09-13 Thread Joel Bernstein
That's odd behavior. What do the logs look like? This will produce a series of queries against the projects collection. Are you seeing those in the logs? Any errors? Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 13, 2018 at 9:25 AM Kojo wrote: > Hi, > > If I try to feed ga

Re: Explode kind of function in Solr

2018-09-13 Thread Joel Bernstein
Solr Streaming Expressions allow you to do this with the cartesianProduct function: http://lucene.apache.org/solr/guide/7_4/stream-decorator-reference.html#cartesianproduct The structure of the expression is: cartesianProduct(search(...)) Joel Bernstein http://joelsolr.blogspot.com/ On Thu

Re: Streaming timeseries() and buckets with no docs

2018-09-06 Thread Joel Bernstein
I found the ticket you created and commented on it. I'll work on this today. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 6, 2018 at 9:04 AM Joel Bernstein wrote: > Ok, I'll create a ticket for this, it's a very quick fix. I'll try to > commit today. > > Joel Bern

Re: Streaming timeseries() and buckets with no docs

2018-09-06 Thread Joel Bernstein
Ok, I'll create a ticket for this, it's a very quick fix. I'll try to commit today. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 6, 2018 at 6:52 AM Jan Høydahl wrote: > Created https://issues.apache.org/jira/browse/SOLR-12749 > > -- > Jan Høydahl, search soluti

Re: Streaming timeseries() and buckets with no docs

2018-09-04 Thread Joel Bernstein
in the latest versions. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Sep 3, 2018 at 8:58 AM Jan Høydahl wrote: > Hi > > We have a timeseries expression with gap="+1DAY" and a sum(imps_l) to > aggregate sums of an integer for each bucket. > Now, some day buckets do

Re: Graph/Stream requestHandlers in standalone

2018-08-14 Thread Joel Bernstein
Currently Streaming Expressions only work with Solr Cloud. This is because Streaming Expressions were designed to use Solr Cloud features for all distributed capabilites of the language. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Aug 14, 2018 at 1:59 AM, David '-1' Schmid wrote

Re: Can the export handler be used with the edismax or dismax query handler

2018-07-26 Thread Joel Bernstein
The export handler doesn't allow sorting by score at this time. It only supports sorting on fields. So the edismax qparser won't cxcurrently work with the export handler. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jul 26, 2018 at 5:52 PM, Tom Burton-West wrote: > Hello all, >

Re: Local Params handling (6.5->7.x)

2018-07-12 Thread Joel Bernstein
This may be the issue: https://issues.apache.org/jira/browse/SOLR-11501 If it turns out that this causing the problem, please create a jira. It's important to discuss how SOLR-11501 is affecting real deployments. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jul 12, 2018 at 12:30 PM

Re: Graph, GraphML, Gephi and Edge Labels

2018-06-29 Thread Joel Bernstein
Hi, Currently the nodes expression doesn't have this capability. Feel free to make a feature request on jira. This sounds like a fairly easy feature to add. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Jun 27, 2018 at 5:21 PM, Heidi McClure < heidi.mccl...@polarisalpha.com>

Re: Applying streaming expression as a filter in graph traversal expression (gatherNodes)

2018-06-21 Thread Joel Bernstein
Currently the gatherNodes expression can only be filtered by a traditional filter query. I'm curious about the type of expression you are thinking of filtering by? Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Jun 20, 2018 at 1:54 PM, Pratik Patel wrote: > We can limit the sc

Re: Search streaming expressions returns rows times number of shards docs

2018-06-21 Thread Joel Bernstein
with the results. Things have evolved quite a bit since then and having the search expression respect the rows parameter is something that I've been meaning to add. Feel free to create a ticket for this. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jun 21, 2018 at 1:35 PM, Alfonso Muñoz-Pomer

Re: Streaming Expressions: Merge array values? Inverse of cartesianProduct()

2018-06-19 Thread Joel Bernstein
Let's move the discussion to the jira ticket. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Jun 19, 2018 at 3:42 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > > > > Am 18.06.2018 um 15:30 schrieb Joel Bernstein : > > > > You

Re: Solr Odbc for Parallel Sql integration with Tableau

2018-06-18 Thread Joel Bernstein
capabilities we will need to add joins to Solr's parallel SQL. Solr already uses Apache Calcite, which has a join optimizer, so mainly this would involve hooking up the various Streaming Expression joins. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jun 18, 2018 at 6:37 PM, Aroop Ganguly

Re: Streaming Expressions: Merge array values? Inverse of cartesianProduct()

2018-06-18 Thread Joel Bernstein
the operation would work. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Jun 15, 2018 at 8:04 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > Hi, > > I had come across the reduce function in the docs but > I have a hard time getting it to work; I haven't found

Re: Streaming expressions and fetch()

2018-06-18 Thread Joel Bernstein
and I will work on the issue. Joel Bernstein http://joelsolr.blogspot.com/ On Sun, Jun 17, 2018 at 2:40 PM, Dariusz Wojtas wrote: > Hi, > I am trying to use streaming expressions with SOLR 7.3.1. > I have successfully used innerJoin, leftOuterJoin and several other > function

Re: Streaming Expressions: Merge array values? Inverse of cartesianProduct()

2018-06-14 Thread Joel Bernstein
Actually you're second example is probably a straight forward: reduce(select(...), group(...), by="k1") Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jun 14, 2018 at 7:33 PM, Joel Bernstein wrote: > Take a look at the reduce() function. You'll have to write a custom reduc

Re: Streaming Expressions: Merge array values? Inverse of cartesianProduct()

2018-06-14 Thread Joel Bernstein
in the solrconfig.xml and use it like any other function. If you're interested in working on this you could create a ticket and I can provide guidance. Joel Bernstein http://joelsolr.blogspot.com/ 2018-06-14 13:13 GMT-04:00 Christian Spitzlay < christian.spitz...@biologis.com>:

Re: Exception when processing streaming expression

2018-06-14 Thread Joel Bernstein
. The select hander does not have this code. In general you never want to use the select handler and set the rows to such a big number. If you have that many rows you'll want to use the export and handler which is designed to export the entire result set. Joel Bernstein http://joelsolr.blogspot.com

Re: A good KV store/plugins to go with Solr

2018-06-14 Thread Joel Bernstein
. This may be an approach that Solr could adopt. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jun 14, 2018 at 8:10 AM, Jan Høydahl wrote: > You could fetch the data from your application directly :;) > Also, the Streaming expressions has a jdbc() function but then you will > need to

Re: Exception when processing streaming expression

2018-06-13 Thread Joel Bernstein
Can your provide some example expressions that are causing these exceptions? Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Jun 13, 2018 at 9:02 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > Hi, > > I am seeing a lot of (reproducible) exceptions in m

Re: Streaming Expression intersect() behaviour

2018-06-08 Thread Joel Bernstein
yes, I was going to suggest that as well. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Jun 8, 2018 at 9:20 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > As a temporary workaround until that issue is fixed > one could wrap the right stream with a select

Re: Streaming Expression intersect() behaviour

2018-06-08 Thread Joel Bernstein
uot;fieldA" ) I suspect that there are only test cases that cover this scenario as well. I'll create a jira issue for this. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Jun 8, 2018 at 3:41 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > Hi, > > >

Re: Streaming Expression intersect() behaviour

2018-06-07 Thread Joel Bernstein
s are being transposed with intersect function's "on" fields. The same issue was happening with joins and may have been resolved. I'll do little more research into this. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jun 7, 2018 at 9:29 AM, Christian Spitzlay < christian.spitz

Re: Graph traversal: Bypass cycle detection?

2018-06-07 Thread Joel Bernstein
Ah. I'll do some testing to see exactly how nodes function behaves when a node links to itself. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jun 7, 2018 at 5:06 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > Hi, > > > > Am 07.06.2018 um 03:20 s

Re: Streaming Expression intersect() behaviour

2018-06-07 Thread Joel Bernstein
Nice example! I'll take a look at this today. I believe there was/is a bug with the some of the joins where the "on" parameter is transposing the fields. Its possible that is the case here as well. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Jun 7, 2018 at 5:34 AM, Christia

Re: Graph traversal: Bypass cycle detection?

2018-06-06 Thread Joel Bernstein
random walk analysis? I've been meaning to add a function that supports random walks on a graph that would not do cycle detection. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Jun 6, 2018 at 9:25 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > Hi, > >

Re: Escaping in streaming expression

2018-05-25 Thread Joel Bernstein
sort="id asc") So it appears that the main issue is with the double quotes. This because double quotes breaks the syntax of streaming expressions so it needed special handling. That special handling breaks the ability to search on double quotes as a character. SOLR-10894 only appears to

Re: Escaping in streaming expression

2018-05-24 Thread Joel Bernstein
test_s:\"hello world", fl="id", sort="id desc") In this case the double quote is used as part of the term, which is what I believe you had in mind. SOLR-10894, I believe was created to address this issue but as of yet no patch is available. Joel Bernstein http

Re: Escaping in streaming expression

2018-05-24 Thread Joel Bernstein
ere I believe SOLR-10894 comes into play. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, May 24, 2018 at 11:23 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > Hello, > > I’m experimenting with streaming expressions and I wonder how to escape a > dou

Re: Escaping in streaming expression

2018-05-24 Thread Joel Bernstein
which has not yet been resolved: https://issues.apache.org/jira/browse/SOLR-10894 Joel Bernstein http://joelsolr.blogspot.com/ On Thu, May 24, 2018 at 11:23 AM, Christian Spitzlay < christian.spitz...@biologis.com> wrote: > Hello, > > I’m experimenting with streaming expressio

Re: Solr streaming - get single value from tuple

2018-05-24 Thread Joel Bernstein
. This might not yet be documented. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, May 24, 2018 at 9:11 AM, Jan Høydahl <jan@cominvent.com> wrote: > describe() returns a tuple. > > I’d like to assign the value of “mean” from that tuple to a separate > variable for use in

Re: Collapse in facet

2018-04-10 Thread Joel Bernstein
It sounds like the JSON facet API could do what you are describing. I haven't tried the exclusion of the collapse filter with the JSON facet API but I suspect it will work. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Apr 10, 2018 at 3:40 AM, Carl-Johan Syrén <carl-johan.sy...@slu

Re: Use TopicStream as percolator

2018-04-08 Thread Joel Bernstein
would need to write an expression that performed the push. For example the expression you stored in Solr to be executed could be something like: push(topic(...)) You would provide the logic for the push expression. Joel Bernstein http://joelsolr.blogspot.com/ On Sun, Apr 8, 2018 at 11:33 AM

Re: Solr 7.1.0 - concurrent.ExecutionException building model

2018-04-05 Thread Joel Bernstein
this matrix much smaller. Its fairly easy to make the train function work on a random sample of the training set on each iteration rather then the entire training set, but currently this is not how its implemented. Feel free to create a ticket requesting this the sampling approach. Joel Bernstein http

Re: Solr 7.1.0 - concurrent.ExecutionException building model

2018-04-02 Thread Joel Bernstein
It looks like it accessing a replica that's down. Are the logs from http://vesta:9100/solr/MODEL1024_1522696624083_shard20_replica_n75 reporting any issues? When you go to that url is it back up and running? Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Apr 2, 2018 at 3:55 PM, Joe

Re: Solr or Elasticsearch

2018-03-22 Thread Joel Bernstein
Solr 7.3 has very sophisticated math capabilities described below: https://github.com/joel-bernstein/lucene-solr/blob/math_expressions_documentation/solr/solr-ref-guide/src/math-expressions.adoc This is the userguide for math expressions which didn't make the 7.3 release but all the functions

Re: Get terms in solr not working

2018-03-21 Thread Joel Bernstein
Also what is the use case? What do you plan to do with terms? There may be other approaches that will work better then the terms query. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Mar 21, 2018 at 9:28 AM, Erick Erickson <erickerick...@gmail.com> wrote: > We need a lot more in

Re: statistics in hitlist

2018-03-16 Thread Joel Bernstein
g a linear regression on two fields, X and Y, in a hitlist, we need to > do a linear regression on field X, and the average value of X. Is that > possible? To pass in a function to the regress function instead of a field? > > > > > > On Thu, Mar 15, 2018 at 10:41 PM, Joel Berns

Re: statistics in hitlist

2018-03-15 Thread Joel Bernstein
I've been working on the user guide for the math expressions. Here is the page on regression: https://github.com/joel-bernstein/lucene-solr/blob/math_expressions_documentation/solr/solr-ref-guide/src/regression.adoc This page is part of the larger math expression documentation. The TOC is here

Re: statistics in hitlist

2018-03-15 Thread Joel Bernstein
_days_production", sort="id asc"), b=col(a, oil_first_90_days_production), c=col(a, oil_last_30_days_production), d=regress(b, c), e=someExpression()) The echo parameter tells the let expression which variables to output. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Mar

Re: statistics in hitlist

2018-03-05 Thread Joel Bernstein
I suspect you've got nulls in your data. I just tested with null values and got the same error. For testing purposes try loading the data with default values of zero. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Mar 5, 2018 at 10:12 PM, Joel Bernstein <joels...@gmail.com>

Re: statistics in hitlist

2018-03-05 Thread Joel Bernstein
il_first_90_days_production)) This should return variables a and b. Let's see what the data looks like. I changed the rows from 15 to 15000. If it all looks good we can expand the rows and continue adding functions. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Mar 5, 2018 at 4:11 PM,

Re: statistics in hitlist

2018-03-01 Thread Joel Bernstein
The field type will also need to be in the schema: Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Mar 1, 2018 at 8:00 PM, Joel Bernstein <joels...@gmail.com> wrote: > You'll need to have this field in your schema: > > > > I'll check to see if the default schema

Re: statistics in hitlist

2018-03-01 Thread Joel Bernstein
multiple. I'll create ticket for getting this fixed. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Mar 1, 2018 at 4:55 PM, John Smith <localde...@gmail.com> wrote: > Joel, thanks for the pointers to the streaming feature. I had no idea solr > had that (and also just discove

Re: statistics in hitlist

2018-02-23 Thread Joel Bernstein
d other attributes of the regression model such as R (correlation), slope, y intercept etc... Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Feb 23, 2018 at 3:10 PM, John Smith <localde...@gmail.com> wrote: > Hi Joel, thanks for the answer. I'm not really a stats guy, but the

Re: statistics in hitlist

2018-02-23 Thread Joel Bernstein
Typically SSE is the sum of the squared errors of the prediction in a regression analysis. The stats component doesn't perform regression, although it might be a nice feature. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Feb 23, 2018 at 12:17 PM, John Smith <localde...@gmail.com>

Re: LTR and 'searching' a streaming expression result

2018-02-23 Thread Joel Bernstein
, which doesn't require full export or a specific sort order. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Feb 23, 2018 at 5:54 AM, Gintautas Sulskus < gintautas.suls...@gmail.com> wrote: > Hi, > > Is it possible to apply another search to a streaming expression result

Re: What is “high cardinality” in facet streams?

2018-02-21 Thread Joel Bernstein
in the cluster. The parallel SQL interface supports a map_reduce aggregation mode where you can specific then number of parallel workers. If a SQL group by query works for you that might be the easiest way to go. The docs have good coverage of this topic. Joel Bernstein http://joelsolr.blogspot.com

Re: What is “high cardinality” in facet streams?

2018-02-20 Thread Joel Bernstein
handler which can also sort data sets with very high cardinality. The docs should describe the correct usage of the rollup expression with the /export handler. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Feb 20, 2018 at 11:10 AM, Shawn Heisey <apa...@elyograg.org> wrote: > On 2/

Re: Solr streaming expression - options for Full Outer Join

2018-02-18 Thread Joel Bernstein
I forgot to mention, in order to do a join you would merge the streams together that you wanted to join. Then reduce by the join key. This is basic structure: reduce(merge(search(), search())) Joel Bernstein http://joelsolr.blogspot.com/ On Sun, Feb 18, 2018 at 10:45 PM, Joel Bernstein <jo

Re: Solr streaming expression - options for Full Outer Join

2018-02-18 Thread Joel Bernstein
t;a_s", group(sort="a_f desc", n="4") ) The is a basic map/reduce grouping operation. Joel Bernstein http://joelsolr.blogspot.com/ On Sun, Feb 18, 2018 at 6:24 PM, GaneshSe <ganeshmail...@gmail.com> wrote: > Any help solr streaming expression option i

Re: Getting Error when try to use sql query

2018-02-16 Thread Joel Bernstein
Currently grouping on the Date field is not supported with SQL interface. At Alfresco we've added a time series capability to Solr's SQL engine and plan on contributing it back. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Feb 16, 2018 at 3:10 PM, Jorge Carlos Guedert Junior < jorge.

Re: using gatherNodes to produce 'to' -> 'from' pairs

2018-02-05 Thread Joel Bernstein
array into tuples and fetch using the ancestors node IDs. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Feb 5, 2018 at 5:20 AM, Gintautas Sulskus < gintautas.suls...@gmail.com> wrote: > Hi Joel, > > Thanks. The alternatives will not be as neat, unfortunately. > >

Re: using gatherNodes to produce 'to' -> 'from' pairs

2018-02-04 Thread Joel Bernstein
Currently you can only directly gather nodes and aggregations. The fetch() expression can be used to fetch more fields if needed in a secondary call. Joel Bernstein http://joelsolr.blogspot.com/ On Sun, Feb 4, 2018 at 3:12 PM, Gintautas Sulskus < gintautas.suls...@gmail.com> wrote:

Re: Nesting decorators in streaming expressions

2018-02-03 Thread Joel Bernstein
Yes, you can nest decorators. The expression in your email should be possible, its just a matter of getting all the syntax right. I would build up the expression one piece at a time and make sure each layer is working before moving on to the next. Joel Bernstein http://joelsolr.blogspot.com

Re: SolrCloudClient multi-threading

2018-02-02 Thread Joel Bernstein
CloudSolrClient determines which shard each document is going to in advance. It then separates the documents based on the route and uses a thread for each group to send the documents. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Feb 2, 2018 at 11:09 AM, Steve Pruitt <bpru...@opentext.

Re: Broken Feature in Solr 6.6

2018-01-30 Thread Joel Bernstein
Your welcome Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Jan 30, 2018 at 11:00 AM, Antelmo Aguilar <aagui...@nd.edu> wrote: > Hi Joel, > > Thank you! Changing the class from SearchHandler to ExportHandler worked. > I appreciate you looking into it. > > -An

Re: Broken Feature in Solr 6.6

2018-01-30 Thread Joel Bernstein
ortHandler", "useParams":"_EXPORT", "components": [ "query" ], "defaults": { "wt": "json" }, "invariants": { "rq": "{!xport}", "distrib": false }

Re: Broken Feature in Solr 6.6

2018-01-29 Thread Joel Bernstein
handler anymore. This should work fine: http://host:port/solr/collection/export?q=*:*=exp_id_s=exp_id_s+asc Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jan 29, 2018 at 4:59 PM, Antelmo Aguilar <aagui...@nd.edu> wrote: > Hi All, > > I was using this feature in So

Re: parallel - cartesianProduct

2018-01-29 Thread Joel Bernstein
Thanks! Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jan 29, 2018 at 11:14 AM, Kojo <rbsnk...@gmail.com> wrote: > Joel, > The Jira is created: > https://issues.apache.org/jira/browse/SOLR-11922 > > I hope it helps. > > Thank you very much. > > >

Re: parallel - cartesianProduct

2018-01-29 Thread Joel Bernstein
This looks like a bug in the CartesianProductStream. It's going to have be fixed before parallel cartesian products can be run. Feel free to create a jira for this. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Jan 29, 2018 at 9:58 AM, Kojo <rbsnk...@gmail.com> wrote: > Hi s

Re: Partial results from streaming expressions (i.e. making them "stream")

2018-01-17 Thread Joel Bernstein
to export result sets of any size. The underlying buffer sizes are really abstracted away and not meant to dealt with. What's the higher level issue you are concerned with? Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Jan 17, 2018 at 8:54 AM, Radu Gheorghe <radu.gheor...@sematext.

Re: Streaming Expressions shortestPath

2018-01-14 Thread Joel Bernstein
://lucene.apache.org/solr/guide/7_2/stream-decorator-reference.html#cartesianproduct Joel Bernstein http://joelsolr.blogspot.com/ On Sun, Jan 14, 2018 at 8:07 PM, Joel Bernstein <joels...@gmail.com> wrote: > The shortestPath returns a list of node id's, and fetch expects a stream > o

Re: Streaming Expressions shortestPath

2018-01-14 Thread Joel Bernstein
with the shortestPath expression? Joel Bernstein http://joelsolr.blogspot.com/ On Sun, Jan 14, 2018 at 11:47 AM, Karthik Ramachandran <mrk...@gmail.com> wrote: > Can shortestPath stream source be used with fetch? > > With below streaming expression, we only get the path, are we doing > so

Re: SolrJ with Async Http Client

2018-01-03 Thread Joel Bernstein
://joelsolr.blogspot.com/2017/01/deploying-solrs-new-parallel-executor.html After you've read through the two blogs let me know if you have questions about how to apply this to your use case. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Jan 3, 2018 at 8:11 AM, RAUNAK AGRAWAL <agrawal.rau...@gmail.com>

Re: solr 6.6.0 /export returns xml not json

2017-12-14 Thread Joel Bernstein
I suspect when you did the upgrade you kept the same configs. The /export handler is an implicit handler in 6.6 so if you see any reference to /export handler in the solrconfig remove it. The /export hander does only return json or javabin in Solr 6.6. Joel Bernstein http://joelsolr.blogspot.com

Re: Solr Aggregation queries are way slower than Elastic Search

2017-12-12 Thread Joel Bernstein
The JSON facet API is a different aggregation implementation. So, it will definitely have different performance characteristics. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Dec 12, 2017 at 9:17 AM, RAUNAK AGRAWAL <agrawal.rau...@gmail.com> wrote: > Hi Yonik, > > So if th

Re: Streaming and large resultsets

2017-11-09 Thread Joel Bernstein
node. Possibly one of your nodes is slow for some reason and that is causing the entire query to respond slowly. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Nov 9, 2017 at 2:22 PM, Lanny Ripple <la...@spotright.com> wrote: > Happy to do so. I am testing streams for the

Re: Streaming and large resultsets

2017-11-09 Thread Joel Bernstein
will slow down as you add more fields to the field list and sort list. But if you post your query and information on what types the fields are in the field list I can give you an idea of the type of performance I would expect. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Nov 9, 2017 at 9:54 AM

Re: Streaming Expression usage

2017-11-08 Thread Joel Bernstein
and then use Streaming Expressions for specific use cases. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Nov 8, 2017 at 10:31 AM, Kojo <rbsnk...@gmail.com> wrote: > We have a web site with traditional search capabilities, faceting, sorting > and so on. It has many problems before we took

Re: Streaming Expression usage

2017-11-08 Thread Joel Bernstein
. If your planning a traditional search interface then Streaming Expressions is going to be the less efficient approach. Joel Bernstein http://joelsolr.blogspot.com/ On Wed, Nov 8, 2017 at 7:44 AM, Kojo <rbsnk...@gmail.com> wrote: > Amrit, > as far as I understand, in your example I h

Re: Solr streaming innerJoin doesn't return rows

2017-11-02 Thread Joel Bernstein
or even a small cluster. So you'll really need to think about how the joins are designed and whether they fit your use case. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Nov 2, 2017 at 2:25 PM, Webster Homer <webster.ho...@sial.com> wrote: > I'm using Solr 6.2.0. I am trying to u

Re: Solr streaming questions

2017-10-31 Thread Joel Bernstein
It is not possible to use score with the /export handler. The /export handler currently only supports sorting by fields. You can sort by score using the default /select handler. Joel Bernstein http://joelsolr.blogspot.com/ On Tue, Oct 31, 2017 at 1:50 PM, Webster Homer <webster.ho...@sial.

Re: Graph Traversal

2017-10-28 Thread Joel Bernstein
I don't see a jira ticket for this yet. Feel free to create it and reply back with the link. Joel Bernstein http://joelsolr.blogspot.com/ On Fri, Oct 27, 2017 at 9:55 AM, Kojo <rbsnk...@gmail.com> wrote: > Hi, I was looking for information on Graph Traversal. More specifically,

Re: Parallel SQL: GROUP BY throws exception

2017-10-17 Thread Joel Bernstein
This would be a good jira to create at ( https://issues.apache.org/jira/projects/SOLR) Interesting that the query works in MySQL. I'm assuming MySQL automatically adds the group by field to the field list. We can look at doing this as well. Joel Bernstein http://joelsolr.blogspot.com/ On Tue

Re: Parallel SQL: GROUP BY throws exception

2017-10-16 Thread Joel Bernstein
is needed. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Oct 16, 2017 at 9:49 AM, Joel Bernstein <joels...@gmail.com> wrote: > Also what version are you using? > > Joel Bernstein > http://joelsolr.blogspot.com/ > > On Mon, Oct 16, 2017 at 9:49 AM, Joel Bernstein <

Re: Parallel SQL: GROUP BY throws exception

2017-10-16 Thread Joel Bernstein
Also what version are you using? Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Oct 16, 2017 at 9:49 AM, Joel Bernstein <joels...@gmail.com> wrote: > Can you provide the stack trace? > > Are you in SolrCloud mode? > > > > Joel Bernstein > http://joelsolr.bl

Re: Parallel SQL: GROUP BY throws exception

2017-10-16 Thread Joel Bernstein
Can you provide the stack trace? Are you in SolrCloud mode? Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Oct 16, 2017 at 9:20 AM, Dmitry Gerasimov <dgerasi...@kommunion.com> wrote: > Hi all! > > This query works as expected: > SELECT sum(amount) as total FROM d

Re: streaming with SolrJ

2017-09-28 Thread Joel Bernstein
of them and because their initialization can be complex. All of the test cases are now focused on exercising the underlying classes through the expressions. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, Sep 28, 2017 at 4:47 PM, Hendrik Haddorp <hendrik.hadd...@gmx.net> wrote: > h

Re: Solr SQL: standalone mode

2017-09-25 Thread Joel Bernstein
It's the automatic node discovery provided by ZooKeeper. If you setup a single node SolrCloud it will work fine. Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Sep 25, 2017 at 3:09 AM, Pavel Micka <pavel.mi...@zoomint.com> wrote: > Glad to hear that. Btw: where is the limitation

<    1   2   3   4   5   6   7   8   >