[jira] [Commented] (SOLR-10320) Perform secondary sort using both values in and outside Solr index

2017-03-30 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15949555#comment-15949555
 ] 

ASF GitHub Bot commented on SOLR-10320:
---

GitHub user bkinlaw opened a pull request:

https://github.com/apache/lucene-solr/pull/179

SOLR-10320: Perform secondary sort using both values in and outside S…

…olr index


This is an ideas pull request to address issue SOLR-10320. Any 
suggestions/feedback is welcome. Still needs more testing and unit tests. 

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/bkinlaw/lucene-solr SOLR-10320

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/lucene-solr/pull/179.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #179


commit 52a72dc845261e116a304ccda6e78d2a04790529
Author: Yuchuan Zhou 
Date:   2017-03-23T14:07:28Z

SOLR-10320: Perform secondary sort using both values in and outside Solr 
index




> Perform secondary sort using both values in and outside Solr index
> --
>
> Key: SOLR-10320
> URL: https://issues.apache.org/jira/browse/SOLR-10320
> Project: Solr
>  Issue Type: New Feature
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Yuchuan Zhou
>
> There are some situations that we need to sort results based on values 
> outside of Solr (say, from a separate datastore or a data analytics service 
> that ranks entities based on analytic results). There is also the need to 
> return results in a deterministic order but applying a dynamic chain of 
> sorting and/or ranking algorithms to the result set. This chain would be 
> processed as a secondary sort implementation, where ties returned from one 
> sorting/ranking algorithm are passed to the next sorting/ranking algorithm in 
> the chain until all ties are resolved, resulting in a deterministic result 
> order. This chain should have the ability to apply sorting algorithms that 
> use data found within the solr index as well as outside of the index.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-10320) Perform secondary sort using both values in and outside Solr index

2017-03-30 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15949554#comment-15949554
 ] 

ASF GitHub Bot commented on SOLR-10320:
---

Github user bkinlaw closed the pull request at:

https://github.com/apache/lucene-solr/pull/178


> Perform secondary sort using both values in and outside Solr index
> --
>
> Key: SOLR-10320
> URL: https://issues.apache.org/jira/browse/SOLR-10320
> Project: Solr
>  Issue Type: New Feature
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Yuchuan Zhou
>
> There are some situations that we need to sort results based on values 
> outside of Solr (say, from a separate datastore or a data analytics service 
> that ranks entities based on analytic results). There is also the need to 
> return results in a deterministic order but applying a dynamic chain of 
> sorting and/or ranking algorithms to the result set. This chain would be 
> processed as a secondary sort implementation, where ties returned from one 
> sorting/ranking algorithm are passed to the next sorting/ranking algorithm in 
> the chain until all ties are resolved, resulting in a deterministic result 
> order. This chain should have the ability to apply sorting algorithms that 
> use data found within the solr index as well as outside of the index.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-10320) Perform secondary sort using both values in and outside Solr index

2017-03-30 Thread ASF GitHub Bot (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15949553#comment-15949553
 ] 

ASF GitHub Bot commented on SOLR-10320:
---

GitHub user bkinlaw opened a pull request:

https://github.com/apache/lucene-solr/pull/178

SOLR-10320: Perform secondary sort using both values in and outside S…

…olr index

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/bkinlaw/lucene-solr SOLR-10320

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/lucene-solr/pull/178.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #178


commit 52a72dc845261e116a304ccda6e78d2a04790529
Author: Yuchuan Zhou 
Date:   2017-03-23T14:07:28Z

SOLR-10320: Perform secondary sort using both values in and outside Solr 
index




> Perform secondary sort using both values in and outside Solr index
> --
>
> Key: SOLR-10320
> URL: https://issues.apache.org/jira/browse/SOLR-10320
> Project: Solr
>  Issue Type: New Feature
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Yuchuan Zhou
>
> There are some situations that we need to sort results based on values 
> outside of Solr (say, from a separate datastore or a data analytics service 
> that ranks entities based on analytic results). There is also the need to 
> return results in a deterministic order but applying a dynamic chain of 
> sorting and/or ranking algorithms to the result set. This chain would be 
> processed as a secondary sort implementation, where ties returned from one 
> sorting/ranking algorithm are passed to the next sorting/ranking algorithm in 
> the chain until all ties are resolved, resulting in a deterministic result 
> order. This chain should have the ability to apply sorting algorithms that 
> use data found within the solr index as well as outside of the index.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-10320) Perform secondary sort using both values in and outside Solr index

2017-03-20 Thread Yuchuan Zhou (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15933154#comment-15933154
 ] 

Yuchuan Zhou commented on SOLR-10320:
-

I have custom code for my own need, and would like to make it as a Solr feature 
so everyone could use it.

I think Solr can provide public methods which take a map (field key->value) or 
a ordered list of values, and use these data to sort matched results. So Solr 
just provides a standard way to developers, reduce the custom code they need to 
write.

And for performance, nothing is free. Developers who use this feature need to 
know there will be extra overhead, from the external data service, and from 
sorting operations within Solr. For my case and probably many out there, we can 
live with this price.

> Perform secondary sort using both values in and outside Solr index
> --
>
> Key: SOLR-10320
> URL: https://issues.apache.org/jira/browse/SOLR-10320
> Project: Solr
>  Issue Type: New Feature
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Yuchuan Zhou
>
> There are some situations that we need to sort results based on values 
> outside of Solr (say, from a separate datastore or a data analytics service 
> that ranks entities based on analytic results). There is also the need to 
> return results in a deterministic order but applying a dynamic chain of 
> sorting and/or ranking algorithms to the result set. This chain would be 
> processed as a secondary sort implementation, where ties returned from one 
> sorting/ranking algorithm are passed to the next sorting/ranking algorithm in 
> the chain until all ties are resolved, resulting in a deterministic result 
> order. This chain should have the ability to apply sorting algorithms that 
> use data found within the solr index as well as outside of the index.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-10320) Perform secondary sort using both values in and outside Solr index

2017-03-20 Thread Erick Erickson (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-10320?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15933017#comment-15933017
 ] 

Erick Erickson commented on SOLR-10320:
---

first, before submitting a JIRA it's a good idea to discuss something on the 
user's list.

Do you have a patch you'd like to submit? 

You can sort by function queries, and you can write custom function query. 
Since based on something outside Solr is, by definition, custom for most 
situations, you'd have to write some custom code anyway so would that do?

And the overriding concern here is speed. Every doc that has a non-zero score 
will have to go through sorting. It's hard to see how going out to an 
external-to-solr source for, potentially, billions of documents has to be very, 
very efficient.

> Perform secondary sort using both values in and outside Solr index
> --
>
> Key: SOLR-10320
> URL: https://issues.apache.org/jira/browse/SOLR-10320
> Project: Solr
>  Issue Type: New Feature
>  Security Level: Public(Default Security Level. Issues are Public) 
>Reporter: Yuchuan Zhou
>
> There are some situations that we need to sort results based on values 
> outside of Solr (say, from a separate datastore or a data analytics service 
> that ranks entities based on analytic results). There is also the need to 
> return results in a deterministic order but applying a dynamic chain of 
> sorting and/or ranking algorithms to the result set. This chain would be 
> processed as a secondary sort implementation, where ties returned from one 
> sorting/ranking algorithm are passed to the next sorting/ranking algorithm in 
> the chain until all ties are resolved, resulting in a deterministic result 
> order. This chain should have the ability to apply sorting algorithms that 
> use data found within the solr index as well as outside of the index.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org