[
https://issues.apache.org/jira/browse/SOLR-14787?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17362560#comment-17362560
]
Kevin Watters commented on SOLR-14787:
--------------------------------------
[~jbernste] I've done some analysis on the payload vs join stuff, some
surprising results. If the join use case is really only ever filtering on 1
dimension, then payloads are a very viable way to encode the data for query.
This is the basis for my talk at berlin buzzwords this week (Wednesday )
I have a more detailed writeup here:
[https://kmwllc.com/index.php/2021/06/12/solr-payload-inequalities/]
feedback very welcome.
[https://2021.berlinbuzzwords.de/session/document-classification-search-joins-vs-payloads]
> Inequality support in Payload Check query parser
> ------------------------------------------------
>
> Key: SOLR-14787
> URL: https://issues.apache.org/jira/browse/SOLR-14787
> Project: Solr
> Issue Type: New Feature
> Reporter: Kevin Watters
> Assignee: Gus Heck
> Priority: Major
> Fix For: main (9.0)
>
> Time Spent: 1h 20m
> Remaining Estimate: 0h
>
> The goal of this ticket/pull request is to support a richer set of matching
> and filtering based on term payloads. This patch extends the
> PayloadCheckQueryParser to add a new local param for "op"
> The value of OP could be one of the following
> * gt - greater than
> * gte - greater than or equal
> * lt - less than
> * lte - less than or equal
> default value for "op" if not specified is to be the current behavior of
> equals.
> Additionally to the operation you can specify a threshold local parameter
> This will provide the ability to search for the term "cat" so long as the
> payload has a value of greater than 0.75.
> One use case is to classify a document into various categories with an
> associated confidence or probability that the classification is correct.
> That can be indexed into a delimited payload field. The searches can find
> and match documents that were tagged with the "cat" category with a
> confidence of greater than 0.5.
> Example Document
> {code:java}
> {
> "id":"doc_1",
> "classifications_payload":["cat|0.75 dog|2.0"]
> }
> {code}
> Example Syntax
> {code:java}
> {!payload_check f=classifications_payload payloads='1' op='gt'
> threshold='0.5'}cat {code}
>
>
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]