[
https://issues.apache.org/jira/browse/SOLR-16857?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joel Bernstein updated SOLR-16857:
----------------------------------
Description:
E-commerce catalogs often use collapse to fold the product records within a
group into a single result. For example, a particular couch might come in
different colors or fabrics, so a single couch might have a large number of
slightly different records within its group.
When a collapsed query is reranked with a vector query, the vector query
selects the top K matches based on the vector. The top K can therefore include
multiple records from the same product group, even though the collapse selected
only one group head from that group. This pollutes the top K results with
duplicate records of no value.
The solution is to devise a filter that limits the vector query to searching
only the group heads selected by the collapse.
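
To make the problem and the proposed filter concrete, here is a minimal sketch
in plain Python. It is not Solr code and not the implementation proposed in
this issue: the Doc type, the collapse() and vector_top_k() helpers, the sample
records, and the choice of "highest score wins" as the group head rule are all
assumptions made for illustration.

```python
import math
from collections import namedtuple

# Hypothetical record: id, collapse group, relevance score, embedding vector.
Doc = namedtuple("Doc", "doc_id group score vector")


def collapse(docs):
    """Keep one group head per group (assumed rule: highest score wins)."""
    heads = {}
    for d in docs:
        best = heads.get(d.group)
        if best is None or d.score > best.score:
            heads[d.group] = d
    return list(heads.values())


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def vector_top_k(docs, query_vec, k, allowed_ids=None):
    """Return the ids of the k most similar docs, optionally restricted
    to the ids in allowed_ids (the filter described above)."""
    candidates = [
        d for d in docs if allowed_ids is None or d.doc_id in allowed_ids
    ]
    ranked = sorted(
        candidates, key=lambda d: cosine(d.vector, query_vec), reverse=True
    )
    return [d.doc_id for d in ranked[:k]]


# Made-up catalog: three variants of one couch plus two other products.
docs = [
    Doc("couch-red", "couch", 3.0, (1.0, 0.0)),
    Doc("couch-blue", "couch", 2.0, (0.9, 0.1)),
    Doc("couch-grey", "couch", 1.0, (0.8, 0.2)),
    Doc("chair-oak", "chair", 2.5, (0.5, 0.5)),
    Doc("table-pine", "table", 1.5, (0.0, 1.0)),
]
query_vec = (1.0, 0.0)

# Ids of the group heads chosen by the collapse.
head_ids = {d.doc_id for d in collapse(docs)}

# Without the filter, the top K is filled by variants of the same couch.
unfiltered = vector_top_k(docs, query_vec, k=3)

# With the filter, the vector query only sees the selected group heads.
filtered = vector_top_k(docs, query_vec, k=3, allowed_ids=head_ids)

print(unfiltered)
print(filtered)
```

In this toy data the unfiltered top 3 consists entirely of couch variants, while
the filtered top 3 contains one record per product group.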
was:
E-commerce catalogs often use collapse to collapse product records within a
group. For example a particular couch might come in different colors or
fabrics. So a single couch might have a large number of slightly different
records within the group.
When reranking a collapsed query with a vector query the vector query will
select the top K matches based on the vector. The top K could include multiple
records from within the same product group although only one group head was
selected from the group. This will pollute the top K results with lots
duplicate records of no value.
The solution is to devise a filter that limits the vector query to searching on
the selected group heads from the collapse.
> Efficiently rerank collapsed queries with vector queries
> --------------------------------------------------------
>
> Key: SOLR-16857
> URL: https://issues.apache.org/jira/browse/SOLR-16857
> Project: Solr
> Issue Type: New Feature
> Security Level: Public(Default Security Level. Issues are Public)
> Reporter: Joel Bernstein
> Priority: Major