[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12793101#action_12793101
 ] 

Shalin Shekhar Mangar commented on SOLR-236:
--------------------------------------------

How about we change the current field collapsing response format to the 
following?

We add new well-known fields to each document itself, for example:
# "collapse.value" - contains the group field's value for this document
# "collapse.count" - the number of results collapsed under this document
# "collapse.aggregate.function(field-name)" - the aggregate value for the given 
function applied to the given field for this document's group

Example:
{code:xml}
<?xml version="1.0" encoding="UTF-8"?>
<response>
  <lst name="responseHeader">
    <int name="status">0</int>
    <int name="QTime">2</int>
    <lst name="params">
      <str name="collapse.field">manu_exact</str>
      <str name="collapse.aggregate">max(field1)</str>
      <str name="collapse.aggregate">avg(field1)</str>
      <str name="q">title:test</str>
      <str name="field.collapse">title</str>
      <str name="qt">collapse</str>
    </lst>
  </lst>
  <result name="response" numFound="30" start="0">
    <doc>
      <str name="id">F8V7067-APL-KIT</str>
      <str name="collapse.value">Belkin</str>
      <int name="collapse.count">1</int>
      <int name="collapse.aggregate.max(field1)">100</int>
      <float name="collapse.aggregate.avg(field1)">50.0</float>
    </doc>
    <doc>
      <str name="id">TWINX2048-3200PRO</str>
      <str name="collapse.value">Corsair Microsystems Inc.</str>
      <int name="collapse.count">3</int>
      <int name="collapse.aggregate.max(field1)">100</int>
      <float name="collapse.aggregate.avg(field1)">50.0</float>
    </doc>
  </result>
</response>
{code}
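
To illustrate how a client could consume this format, here is a minimal sketch in Python using only the standard library. It is not part of any patch: the function name read_collapsed_docs is hypothetical, and the XML is a trimmed copy of the example above. Every collapse value is read straight off the document it belongs to.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the proposed response from the example above.
RESPONSE = """<response>
  <result name="response" numFound="30" start="0">
    <doc>
      <str name="id">F8V7067-APL-KIT</str>
      <str name="collapse.value">Belkin</str>
      <int name="collapse.count">1</int>
      <int name="collapse.aggregate.max(field1)">100</int>
      <float name="collapse.aggregate.avg(field1)">50.0</float>
    </doc>
    <doc>
      <str name="id">TWINX2048-3200PRO</str>
      <str name="collapse.value">Corsair Microsystems Inc.</str>
      <int name="collapse.count">3</int>
      <int name="collapse.aggregate.max(field1)">100</int>
      <float name="collapse.aggregate.avg(field1)">50.0</float>
    </doc>
  </result>
</response>"""


def read_collapsed_docs(xml_text):
    """Return one dict per <doc>, keyed by field name (hypothetical helper)."""
    root = ET.fromstring(xml_text)
    docs = []
    for doc in root.findall("./result[@name='response']/doc"):
        fields = {}
        for child in doc:
            name = child.get("name")
            text = child.text or ""
            # Convert by element tag so counts and aggregates are numeric.
            if child.tag == "int":
                fields[name] = int(text)
            elif child.tag == "float":
                fields[name] = float(text)
            else:
                fields[name] = text
        docs.append(fields)
    return docs


for d in read_collapsed_docs(RESPONSE):
    # Group value and count come from the document itself.
    print(d["id"], d["collapse.value"], d["collapse.count"])
```

The client makes a single pass over the result list and never has to look up a second section by id.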

With this format there is no need for a separate section that clients must 
correlate with the results by uniqueKey. For this to work, CollapseComponent 
must generate a custom SolrDocumentList and set it as "results" in the response.

For request parameters:
# "collapse.aggregate" - Can we make this a multi-valued parameter instead of 
a comma-separated one?
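
As a rough sketch of what the multi-valued form could look like on the wire, the Python snippet below builds a query string in which "collapse.aggregate" is repeated once per function. The parameter values are taken from the example response above. The two-argument function sum(field1,field2) is purely hypothetical; it only illustrates one possible reason to avoid comma-separated values and is not something the patch is known to support.

```python
from urllib.parse import parse_qs, urlencode

# Repeat the parameter name once per aggregate function.
params = [
    ("q", "title:test"),
    ("collapse.field", "manu_exact"),
    ("collapse.aggregate", "max(field1)"),
    ("collapse.aggregate", "avg(field1)"),
]
query = urlencode(params)

# Parsing the query string yields each function as a separate value.
aggregates = parse_qs(query)["collapse.aggregate"]

# Hypothetical: a function taking two fields would be cut in half
# by a naive split on commas.
naive_pieces = "max(field1),sum(field1,field2)".split(",")

print(query)
print(aggregates)
print(naive_pieces)
```

With repeated parameters the server receives each function as its own value and does not have to parse a delimiter at all.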

> Field collapsing
> ----------------
>
>                 Key: SOLR-236
>                 URL: https://issues.apache.org/jira/browse/SOLR-236
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>    Affects Versions: 1.3
>            Reporter: Emmanuel Keller
>            Assignee: Shalin Shekhar Mangar
>             Fix For: 1.5
>
>         Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, field-collapse-3.patch, 
> field-collapse-4-with-solrj.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch, 
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch, 
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff, 
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff, 
> quasidistributed.additional.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, solr-236.patch, 
> SOLR-236_collapsing.patch, SOLR-236_collapsing.patch
>
>
> This patch includes a new feature called "Field collapsing".
> "Used in order to collapse a group of results with similar value for a given 
> field to a single entry in the result set. Site collapsing is a special case 
> of this, where all results for a given web site is collapsed into one or two 
> entries in the result set, typically with an associated "more documents from 
> this site" link. See also Duplicate detection."
> http://www.fastsearch.com/glossary.aspx?m=48&amid=299
> The implementation adds 3 new query parameters (SolrParams):
> "collapse.field" to choose the field used to group results
> "collapse.type" normal (default value) or adjacent
> "collapse.max" to select how many continuous results are allowed before 
> collapsing
> TODO (in progress):
> - More documentation (on source code)
> - Test cases
> Two patches:
> - "field_collapsing.patch" for current development version
> - "field_collapsing_1.1.0.patch" for Solr-1.1.0
> P.S.: Feedback and spelling corrections are welcome ;-)
