[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Shalin Shekhar Mangar updated SOLR-236:
---------------------------------------

    Attachment: SOLR-236.patch

Changes:

# Modified configuration as Noble suggested. The 
AggregateCollapseCollectorFactory is now PluginInfoInitialized instead of 
NamedListInitialzed and functions are plugins. The "name" attribute is removed 
from "collapseCollectorFactory" since it is no longer necessary:
{code:xml}
<searchComponent name="collapse" 
class="org.apache.solr.handler.component.CollapseComponent">
    <collapseCollectorFactory 
class="solr.fieldcollapse.collector.DocumentGroupCountCollapseCollectorFactory" 
/>

    <collapseCollectorFactory 
class="solr.fieldcollapse.collector.FieldValueCountCollapseCollectorFactory" />

    <collapseCollectorFactory 
class="solr.fieldcollapse.collector.DocumentFieldsCollapseCollectorFactory" />

    <collapseCollectorFactory 
class="org.apache.solr.search.fieldcollapse.collector.AggregateCollapseCollectorFactory">
      <function name="sum" 
class="org.apache.solr.search.fieldcollapse.collector.aggregate.SumFunction"/>
      <function name="avg" 
class="org.apache.solr.search.fieldcollapse.collector.aggregate.AverageFunction"/>
      <function name="min" 
class="org.apache.solr.search.fieldcollapse.collector.aggregate.MinFunction"/>
      <function name="max" 
class="org.apache.solr.search.fieldcollapse.collector.aggregate.MaxFunction"/>
    </collapseCollectorFactory>

        <fieldCollapseCache
      class="solr.FastLRUCache"
      size="512"
      initialSize="512"
      autowarmCount="128"/>
    
  </searchComponent>
{code}
# Changed DistributedFieldCollapsingIntegrationTest to use 
BaseDistributedSearchTestCase. This fails right now. I believe there is a bug 
with the distributed implementation. The distributed version returns one extra 
group when compared to the non-distributed version. I've put an @Ignore 
annotation on that test.

We can consider creating the functions through a factory so that they can 
accept initialization parameters. The schema-fieldcollapse.xml and 
solrconfig-fieldcollapse.xml are no longer necessary and can be removed.

Next steps:
# Let us open issues for all the modifications needed in Solr to support this 
feature. That will help us break down this patch into more manageable (and 
easily reviewable) pieces. I guess we need one for providing custom Collectors 
for SolrIndexSearcher methods. Any others?
# The response format is not very clear in the wiki. We should add more 
examples and explain the format.

> Field collapsing
> ----------------
>
>                 Key: SOLR-236
>                 URL: https://issues.apache.org/jira/browse/SOLR-236
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>    Affects Versions: 1.3
>            Reporter: Emmanuel Keller
>            Assignee: Shalin Shekhar Mangar
>             Fix For: 1.5
>
>         Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, field-collapse-3.patch, 
> field-collapse-4-with-solrj.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch, 
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch, 
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff, 
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff, 
> quasidistributed.additional.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, solr-236.patch, 
> SOLR-236_collapsing.patch, SOLR-236_collapsing.patch
>
>
> This patch include a new feature called "Field collapsing".
> "Used in order to collapse a group of results with similar value for a given 
> field to a single entry in the result set. Site collapsing is a special case 
> of this, where all results for a given web site is collapsed into one or two 
> entries in the result set, typically with an associated "more documents from 
> this site" link. See also Duplicate detection."
> http://www.fastsearch.com/glossary.aspx?m=48&amid=299
> The implementation add 3 new query parameters (SolrParams):
> "collapse.field" to choose the field used to group results
> "collapse.type" normal (default value) or adjacent
> "collapse.max" to select how many continuous results are allowed before 
> collapsing
> TODO (in progress):
> - More documentation (on source code)
> - Test cases
> Two patches:
> - "field_collapsing.patch" for current development version
> - "field_collapsing_1.1.0.patch" for Solr-1.1.0
> P.S.: Feedback and misspelling correction are welcome ;-)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to