[ 
https://issues.apache.org/jira/browse/SOLR-5302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13969478#comment-13969478
 ] 

Grant Ingersoll commented on SOLR-5302:
---------------------------------------

I don't agree.  Distributed is and should be the default mode we do everything 
in going forward and if we don't account for it up front, then we end up making 
all kinds of compromises on it and/or it takes years to get done (just look at 
MLT).  I can almost guarantee you the first question on the list once this is 
released is "how come it doesn't work in distributed".  This is not a case of 
the "perfect being the enemy of the good enough", but a case of missing the 
fact that the usage of distributed is the world we live in and so this patch 
only serves those going backwards and not those going forward.

It would be one thing if this issue had a plan for what can be distributed and 
what can't and an approach outlined such that it _could_ be implemented sooner 
rather than later, but that doesn't appear to be the case, AFAICT.  For 
instance, some of the stats that can't be easily distributed do have 
approximations that can be.

bq. We'd essentially need a generic distributed map-reduce implementation that 
could run inside a query. +1 for that.

See https://issues.apache.org/jira/browse/SOLR-5069.


> Analytics Component
> -------------------
>
>                 Key: SOLR-5302
>                 URL: https://issues.apache.org/jira/browse/SOLR-5302
>             Project: Solr
>          Issue Type: New Feature
>            Reporter: Steven Bower
>            Assignee: Erick Erickson
>             Fix For: 5.0
>
>         Attachments: SOLR-5302.patch, SOLR-5302.patch, SOLR-5302.patch, 
> SOLR-5302.patch, Search Analytics Component.pdf, Statistical Expressions.pdf, 
> solr_analytics-2013.10.04-2.patch
>
>
> This ticket is to track a "replacement" for the StatsComponent. The 
> AnalyticsComponent supports the following features:
> * All functionality of StatsComponent (SOLR-4499)
> * Field Faceting (SOLR-3435)
> ** Support for limit
> ** Sorting (bucket name or any stat in the bucket
> ** Support for offset
> * Range Faceting
> ** Supports all options of standard range faceting
> * Query Faceting (SOLR-2925)
> * Ability to use overall/field facet statistics as input to range/query 
> faceting (ie calc min/max date and then facet over that range
> * Support for more complex aggregate/mapping operations (SOLR-1622)
> ** Aggregations: min, max, sum, sum-of-square, count, missing, stddev, mean, 
> median, percentiles
> ** Operations: negation, abs, add, multiply, divide, power, log, date math, 
> string reversal, string concat
> ** Easily pluggable framework to add additional operations
> * New / cleaner output format
> Outstanding Issues:
> * Multi-value field support for stats (supported for faceting)
> * Multi-shard support (may not be possible for some operations, eg median)



--
This message was sent by Atlassian JIRA
(v6.2#6252)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to