Re: Hierarchical stats for Solr

2012-12-18 Thread Ryan McKinley
Hi Steve-

The work you discuss sounds interesting, can you make a JIRA issue for this?

See:
http://wiki.apache.org/solr/HowToContribute#JIRA_tips_.28our_issue.2BAC8-bug_tracker.29

thanks
ryan


On Tue, Dec 18, 2012 at 3:09 PM, Steve McKay steve.mc...@gcecloud.comwrote:

 e.g. facet by vendor and then facet each vendor by year. I've also added
 stats.sort, stats.limit, and stats.offset field params. stats.sort syntax
 is sum|min|max|stdDev|average|sumOfSquares|count|missing|value:asc|desc
 and limit and offset work as in SQL. Faceting will generally use more RAM
 and be faster than the 4.0 baseline. I've changed more than some might
 consider to be strictly necessary; this is because a large part of my
 effort has been to make faceting performant under adverse conditions, with
 large result sets and faceting on fields with large (1m+) cardinalities. If
 there's interest I can post some rough response time numbers for faceting
 on fields with various cardinalities.



Re: Hierarchical stats for Solr

2012-12-18 Thread Steve McKay
Sure can, thanks!

On Dec 18, 2012, at 8:12 PM, Ryan McKinley 
ryan...@gmail.commailto:ryan...@gmail.com wrote:

Hi Steve-

The work you discuss sounds interesting, can you make a JIRA issue for this?

See:
http://wiki.apache.org/solr/HowToContribute#JIRA_tips_.28our_issue.2BAC8-bug_tracker.29

thanks
ryan


On Tue, Dec 18, 2012 at 3:09 PM, Steve McKay 
steve.mc...@gcecloud.commailto:steve.mc...@gcecloud.com wrote:
e.g. facet by vendor and then facet each vendor by year. I've also added 
stats.sort, stats.limit, and stats.offset field params. stats.sort syntax is 
sum|min|max|stdDev|average|sumOfSquares|count|missing|value:asc|desc and 
limit and offset work as in SQL. Faceting will generally use more RAM and be 
faster than the 4.0 baseline. I've changed more than some might consider to be 
strictly necessary; this is because a large part of my effort has been to make 
faceting performant under adverse conditions, with large result sets and 
faceting on fields with large (1m+) cardinalities. If there's interest I can 
post some rough response time numbers for faceting on fields with various 
cardinalities.