[ https://issues.apache.org/jira/browse/SOLR-1972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13540211#comment-13540211 ]
Shawn Heisey commented on SOLR-1972: ------------------------------------ Lance, I see two other problems with OnlineSummarizer. 1) There are only 100 samples. That's apparently enough for 0/25/50/75/100 quartiles, but it seems very low for 95/99. I was even worried about the 1024 samples in the metrics library, but apparently it actually works very well. 2) Solr already includes mahout-math 0.3 as a dependency of carrot2. OnlineSummarizer was introduced in 0.4, and there are some serious bugs in it as late as 0.6. Upgrading mahout-math to 0.7 causes clustering (carrot2) tests to fail because it can't find a mahout class. Even the newest version of carrot2 depends specifically on mahout-math 0.3. I have a checkout where I am re-applying the metrics patch and have made RequestHandlerBase implement Closeable, but I have zero knowledge about where the close() method would actually have to be called. I have been attempting to figure it out, but Solr is a very large and complex codebase, so I will need some help. I'm almost always idling in #lucene and #solr as 'elyograg' if someone wants to give me some live pointers. I can do the actual work, but I need some help figuring out where handlers are created and destroyed. > Need additional query stats in admin interface - median, 95th and 99th > percentile > --------------------------------------------------------------------------------- > > Key: SOLR-1972 > URL: https://issues.apache.org/jira/browse/SOLR-1972 > Project: Solr > Issue Type: Improvement > Components: web gui > Affects Versions: 1.4 > Reporter: Shawn Heisey > Assignee: Alan Woodward > Priority: Minor > Fix For: 4.2, 5.0 > > Attachments: elyograg-1972-3.2.patch, elyograg-1972-3.2.patch, > elyograg-1972-trunk.patch, elyograg-1972-trunk.patch, leak-closeable.patch, > leak.patch, revert-SOLR-1972.patch, SOLR-1972-branch3x-url_pattern.patch, > SOLR-1972-branch4x.patch, SOLR-1972-branch4x.patch, SOLR-1972_metrics.patch, > SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, > SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, SOLR-1972_metrics.patch, > SOLR-1972_metrics.patch, solr1972-metricsregistry-branch4x-failure.log, > SOLR-1972.patch, SOLR-1972.patch, SOLR-1972.patch, SOLR-1972.patch, > SOLR-1972-url_pattern.patch, stacktraces.tar.gz > > > I would like to see more detailed query statistics from the admin GUI. This > is what you can get now: > requests : 809 > errors : 0 > timeouts : 0 > totalTime : 70053 > avgTimePerRequest : 86.59209 > avgRequestsPerSecond : 0.8148785 > I'd like to see more data on the time per request - median, 95th percentile, > 99th percentile, and any other statistical function that makes sense to > include. In my environment, the first bunch of queries after startup tend to > take several seconds each. I find that the average value tends to be useless > until it has several thousand queries under its belt and the caches are > thoroughly warmed. The statistical functions I have mentioned would quickly > eliminate the influence of those initial slow queries. > The system will have to store individual data about each query. I don't know > if this is something Solr does already. It would be nice to have a > configurable count of how many of the most recent data points are kept, to > control the amount of memory the feature uses. The default value could be > something like 1024 or 4096. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org