[ https://issues.apache.org/jira/browse/LUCENE-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless resolved LUCENE-2393. ---------------------------------------- Fix Version/s: 4.0 Resolution: Fixed Thanks Tom! > Utility to output total term frequency and df from a lucene index > ----------------------------------------------------------------- > > Key: LUCENE-2393 > URL: https://issues.apache.org/jira/browse/LUCENE-2393 > Project: Lucene - Java > Issue Type: New Feature > Components: contrib/* > Reporter: Tom Burton-West > Priority: Trivial > Fix For: 4.0 > > Attachments: LUCENE-2393.patch, LUCENE-2393.patch, LUCENE-2393.patch, > LUCENE-2393.patch, LUCENE-2393.patch, LUCENE-2393.patch, LUCENE-2393.patch > > > This is a pair of command line utilities that provide information on the > total number of occurrences of a term in a Lucene index. The first takes a > field name, term, and index directory and outputs the document frequency for > the term and the total number of occurrences of the term in the index (i.e. > the sum of the tf of the term for each document). The second reads the > index to determine the top N most frequent terms (by document frequency) and > then outputs a list of those terms along with the document frequency and the > total number of occurrences of the term. Both utilities are useful for > estimating the size of the term's entry in the *prx files and consequent Disk > I/O demands. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org