[ https://issues.apache.org/jira/browse/LUCENE-2393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tom Burton-West updated LUCENE-2393: ------------------------------------ Attachment: LUCENE-2393-3xbranch.patch Since many people will want to use branch 3.x instead of trunk, I back-ported the flex version to 3x ( patched against http://svn.apache.org/repos/asf/lucene/dev/branches/branch_3x/lucene : 955141) Mike, can this be committed to branch_3x? Tom > Utility to output total term frequency and df from a lucene index > ----------------------------------------------------------------- > > Key: LUCENE-2393 > URL: https://issues.apache.org/jira/browse/LUCENE-2393 > Project: Lucene - Java > Issue Type: New Feature > Components: contrib/* > Reporter: Tom Burton-West > Priority: Trivial > Fix For: 4.0 > > Attachments: LUCENE-2393-3xbranch.patch, LUCENE-2393.patch, > LUCENE-2393.patch, LUCENE-2393.patch, LUCENE-2393.patch, LUCENE-2393.patch, > LUCENE-2393.patch, LUCENE-2393.patch > > > This is a pair of command line utilities that provide information on the > total number of occurrences of a term in a Lucene index. The first takes a > field name, term, and index directory and outputs the document frequency for > the term and the total number of occurrences of the term in the index (i.e. > the sum of the tf of the term for each document). The second reads the > index to determine the top N most frequent terms (by document frequency) and > then outputs a list of those terms along with the document frequency and the > total number of occurrences of the term. Both utilities are useful for > estimating the size of the term's entry in the *prx files and consequent Disk > I/O demands. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org