You can create a CONCORDANCE file. Click on this link, then click on CONCORDANCE. Their explanation is more succinct than I could manage quickly. http://office.microsoft.com/en-us/word/HP051892831033.aspx
Suzanne L. Perry P O Box 7493 Louisville, KY 40257-0493 Phone: 502-424-5435 Phone: 502-317-1307 E-mail: [email protected] --- On Fri, 4/3/09, David B Teague <[email protected]> wrote: From: David B Teague <[email protected]> Subject: Re: [users] Automatically generated indexes To: [email protected] Date: Friday, April 3, 2009, 5:42 PM Brian Barker wrote: > At 12:23 03/04/2009 +0800, Bashar Maree wrote: >> In OOo I can easily generate a table of contents that would list the >> headings in the document plus their page numbers. Is it possible to auto >> generate such indexes for diagrams and tables that are embedded in a >> document. > > Yes. First you need to add captions to the items concerned. Go to Insert | > Caption... or right-click | Caption... to do this. (Perhaps you have already > done this.) Then create your index in a similar way to how you would a table > of contents. > > o Go to Insert | Indexes and Tables > | Indexes and Tables... | Index/Table > | Type and title. > o Against Type, select what you need from the drop-down menu. "Illustration > Index" and "Index of Tables" are both available. > > I trust this helps. > > Brian Barker > Well Dang! I misread the intent of the question completely. I want to generate an index of keywords (and perhaps phrases) in a document. That's involved if done by hand, and if automatic, one has to have heuristics to decide what to include. And an automatic index would have to have some words the author would prefer were in the index and conversely. Does anyone know of an automatic index generator? One way is: Create a file of pairs of each of the words in the document with the page number where it occurs. Remove unwanted words, (a, an, the, of, for, from, then, than, conjugations of "to be" and "to have" etc. Remove other words not wanted in the index. Sort the pairs alphabetically on the words, ignoring case. Might want to do the "Remove unwanted words" here, where they will all be together. Remove duplicates, making a list of the page numbers of the occurrences on the remaining instance of the word. This is a naive algorithm, but I have thought about the problem a bit. My solution requires some programming, which I DO NOT want to do. Does anyone know of a better solution out there? I hope so for otherwise I won't have an index generator otherwise. David Teague --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
