Re: [users] Automatically generated indexes

S Perry Fri, 03 Apr 2009 16:16:11 -0700

You can create a CONCORDANCE file.  Click on this link, then click on 
CONCORDANCE.  Their explanation is more succinct than I could manage quickly.  
http://office.microsoft.com/en-us/word/HP051892831033.aspx

Suzanne L. Perry
P O Box 7493
Louisville, KY 40257-0493
Phone: 502-424-5435
Phone: 502-317-1307
E-mail: [email protected]

--- On Fri, 4/3/09, David B Teague <[email protected]> wrote:

From: David B Teague <[email protected]>
Subject: Re: [users] Automatically generated indexes
To: [email protected]
Date: Friday, April 3, 2009, 5:42 PM

Brian Barker wrote:
> At 12:23 03/04/2009 +0800, Bashar Maree wrote:
>> In OOo I can easily generate a table of contents that would list the 
>> headings in the document plus their page numbers.  Is it possible to auto 
>> generate such indexes for diagrams and tables that are embedded in a 
>> document.
> 
> Yes.  First you need to add captions to the items concerned.  Go to Insert | 
> Caption... or right-click | Caption... to do this.  (Perhaps you have already 
> done this.)  Then create your index in a similar way to how you would a table 
> of contents.
> 
> o  Go to Insert | Indexes and Tables > | Indexes and Tables... | Index/Table 
> | Type and title.
> o  Against Type, select what you need from the drop-down menu.  "Illustration 
> Index" and "Index of Tables" are both available.
> 
> I trust this helps.
> 
> Brian Barker
> 
Well Dang! I misread the intent of the question completely.

I want to generate an index of keywords (and perhaps phrases) in a document. 
That's involved if done by hand, and if automatic, one has to have heuristics 
to decide what to include. And an automatic index would have to have some words 
the author would prefer were in the index and conversely.

Does anyone know of an automatic index generator? One way is:

Create a file of pairs of each of the words in the document with the page 
number where it occurs.
Remove unwanted words, (a, an, the, of, for, from, then, than, conjugations of 
"to be" and "to have" etc.
Remove other words not wanted in the index. Sort the pairs alphabetically on 
the words, ignoring case.
Might want to do the "Remove unwanted words" here, where they will all be 
together.
Remove duplicates, making a list of the page numbers of the occurrences on the 
remaining instance of the word. 
This is a naive algorithm, but I have thought about the problem a bit.

My solution requires some programming, which I DO NOT want to do. Does anyone 
know of a better solution out there? I hope so for otherwise I won't have an 
index generator otherwise.

David Teague

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [users] Automatically generated indexes

Reply via email to