Oh, you mean MS Word files! Great.

To index MS Word files, you have to find and test MS Word to HTML (or MS Word to plain text) converter; there are a number of options, I remember 'word2x' and 'antiword' programs. Quick search on freshmeat.net should help you. Please test the converter you choose to make sure it understand _your_ MS Word files.

Next, see what Content-Type your web server returns to you; it may be 'application/x-ms-word' or something like that. If web server returns
'application/octet-stream', 'text/plain' or something like that (generic),
you have to fix it (in Apache it is done with a simple

AddType application/msword doc

line in httpd.conf

Next, you add an appropriate 'Converter' directive to aspseek.conf file. Format is described in documentation (see man aspseek.conf (5) or
http://www.aspseek.org/man/aspseek.conf.5.php#lbAM)

Regarding your second question about titles for plain-text documents,
you have two options here. First, you can insert the titles manually (or with a simple script) directly into urlwordsXX MySQL tables with a simple
INSERT statements.

Second, you can again use "Converters" feature to make ASPseek's index convert plain text documents into HTML. In this case the term 'conversion' would simply means inserting

<HTML><HEAD><TITLE>your_title</TITLE></HEAD>
<BODY><XMP>

before the contents of plain text, and

</XMP></BODY></HTML>

right after the end. You also have to change 'your_title' above to something sensible. Probably some heuristic can be used in the process, but it all depends on what texts do you have.

PS If you have more questions, please subscribe to and post your questions to aseek-users@ mailing list. Sub info is given at http://www.aspseek.org/ml-users.php



NIRANJAN SINGH wrote:
I have a bunch of word files that I woulapplication/msword docd like to index with ASPseek. Is that
possible?

Second, to improve indexing, can I introduce titles into the text using ASPseek?
Thanks,
Niranjan


Kir Kolyshkin <[EMAIL PROTECTED]> 01/08/03 05:08AM >>>

Sorry, I do not quite understand your question. Could you emphasize on it?

NIRANJAN SINGH wrote:

Hello Mr. Kolyshkin,

Is it possible to inject identifiers (words, numbers), using ASPseek, into text content that needs to be indexed and searched on using ASPseek?

Your advise will be invaluable,


--
== kir_at_asplinux.ru == 7551596_at_ICQ == 6722750_at_sms.beemail.ru ==

Dream like you'll live forever...Love like you've never been hurt...
Work like you don't need the money...and Dance like nobody is watching!
       -- Satchel Paige

Reply via email to