> Thanks a lot for your answer !
>
> >If you could be sure that each and every .zip file contained only files
of a
> >single, known MIME-type then this should be straightforward using the
> >'unzip' utility as an external converter, with the -c or -p option to
send
> >the contents to STDOUT.
>
> Well nope i can't be sure : the purpose of htdig on my sites is to index
> documents submitted by people.
> And.. as for now they only submit ZIP files containing Word or Excel
> files... (but i set up things to handle at least Word bytes)
>
> In fact the goal would be to cheat htdig thinking zip files are just dirs
> and then have it index the content of those so called dirs and their
> subtrees...
>
> >In the general case, I think it would be wiser to settle for zipinfo as
an
> >external converter.  This would allow you to index the names of the files
in
> >the .zip file, plus any comments included in the zip.   I considered this
> >myself,
>
> With unzip or zipinfo what would the config line be like ?

You would probably have to write a script to invoke zipinfo or unzip with
suitable options.
If you used zipinfo then your config. file would need something like:

external_parsers:    application/zip->text/plain
/full/path/name/to/zipinfoscript

Where zipinfoscript could be something like (I have NOT tried this):

#!/bin/sh
exec /usr/bin/zipinfo -v $1

Hope that helps.

>
> >but decided not to index .zip files atall: my argument was that searchers
> >could find the page containing a link to a .zip file, which was probably
> >more use to them.
>
> I understand your point but i precisely use htdig only for files that are
> just uploaded in a dir without beeing linked by any page...
> (i use it while finishing writing the pages enabling people querying the
> database handling the files, queries based on keywords, description and
> everything... as i am in a hurry i have set htdig up first ;-)
>
> Sorry if i'm expressing myself in a somewhat lousy way but i'm french and
> only practising english when watching (action) movies ;-)
>
> Thanks again,
> Pierre
>

--
David Adams
Computing Services
Southampton University


_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to