Re: [htdig] [EXT PARSER] .zip files

Pierre Cadeot Tue, 10 Jul 2001 14:39:56 -0700
Thanks a lot for your answer!

>If you could be sure that each and every .zip file contained only files of a
>single, known MIME-type then this should be straightforward using the
>'unzip' utility as an external converter, with the -c or -p option to send
>the contents to STDOUT.

Well, no, I can't be sure: the purpose of htdig on my sites is to index
documents submitted by people.
And for now they only submit ZIP files containing Word or Excel
files (but I have set things up to handle at least Word documents).

In fact the goal would be to trick htdig into thinking that zip files are
just directories, and then have it index the contents of those so-called
directories and their subtrees.
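
To make the "zip file as a directory" idea concrete, here is a minimal
sketch in Python. It is only an illustration of the idea, not something
htdig does or provides: the function name walk_zip is hypothetical, and it
relies only on Python's standard zipfile module. It yields one virtual path
per file in the archive and descends into nested .zip files as if they were
sub-directories.

```python
import io
import zipfile


def walk_zip(data, prefix=""):
    """Yield (virtual_path, content_bytes) for each file inside a zip.

    `data` is the raw bytes of the archive. Nested .zip members are
    treated as sub-directories and walked recursively. This helper is
    hypothetical and is not part of htdig.
    """
    with zipfile.ZipFile(io.BytesIO(data)) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue  # directory entries carry no content
            content = zf.read(info)
            path = prefix + info.filename
            is_nested = (
                info.filename.lower().endswith(".zip")
                and zipfile.is_zipfile(io.BytesIO(content))
            )
            if is_nested:
                # Descend into the nested archive as if it were a directory.
                yield from walk_zip(content, path + "/")
            else:
                yield path, content
```

Each yielded path could then be handed to whatever converter matches its
extension (Word, Excel, ...), which is the part that would still need to be
wired up separately.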

>In the general case, I think it would be wiser to settle for zipinfo as an
>external converter.  This would allow you to index the names of the files in
>the .zip file, plus any comments included in the zip.   I considered this
>myself,

With unzip or zipinfo, what would the config line look like?
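
For illustration, here is a rough sketch of what a zipinfo-style converter
could look like, written in Python rather than calling zipinfo itself. It
prints the archive comment, the member names and any per-file comments as
plain text. The script name zip_names.py, its path and the function name
describe_zip are hypothetical. The config line in the comment, and the
assumption that htdig passes the path of the fetched file as the first
argument, are my reading of the external_parsers attribute and should be
checked against the htdig documentation for the installed version.

```python
import sys
import zipfile

# Assumed htdig.conf entry (verify against the external_parsers docs):
#   external_parsers: application/zip->text/plain /path/to/zip_names.py


def describe_zip(source):
    """Return plain text listing the archive comment, names and comments.

    `source` may be a file path or a file-like object.
    """
    lines = []
    with zipfile.ZipFile(source) as zf:
        if zf.comment:
            lines.append(zf.comment.decode("utf-8", "replace"))
        for info in zf.infolist():
            lines.append(info.filename)
            if info.comment:
                lines.append(info.comment.decode("utf-8", "replace"))
    return "\n".join(lines) + "\n"


if __name__ == "__main__" and len(sys.argv) > 1:
    # Assumption: the first argument is the path of the file to convert.
    sys.stdout.write(describe_zip(sys.argv[1]))
```

This only indexes names and comments, not the contents of the Word or Excel
files inside the archive.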

>but decided not to index .zip files at all: my argument was that searchers
>could find the page containing a link to a .zip file, which was probably
>more use to them.

I understand your point, but I use htdig precisely for files that are
just uploaded to a directory without being linked from any page.
(I am using it while I finish writing the pages that will let people query
the database that tracks the files, with queries based on keywords,
description and so on... As I am in a hurry, I set htdig up first ;-)

Sorry if I'm expressing myself poorly, but I'm French and only
practise English when watching (action) movies ;-)

Thanks again,
Pierre

_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a 
subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html