e the files and
> than delete the directory. But this would greatly slow indexing and use up
> disk space.
>
> Luke
>
> - Original Message -
> From: "Ernesto De Santis" <[EMAIL PROTECTED]>
> To: "Lucene Users List"
> Sent: Tuesday, March
01, 2005 12:39 PM
> To: Lucene Users List
> Subject: Re: Zip Files
>
> Thanks Ernesto.
>
> The issue I'm working with now (this is more lack of experience than
> anything) is getting an input I can index. All my indexing classes (doc,
> pdf, xml, ppt) take a File obj
Santis" <[EMAIL PROTECTED]>
To: "Lucene Users List"
Sent: Tuesday, March 01, 2005 10:48 AM
Subject: Re: Zip Files
> Hello
>
> first, you need a parser for each file type: pdf, txt, word, etc.
> and use a java api to iterate zip content, see:
>
> http://java.sun.com
);
ZipEntry zipEntry;
while(zipEntry = zis.getNextEntry() != null){
//use zipEntry to get name, etc.
//get properly parser for current entry
//use parser with zis (ZipInputStream)
}
good luck
Ernesto
Luke Shannon escribió:
Hello;
Anyone have an ideas on how to index the contents within zip files
Hello;
Anyone have an ideas on how to index the contents within zip files?
Thanks,
Luke
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]