Op Wednesday 29 May 2002 11:56, Karl �ie schreef:
> b: convert the documents to something that is accessable through java like
> xml, etc...

We're using wvWare (wvware.com) to convert word to html (or text) and index 
that and xpdf for converting PDF to text and index that. Any links on 
indexing using POI converters (or other java converters) are very welcome!

Ewout

>
> the best way is to convert as the java api's for MSOffice documents still
> are under development
>
> mvh karl �ie
>
> On Wednesday 29 May 2002 11:48, Rama Krishna wrote:
> > Hi,
> >
> > I am trying to build a search engine which search in MS Word, excel, ppt
> > and adobe pdf. I am not sure whether i can use Lucene for this or not. 
> > pl. help me out in this regard.
> >
> >
> > Regards,
> > Ramakrishna
> >
> >
> > _________________________________________________________________
> > Chat with friends online, try MSN Messenger: http://messenger.msn.com

-- 
Ewout Prangsma, Directeur
Daisy Software
Telefoon/fax: +31-77-3270305/3270306
Email: [EMAIL PROTECTED]
Website: www.daisysoftware.com
KvK Venlo nr. 12046144 




--
To unsubscribe, e-mail:   <mailto:[EMAIL PROTECTED]>
For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>

Reply via email to