Op Wednesday 29 May 2002 11:56, Karl �ie schreef: > b: convert the documents to something that is accessable through java like > xml, etc...
We're using wvWare (wvware.com) to convert word to html (or text) and index that and xpdf for converting PDF to text and index that. Any links on indexing using POI converters (or other java converters) are very welcome! Ewout > > the best way is to convert as the java api's for MSOffice documents still > are under development > > mvh karl �ie > > On Wednesday 29 May 2002 11:48, Rama Krishna wrote: > > Hi, > > > > I am trying to build a search engine which search in MS Word, excel, ppt > > and adobe pdf. I am not sure whether i can use Lucene for this or not. > > pl. help me out in this regard. > > > > > > Regards, > > Ramakrishna > > > > > > _________________________________________________________________ > > Chat with friends online, try MSN Messenger: http://messenger.msn.com -- Ewout Prangsma, Directeur Daisy Software Telefoon/fax: +31-77-3270305/3270306 Email: [EMAIL PROTECTED] Website: www.daisysoftware.com KvK Venlo nr. 12046144 -- To unsubscribe, e-mail: <mailto:[EMAIL PROTECTED]> For additional commands, e-mail: <mailto:[EMAIL PROTECTED]>
