As a spinoff, I was wondering if anyone has been happy with indexing and searching Word docs. What about reading the contents? Any problems?
-----Original Message----- From: Ryan Ackley [mailto:[EMAIL PROTECTED] Sent: Friday, December 12, 2003 5:59 PM To: Zhou, Oliver; Lucene Users List Subject: Re: textmining: document title Check out jakarta POI (http://jakarta.apache.org/poi ) particularly the HPSF API. It allows you to extract metadata like Title, Author, etc. from OLE documents. -Ryan ----- Original Message ----- From: "Zhou, Oliver" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Friday, December 12, 2003 5:26 PM Subject: textmining: document title > Ryan, > > I'm using textmining and lucene to index word documents but don't know how > to get word document title. Your advice on this matter is appreciated. > > Thanks, > Oliver Zhou > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
