Check out jakarta POI (http://jakarta.apache.org/poi ) particularly the HPSF API. It allows you to extract metadata like Title, Author, etc. from OLE documents.
-Ryan ----- Original Message ----- From: "Zhou, Oliver" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Sent: Friday, December 12, 2003 5:26 PM Subject: textmining: document title > Ryan, > > I'm using textmining and lucene to index word documents but don't know how > to get word document title. Your advice on this matter is appreciated. > > Thanks, > Oliver Zhou > > --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
