Check out jakarta POI (http://jakarta.apache.org/poi ) particularly the HPSF
API. It allows you to extract metadata like Title, Author, etc. from OLE
documents.

-Ryan

----- Original Message ----- 
From: "Zhou, Oliver" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Friday, December 12, 2003 5:26 PM
Subject: textmining: document title


> Ryan,
>
> I'm using textmining and lucene to index word documents but don't know how
> to get word document title.  Your advice on this matter is appreciated.
>
> Thanks,
> Oliver Zhou
>
>


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to