On Mon, 9 Jun 2014, Bing Ran wrote:
Now I'm looking at some docx files and wondering if there's something similar to the hwpf WordToHtmlConverter/WordToTextConverter which has served me very well for extracting text and images for doc files.

For plain text, try XWPFWordExtractor. For HTML, try Apache Tika (which wraps Apache POI)

Nick

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to