Hi, In the current code, the images from Word documents are referenced by "embedded:xxx" links in the generated HTML. This causes the browsers display "x" icon instead of the image.
The proposed patch encodes the images using Data URI, if there is -Dtika.parsers.urlimages system property. http://en.wikipedia.org/wiki/Data_URI_scheme So the default behavior is the same, but users of the library can optionally generate self-contained HTML with correct images. Thank you, Andrew.
