Hi,

In the current code, the images from Word documents are referenced by
"embedded:xxx" links in the generated HTML. This causes the browsers
display "x" icon instead of the image.

The proposed patch encodes the images using Data URI, if there is
-Dtika.parsers.urlimages system property.

http://en.wikipedia.org/wiki/Data_URI_scheme

So the default behavior is the same, but users of the library can
optionally generate self-contained HTML with correct images.

Thank you,

Andrew.

Reply via email to