Missing required alt attribute on img tag
-----------------------------------------
Key: TIKA-650
URL: https://issues.apache.org/jira/browse/TIKA-650
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 0.9
Reporter: Raimund Merkert
I've written a content handler that prints out the XHTML tags produced when
converting a Word document with embedded images. For images, Tika does not
generate the "alt" attribute on img tags, which causes validation to fail,
because alt is a required attribute in XHTML.
Here's a partial output from [http://validator.w3.org/check]:
{quote}
Error Line 3, Column 1026: required attribute "alt" not specified
...meta><title> </title></head><body><p><img
src="embedded:image63.jpg"></img></p>
The attribute given above is required for an element that you've used, but you
have omitted it. For instance, in most HTML and XHTML document types the "type"
attribute is required on the "script" element and the "alt" attribute is
required for the "img" element.
Typical values for type are type="text/css" for <style> and
type="text/javascript" for <script>.
{quote}
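Until the parser emits the attribute itself, a possible workaround on the
consumer side is to wrap the content handler in a SAX filter that adds an empty
alt attribute to any img element that lacks one. The sketch below is not part
of Tika and is not the fix for this issue. The class name AltAttributeFilter is
hypothetical, and using an empty string as the alt text is an assumption. It
relies only on the org.xml.sax classes shipped with the JDK.

```java
import org.xml.sax.Attributes;
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.AttributesImpl;
import org.xml.sax.helpers.XMLFilterImpl;

/**
 * Hypothetical workaround: forwards all SAX events to a downstream handler,
 * adding alt="" to img elements that do not already carry an alt attribute.
 */
public class AltAttributeFilter extends XMLFilterImpl {

    public AltAttributeFilter(ContentHandler downstream) {
        // XMLFilterImpl forwards every content event to this handler.
        setContentHandler(downstream);
    }

    @Override
    public void startElement(String uri, String localName, String qName,
                             Attributes atts) throws SAXException {
        boolean isImg = "img".equals(localName) || "img".equals(qName);
        boolean hasAlt = atts.getValue("", "alt") != null
                || atts.getValue("alt") != null;
        if (isImg && !hasAlt) {
            // Copy the attributes so the caller's instance is left untouched.
            AttributesImpl patched = new AttributesImpl(atts);
            // Assumption: an empty alt text is acceptable for embedded images.
            patched.addAttribute("", "alt", "alt", "CDATA", "");
            atts = patched;
        }
        super.startElement(uri, localName, qName, atts);
    }
}
```

To use it, pass new AltAttributeFilter(myHandler) to the parser in place of
myHandler, where myHandler is the existing content handler.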