Attribute on html tag not represented in XHTML -----------------------------------------------
Key: TIKA-379 URL: https://issues.apache.org/jira/browse/TIKA-379 Project: Tika Issue Type: Bug Components: parser Reporter: Julien Nioche The following HTML document : <html lang="fi"><head>document 1 title</head><body>jotain suomeksi</body></html> is rendered as the following xhtml by Tika : <?xml version="1.0" encoding="UTF-8"?><html xmlns="http://www.w3.org/1999/xhtml"><head><title/></head><body>document 1 titlejotain suomeksi</body></html> with the lang attribute getting lost. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.