Sathia created TIKA-3254:
----------------------------
Summary: Html font styles missing - doc to html
Key: TIKA-3254
URL: https://issues.apache.org/jira/browse/TIKA-3254
Project: Tika
Issue Type: Bug
Reporter: Sathia
Attachments: Sample.doc
Hi Team,
I tried using convert doc to xhtml using tika. the conversation is successful
but styles missing.
Attached *sample.doc* which I used. the below code I have used for conversation.
{{public}} {{String parseToHTML() }}{{throws}} {{IOException, SAXException,
TikaException {}}
{{ }}{{ContentHandler handler = }}{{new}} {{ToXMLContentHandler();}}
{{ }}{{AutoDetectParser parser = }}{{new}} {{AutoDetectParser();}}
{{ }}{{Metadata metadata = }}{{new}} {{Metadata();}}
{{ }}{{try}} {{(InputStream stream =
ContentHandlerExample.}}{{class}}{{.getResourceAsStream(}}{{"test.doc"}}{{)) {}}
{{ }}{{parser.parse(stream, handler, metadata);}}
{{ }}{{return}} {{handler.toString();}}
{{ }}{{}}}
{{}}}
Regards,
Sathia
--
This message was sent by Atlassian Jira
(v8.3.4#803005)