Luis Filipe Nassif created TIKA-2390:
----------------------------------------
Summary: Extract images embedded in HTML
Key: TIKA-2390
URL: https://issues.apache.org/jira/browse/TIKA-2390
Project: Tika
Issue Type: Improvement
Components: parser
Affects Versions: 1.15
Reporter: Luis Filipe Nassif
Priority: Minor
We should handle images embedded in HTML as attachments, as we already do for
other formats. Are there encodings other than base64 used in the wild to embed
images in HTML?
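
To make the request concrete, here is a minimal sketch of the base64 case only.
It is not Tika code: the class DataUriImages and its extract method are
hypothetical names, and it scans raw HTML text with a regex rather than hooking
into the HTML parser or an embedded document extractor. On the encoding
question, RFC 2397 data URIs may also carry percent-encoded payloads when the
";base64" marker is absent; this sketch deliberately ignores those.

```java
import java.util.ArrayList;
import java.util.Base64;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper (not part of Tika): finds base64 "data:" URI images in HTML text.
final class DataUriImages {

    // data:image/<subtype>;base64,<payload>, with optional trailing '=' padding.
    private static final Pattern DATA_URI = Pattern.compile(
            "data:(image/[a-zA-Z0-9.+-]+);base64,([A-Za-z0-9+/]+={0,2})");

    // One decoded image: its declared MIME type and raw bytes.
    static final class Image {
        final String mimeType;
        final byte[] bytes;

        Image(String mimeType, byte[] bytes) {
            this.mimeType = mimeType;
            this.bytes = bytes;
        }
    }

    // Returns every base64 data-URI image found in the given HTML, in document order.
    static List<Image> extract(String html) {
        List<Image> images = new ArrayList<>();
        Matcher m = DATA_URI.matcher(html);
        while (m.find()) {
            try {
                byte[] bytes = Base64.getDecoder().decode(m.group(2));
                images.add(new Image(m.group(1), bytes));
            } catch (IllegalArgumentException e) {
                // Malformed base64 payload: skip it rather than fail the whole document.
            }
        }
        return images;
    }
}
```

Calling DataUriImages.extract(html) on a page returns one Image per inline
base64 image; in a real parser each of those would presumably be handed off as
an embedded resource instead of being collected in a list.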