Prasanth Iyer created TIKA-1478:
-----------------------------------
Summary: Build a parser to extract data from .dif format
Key: TIKA-1478
URL: https://issues.apache.org/jira/browse/TIKA-1478
Project: Tika
Issue Type: New Feature
Components: metadata, mime, parser
Affects Versions: 1.6
Reporter: Prasanth Iyer
An initial crawl of the Acadis website (https://www.aoncadis.org/home.htm)
revealed that a number of the files on this website are of the .dif type.
Currently, Tika categorizes these files as text/plain since it does not have a
parser for this type of file. The need is to provide metadata support and to
build a parser for this kind of file.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)