[
https://issues.apache.org/jira/browse/TIKA-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Lewis John McGibbney updated TIKA-1724:
---------------------------------------
Attachment: TIKA-1724.patch
Patch for trunk, folks.
I have a major problem with this patch. When I debug it, .obo files are
detected perfectly. During the parse phase of the Tika logic, however, this
parser is never invoked; the Text parser is invoked instead, and we merely
parse out the textual content.
Yes, the .obo (ontology manifestation) format is text based. However, it
contains distinct topic areas which we can make sense of.
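To illustrate the structure a plain-text parse loses, here is a minimal
sketch. It is NOT the code in the attached patch and does not use any Tika
API. It assumes the general layout described in the OBO 1.4 guide: header
tag-value pairs, followed by stanzas introduced by a bracketed type such as
[Term] or [Typedef]. The class name OboStanzaSketch, the Block type and the
sample identifiers are hypothetical, and trailing comments and escape
sequences are not handled.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: splits OBO text into a header block and stanzas.
public class OboStanzaSketch {

    /** One group of tag-value pairs: the header ("header") or a stanza such as "Term". */
    public static final class Block {
        public final String type;
        public final Map<String, List<String>> tags = new LinkedHashMap<>();

        Block(String type) {
            this.type = type;
        }
    }

    public static List<Block> split(String obo) {
        List<Block> blocks = new ArrayList<>();
        // Everything before the first "[...]" line belongs to the header.
        Block current = new Block("header");
        blocks.add(current);

        for (String raw : obo.split("\\R")) {
            String line = raw.trim();
            if (line.isEmpty() || line.startsWith("!")) {
                continue; // skip blank lines and whole-line comments
            }
            if (line.startsWith("[") && line.endsWith("]")) {
                // A bracketed line opens a new stanza, e.g. [Term].
                current = new Block(line.substring(1, line.length() - 1));
                blocks.add(current);
                continue;
            }
            int colon = line.indexOf(':');
            if (colon < 0) {
                continue; // not a tag-value pair; ignored in this sketch
            }
            // Split on the FIRST colon so values like "EX:0000001" stay intact.
            String tag = line.substring(0, colon).trim();
            String value = line.substring(colon + 1).trim();
            current.tags.computeIfAbsent(tag, k -> new ArrayList<>()).add(value);
        }
        return blocks;
    }

    public static void main(String[] args) {
        // Made-up sample content, not taken from a real ontology.
        String sample = String.join("\n",
                "format-version: 1.4",
                "",
                "[Term]",
                "id: EX:0000001",
                "name: example term",
                "",
                "[Typedef]",
                "id: part_of");
        for (Block b : split(sample)) {
            System.out.println(b.type + " " + b.tags);
        }
    }
}
```

Each Block keeps its stanza type alongside its tags, which is the kind of
information that could be surfaced as metadata instead of flat text.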
If ANYONE can help to debug WHY the OBO parser is not being called then I would
really appreciate it.
Thanks
> Create parser for .obo file format.
> -----------------------------------
>
> Key: TIKA-1724
> URL: https://issues.apache.org/jira/browse/TIKA-1724
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Fix For: 1.12
>
> Attachments: TIKA-1724.patch, TIKA-1724.patch
>
>
> This parser implementation handles files in the OBO flat file format, as
> described in the [OBO Flat File Format
> Guide, version 1.4|http://purl.obolibrary.org/obo/oboformat/spec.html].
> The OBO format is the text file format used by OBO-Edit, the open source,
> platform-independent application for viewing and editing ontologies. This
> file format is used heavily within the clinical and biomedical fields as a
> particular flat file serialization for ontologies. .obo files are typically
> accompanied by corresponding .owl serializations, as OWL is another file
> format used pervasively within the clinical and biomedical fields.
> I would sincerely appreciate code review. Thanks.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)