[ https://issues.apache.org/jira/browse/TIKA-1724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lewis John McGibbney updated TIKA-1724:
---------------------------------------
    Attachment: TIKA-1724.patch

Patch for trunk. It comes with some test cases and Javadoc explaining the 
document format, structure, etc.
There is some commented-out code. I have left it in because I feel this 
parser can be extended in the future, for example to deal with .obo imports 
(essentially references to other files).
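
To illustrate what handling imports might involve, here is a minimal sketch. 
It is not code from the attached patch and does not use the Tika API. It 
assumes that imports appear as "import:" tags in the document header, i.e. 
before the first stanza line such as "[Term]", and that a "!" starts a 
trailing comment (escaped "\!" sequences are not handled). The class name, 
method name and sample URL are hypothetical.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, for illustration only; not part of TIKA-1724.patch.
class OboImportSketch {

    // Collects the values of "import:" tags found in the header of an OBO
    // document. Reading stops at the first stanza line (one starting with "[").
    static List<String> readImports(BufferedReader reader) throws IOException {
        List<String> imports = new ArrayList<>();
        String line;
        while ((line = reader.readLine()) != null) {
            String trimmed = line.trim();
            if (trimmed.startsWith("[")) {
                break; // first stanza reached, so the header is over
            }
            if (trimmed.startsWith("import:")) {
                String value = trimmed.substring("import:".length());
                // Simplification: treat any '!' as the start of a comment.
                int comment = value.indexOf('!');
                if (comment >= 0) {
                    value = value.substring(0, comment);
                }
                value = value.trim();
                if (!value.isEmpty()) {
                    imports.add(value);
                }
            }
        }
        return imports;
    }

    public static void main(String[] args) throws IOException {
        // Made-up sample document; the import URL is a placeholder.
        String sample = "format-version: 1.4\n"
                + "import: http://purl.obolibrary.org/obo/example.obo\n"
                + "\n"
                + "[Term]\n"
                + "id: EX:0000001\n"
                + "name: example term\n";
        List<String> imports =
                readImports(new BufferedReader(new StringReader(sample)));
        System.out.println(imports);
    }
}
```

A parser extended along these lines could report each import as metadata or 
resolve and parse the referenced file; which of the two is appropriate is an 
open design question.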

> Create parser for .obo file format.
> -----------------------------------
>
>                 Key: TIKA-1724
>                 URL: https://issues.apache.org/jira/browse/TIKA-1724
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Lewis John McGibbney
>            Assignee: Lewis John McGibbney
>             Fix For: 1.11
>
>         Attachments: TIKA-1724.patch
>
>
> This parser implementation caters for files of the OBO flat file MimeType, 
> as described in the 
> [OBO Flat File Format Guide, version 1.4|http://purl.obolibrary.org/obo/oboformat/spec.html].
> The OBO format is the text file format used by OBO-Edit, the open source, 
> platform-independent application for viewing and editing ontologies. This 
> file format is used heavily within the clinical and biomedical fields as a 
> flat file serialization for ontologies. .obo files are 'typically' 
> accompanied by corresponding .owl serializations, as OWL is another file 
> format used pervasively within the clinical and biomedical fields.
> I would sincerely appreciate code review. Thanks.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
