[ https://issues.apache.org/jira/browse/TIKA-2762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16669704#comment-16669704 ]
Hudson commented on TIKA-2762:
------------------------------
UNSTABLE: Integrated in Jenkins build tika-branch-1x #121 (See
[https://builds.apache.org/job/tika-branch-1x/121/])
TIKA-2762 Capture short fields (<150 chars) in EnviParserHeader Metadata
(tallison: [https://github.com/apache/tika/commit/6b2cdc9049691ca37f5e37d7721a32a976f9f456])
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/envi/EnviHeaderParser.java
TIKA-2762 Capture short fields (<150 chars) in EnviParserHeader Metadata
(tallison: [https://github.com/apache/tika/commit/33d960c24dd9f7e97cd11f18295cfd347cfdb1d4])
* (edit) tika-core/src/main/java/org/apache/tika/config/TikaConfig.java
* (edit) tika-core/src/test/java/org/apache/tika/detect/MagicDetectorTest.java
* (edit) tika-core/src/test/java/org/apache/tika/detect/NameDetectorTest.java
* (edit) tika-parsers/src/main/java/org/apache/tika/parser/envi/EnviHeaderParser.java
* (add) tika-core/src/test/resources/test-documents/ang20150420t182050_corr_v1e_img.hdr
> Capture short fields (<150 chars) in EnviParserHeader Metadata
> --------------------------------------------------------------
>
> Key: TIKA-2762
> URL: https://issues.apache.org/jira/browse/TIKA-2762
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.19.1
> Reporter: Lewis John McGibbney
> Assignee: Lewis John McGibbney
> Priority: Major
> Fix For: 1.20
>
>
> I have always wanted to capture more metadata for the EnviHeader files. Right
> now everything is shoved into the record's content and I think we could
> improve it.
> I've implemented a rudimentary parser improvement which essentially captures
> any reasonably sized line items (<150 chars), which can then be populated up
> to the Metadata level, making faceted search over ENVI .hdr documents a much
> easier task.
> PR coming up.
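
For readers unfamiliar with the idea described in the issue, here is a minimal sketch of capturing short "key = value" lines from an ENVI header. It is not the code from the commits above: the class name EnviFieldSketch, the method shortFields, and the plain Map standing in for Tika's Metadata object are all hypothetical, and the handling of brace-delimited multi-line values is an assumption.

```java
import java.util.LinkedHashMap;
import java.util.Map;

public class EnviFieldSketch {

    // Hypothetical helper, not Tika's API: collects "key = value" lines
    // shorter than maxLen characters into an insertion-ordered map.
    public static Map<String, String> shortFields(String header, int maxLen) {
        Map<String, String> fields = new LinkedHashMap<>();
        for (String line : header.split("\\R")) {
            int eq = line.indexOf('=');
            if (eq <= 0 || line.length() >= maxLen) {
                continue; // not a key/value line, or too long to treat as a field
            }
            String key = line.substring(0, eq).trim();
            String value = line.substring(eq + 1).trim();
            // Assumption: skip values that open a brace block continued on later lines.
            if (key.isEmpty() || (value.startsWith("{") && !value.endsWith("}"))) {
                continue;
            }
            fields.put(key, value);
        }
        return fields;
    }

    public static void main(String[] args) {
        String header = "ENVI\nsamples = 598\nlines = 3041\ninterleave = bil\n";
        shortFields(header, 150).forEach((k, v) -> System.out.println(k + " -> " + v));
    }
}
```

In a real parser the resulting pairs would presumably be copied onto the document's metadata so that each field can be indexed and faceted separately, rather than left in the body text.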