[
https://issues.apache.org/jira/browse/TIKA-622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Burch updated TIKA-622:
----------------------------
Attachment: NPOIFS-3.8beta2.diff
The patch NPOIFS-3.8beta2.diff depends on the latest POI snapshot, so can't be
applied until 3.8 beta 2 is out. However, it allows all unit tests to pass when
using NPOIFS (and also includes TNEF which depends on 3.8 beta 2), so can be
applied as soon as the new POI release is out
> Switch from POIFSFileSystem to NPOIFSFileSystem, for speed and memory
> improvements
> ----------------------------------------------------------------------------------
>
> Key: TIKA-622
> URL: https://issues.apache.org/jira/browse/TIKA-622
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Reporter: Nick Burch
> Assignee: Nick Burch
> Priority: Minor
> Attachments: NPOIFS-3.8beta2.diff, NPOIFS.diff
>
>
> With POI 3.8 beta 1, there's an alternate OLE2 implementation
> NPOIFSFileSystem. From an API level, it's very similar to the existing
> POIFSFileSystem class, but internally it's lower memory and generally faster.
> While NPOIFSFileSystem is currently read only (the write support has a couple
> of gaps), for Tika's needs that isn't an issue
> We should therefore switch our uses of POIFSFileSystem to NPOIFSFileSystem to
> reduce memory and enhance speed.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira