[
https://issues.apache.org/jira/browse/TIKA-1380?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14086189#comment-14086189
]
Tim Allison commented on TIKA-1380:
-----------------------------------
Something along these lines:
{noformat}
if (type == POIFSDocumentType.OLE10_NATIVE) {
try {
// Try to un-wrap the OLE10Native record:
Ole10Native ole =
Ole10Native.createFromEmbeddedOleObject((DirectoryNode)dir);
String label = ole.getLabel();
if (label == null){
label = "unknown_"+unknownFileNum++;
}
metadata.set(Metadata.RESOURCE_NAME_KEY, dir.getName() +
'/' + label);
{noformat}
yields a RESOURCE_NAME_KEY of:
MBD002B0FA6/unknown_0
Or, if label is null, do we just want to use the directory name? Perhaps with
an unknownFileNum appended to prevent clashes?
> Upgrade to Apache POI 3.11 beta 1
> ---------------------------------
>
> Key: TIKA-1380
> URL: https://issues.apache.org/jira/browse/TIKA-1380
> Project: Tika
> Issue Type: Improvement
> Components: parser
> Affects Versions: 1.6
> Reporter: Nick Burch
> Fix For: 1.6, 1.7
>
> Attachments: TIKA-1380.patch, TIKA-1380b.patch, TIKA-1380c.patch,
> tika-commentstable-missing.diff
>
>
> All being well, in a week there'll be a new release of Apache POI available,
> 3.11 beta 1
> This issue is to track the upgrade, any required changes, and fixing any
> TODOs that this upgrade permits
--
This message was sent by Atlassian JIRA
(v6.2#6252)