Hello!

11.11.2010 17:05, Jukka Zitting пишет:
Log:
Extract interface for EmbeddedDocumentExtractor

We have POI-based utility that extracts all embedded files (attachments, pictures and etc) from different file formats. This utility takes arbitrary file and returns ZIP-archive with all attachments.

This utility duplicates functionality of embedded file processing in Tika. I'm trying to convert my tool from POI to Tika. I think this will make better both my unpacker utility and Tika.

I needed a way to replace concrete implementation of EmbeddedDocumentExtractor with my own implementation that copies attachments outside of ContentHandler, so I splitted interface and implementation of that class.

It would be good if all non-trivial commit messages contained a
reference to a relevant issue in Jira for better context of why
particular changes are being made.

Nick correctly noted earlier that we should write such conventions up
somewhere. I'll try to find time to draft something for review.

So I need to create JIRA issue before commit?

best wishes, Max

Reply via email to