This is an automated email from the ASF dual-hosted git repository.
tallison pushed a change to branch
TIKA-4710-rtf-attachments-in-html-decapsulation
in repository https://gitbox.apache.org/repos/asf/tika.git
from 57e18b6a8d TIKA-4710 -- extract RTF attachments during html
decapsulation in msgs
add 23b177958d TIKA-4710 -- fix ups
No new revisions were added by this update.
Summary of changes:
.../tika/parser/microsoft/OfficeParserConfig.java | 20 +++
.../tika/parser/microsoft/OutlookExtractor.java | 3 +-
.../microsoft/rtf/jflex/RTFEmbeddedHandler.java | 169 +++++----------------
.../microsoft/rtf/jflex/RTFHtmlDecapsulator.java | 105 ++++---------
.../rtf/jflex/RTFObjDataStreamParser.java | 100 +++++-------
.../microsoft/rtf/jflex/RTFPictStreamParser.java | 41 +++--
.../tika/parser/microsoft/rtf/jflex/RTFState.java | 8 +-
.../tika/parser/microsoft/rtf/jflex/RTFToken.java | 30 +++-
.../parser/microsoft/rtf/jflex/RTFTokenizer.jflex | 9 +-
.../rtf/jflex/RTFHtmlDecapsulatorTest.java | 47 +++---
.../parser/microsoft/rtf/jflex/RTFStateTest.java | 14 +-
.../microsoft/rtf/jflex/RTFTokenizerTest.java | 24 +--
12 files changed, 236 insertions(+), 334 deletions(-)