[ https://issues.apache.org/jira/browse/TIKA-3695?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17510719#comment-17510719 ]
Hudson commented on TIKA-3695:
------------------------------
UNSTABLE: Integrated in Jenkins build Tika » tika-main-jdk8 #497 (See
[https://ci-builds.apache.org/job/Tika/job/tika-main-jdk8/497/])
TIKA-3695 -- further refactorings to add limits on key sizes and per field
limits. (tallison:
[https://github.com/apache/tika/commit/b2e18d38854d12ff16278e63b22085ce7d050416])
* (add) tika-core/src/main/java/org/apache/tika/metadata/writefilter/MetadataWriteFilter.java
* (delete) tika-core/src/test/java/org/apache/tika/metadata/MetadataWriteFilterTest.java
* (delete) tika-core/src/main/java/org/apache/tika/metadata/MetadataWriteFilter.java
* (add) tika-core/src/test/java/org/apache/tika/metadata/writefilter/MetadataWriteFilterTest.java
* (edit) tika-core/src/test/resources/org/apache/tika/config/TIKA-3695.xml
* (add) tika-core/src/main/java/org/apache/tika/metadata/writefilter/StandardWriteFilter.java
* (add) tika-core/src/main/java/org/apache/tika/metadata/writefilter/StandardWriteFilterFactory.java
* (edit) tika-core/src/main/java/org/apache/tika/parser/AutoDetectParserConfig.java
* (add) tika-core/src/main/java/org/apache/tika/metadata/writefilter/MetadataWriteFilterFactory.java
* (delete) tika-core/src/main/java/org/apache/tika/metadata/StandardWriteFilterFactory.java
* (delete) tika-core/src/main/java/org/apache/tika/metadata/MetadataWriteFilterFactory.java
* (edit) tika-core/src/test/resources/org/apache/tika/config/TIKA-3695-fields.xml
* (edit) tika-core/src/main/java/org/apache/tika/metadata/Metadata.java
* (delete) tika-core/src/main/java/org/apache/tika/metadata/StandardWriteFilter.java
* (edit) tika-core/src/main/java/org/apache/tika/metadata/TikaCoreProperties.java
> LimitingMetadataFilter
> ----------------------
>
> Key: TIKA-3695
> URL: https://issues.apache.org/jira/browse/TIKA-3695
> Project: Tika
> Issue Type: New Feature
> Components: metadata
> Affects Versions: 1.28.1, 2.3.0
> Reporter: Julien Massiera
> Priority: Major
> Fix For: 2.4.0
>
> Attachments: huge-title.docx, tika-config.xml
>
>
> Some files may contain abnormally large metadata (several MB, whether in
> the metadata values, the metadata names, or the total amount of metadata),
> which can cause excessive memory consumption.
> It would be great to develop a new LimitingMetadataFilter so that metadata
> can be filtered according to separate byte limits (on metadata names,
> metadata values, and the total amount of metadata).
>
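
For illustration only, the three limits requested in the description (per
name, per value, and total) could be sketched roughly as below. This is not
the code from the commit and does not use Tika's MetadataWriteFilter API: the
class LimitingMetadataSketch, its methods, and the choice to truncate names
and values while rejecting writes that exceed the total budget are all
assumptions made for this example.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch: NOT Tika's API, only an illustration of byte limits.
final class LimitingMetadataSketch {
    private final int maxNameBytes;   // limit on each metadata name
    private final int maxValueBytes;  // limit on each metadata value
    private final int maxTotalBytes;  // limit on everything stored
    private final Map<String, List<String>> fields = new LinkedHashMap<>();
    private int totalBytes = 0;

    LimitingMetadataSketch(int maxNameBytes, int maxValueBytes, int maxTotalBytes) {
        this.maxNameBytes = maxNameBytes;
        this.maxValueBytes = maxValueBytes;
        this.maxTotalBytes = maxTotalBytes;
    }

    // Truncates name and value to their limits, then stores the pair only if
    // the total budget allows it. Returns false when the write is dropped.
    // Simplification: the name is charged on every add, even if it repeats.
    boolean add(String name, String value) {
        String n = truncate(name, maxNameBytes);
        String v = truncate(value, maxValueBytes);
        int cost = utf8Length(n) + utf8Length(v);
        if (totalBytes + cost > maxTotalBytes) {
            return false;
        }
        fields.computeIfAbsent(n, k -> new ArrayList<>()).add(v);
        totalBytes += cost;
        return true;
    }

    List<String> get(String name) {
        List<String> values = fields.get(name);
        return values == null ? Collections.<String>emptyList() : values;
    }

    int totalBytes() {
        return totalBytes;
    }

    static int utf8Length(String s) {
        return s.getBytes(StandardCharsets.UTF_8).length;
    }

    // Keeps the longest prefix that fits in maxBytes of UTF-8 without
    // splitting a code point.
    static String truncate(String s, int maxBytes) {
        int bytes = 0;
        int i = 0;
        while (i < s.length()) {
            int cp = s.codePointAt(i);
            int cpBytes = cp < 0x80 ? 1 : cp < 0x800 ? 2 : cp < 0x10000 ? 3 : 4;
            if (bytes + cpBytes > maxBytes) {
                break;
            }
            bytes += cpBytes;
            i += Character.charCount(cp);
        }
        return s.substring(0, i);
    }
}
```

Counting UTF-8 bytes rather than Java chars keeps the limits closer to the
memory and wire size the issue is concerned about; a real filter might
instead estimate size from the string length to avoid re-encoding each value.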