[
https://issues.apache.org/jira/browse/TIKA-4693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arpit updated TIKA-4693:
------------------------
Summary: TikaFileMetadata shows wrong data for "dc:subject"metadata
properties for Doc and docx (was: TikaFileMetadata shows wrong data in xmp
for "dc:subject" tag for Doc and docx)
> TikaFileMetadata shows wrong data for "dc:subject"metadata properties for
> Doc and docx
> ----------------------------------------------------------------------------------------
>
> Key: TIKA-4693
> URL: https://issues.apache.org/jira/browse/TIKA-4693
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 3.2.3
> Reporter: Arpit
> Priority: Major
>
> Currently we are using <tika-core.version>3.2.3</tika-core. Version> , where
> we are seeing for subject attribute both subject and keywords are being
> returned instead of returning on subject for doc and docx files
> This is the metadata attribute (dc:subject) which we are using for fetching
> subject and it return both subject + keyword
> Able to see one more issue related to same which is in resolved state for PDF
> File https://issues.apache.org/jira/browse/TIKA-4444
--
This message was sent by Atlassian Jira
(v8.20.10#820010)