Tim Allison created TIKA-3698:
---------------------------------
Summary: Duplicate subject/description for Outlook msgs
Key: TIKA-3698
URL: https://issues.apache.org/jira/browse/TIKA-3698
Project: Tika
Issue Type: Task
Reporter: Tim Allison
On TIKA-3629, despite our best efforts to simplify and streamline metadata
keys, we backed off and continued to include/added back keywords _and_ subject.
Another area where we should probably include both includes msg files.
POI's msg.getSubject() is going to "dc:title", and msg.getConversationTopic()
is going to "dc:description". Along the lines of what we did on TIKA-3629, I
propose adding msg.getConversationTopic() also under the key "dc:subject".
--
This message was sent by Atlassian Jira
(v8.20.1#820001)