Hi Tim,

 

We identified cases where pdf files may contain abnormaly big metadata
(several MB, be it for the metadata values, the metadata names, but also for
the total amount of metadata). Some time ago, I proposed the creation of a
"writeLimit" header in Tika Server (and you accepted to implement it, thanks
for that) on the /rmeta endpoint. We think it would make sense to have an
equivalent for the metadata content, e.g. to avoid potential OOMs.

Would you think it is worth it that I create a ticket for this feature ?

 

Regards,

Julien 

 

 

Reply via email to