This came up recently for me too. Same issue. I can maybe implement this
later today

On Mon, Mar 7, 2022, 3:08 PM Tim Allison <[email protected]> wrote:

> Yes please. We can add a limiting MetadataFilter.
>
> On Mon, Mar 7, 2022 at 8:39 AM Julien Massiera <
> [email protected]> wrote:
>
> > Hi Tim,
> >
> >
> >
> > We identified cases where pdf files may contain abnormaly big metadata
> > (several MB, be it for the metadata values, the metadata names, but also
> > for
> > the total amount of metadata). Some time ago, I proposed the creation of
> a
> > "writeLimit" header in Tika Server (and you accepted to implement it,
> > thanks
> > for that) on the /rmeta endpoint. We think it would make sense to have an
> > equivalent for the metadata content, e.g. to avoid potential OOMs.
> >
> > Would you think it is worth it that I create a ticket for this feature ?
> >
> >
> >
> > Regards,
> >
> > Julien
> >
> >
> >
> >
> >
> >
>

Reply via email to