Files are updated under https://corpora.tika.apache.org/base/docs/bug_trackers/
I updated the README: https://corpora.tika.apache.org/base/docs/bug_trackers/README.txt Let me know if you find any surprises. On Fri, Nov 6, 2020 at 10:05 AM Tim Allison <talli...@apache.org> wrote: > All, > With many thanks to Apache's infra, I was unbanned after a few too many > requests to Apache's JIRA/bugzilla. > I'm currently doing some post processing cleanup on the refreshed > corpus. I'm planning to remove .diff files and zero-byte files. If there > are any objections, let me know soon. > Thank you, all. > > Best, > > Tim > >