How about simple password protection? This would be helpful during regression tests.

Tilman

-- Original message --
From: Andreas Lehmkühler <andr...@lehmi.de.invalid>
Subject: Re: Turning off public access to the regression corpora?
Date: 09.01.2025, 07:53
To: corpora-dev@tika.apache.org

Hi,

I agree with Maruan. :-(

Just out of curiosity, the original source of those files is some public webserver, isn't it?

Andreas

On 09.01.25 at 05:27, Maruan Sahyoun wrote:
> Hi,
>
> this is unfortunate, but as this poses a risk of legal action to the
> ASF and also to me as the host of the site, I think we should stop that.
>
> BR
> Maruan
>
>> On 09.01.2025 at 02:37, Tim Allison <talli...@apache.org> wrote:
>>
>> All,
>> We've gotten a handful of takedown requests recently. I had initially
>> envisioned public sharing of files as a key component of our server. We can
>> still use the files and offer read access to fellow file researchers. I'm
>> not sure I want to deal with further takedown requests.
>> As an intermediate step, we could ask robots not to crawl the data, but
>> that's not reliable.
>> So, in lieu of that, with a heavy heart, I ask if it is time to close off
>> public access?
>> WDYT?
>>
>> Best,
>>
>> Tim