[ https://issues.apache.org/jira/browse/LUCENE-7696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15868652#comment-15868652 ]
Jan Høydahl edited comment on LUCENE-7696 at 2/15/17 11:29 PM: --------------------------------------------------------------- Important to note that the mirrors and the main Apache dist site http://www.apache.org/dist/lucene/ have a {{.htaccess}} redirect for mahout, nutch and tika, and do not contain hadoop at all. So it is only those that dig for archived versions inside dist/lucene that will ever land here, no route from the TLP sites... The Hadoop TLP issue is https://issues.apache.org/jira/browse/INFRA-1477 The Mahout TLP issue is https://issues.apache.org/jira/browse/INFRA-2643 The Tika TLP issue is https://issues.apache.org/jira/browse/INFRA-2692 but it does not mention archives The Nutch TLP issue is https://issues.apache.org/jira/browse/INFRA-2693, no discussion about archives Suggest I start by sending an email to dev@tika *(DONE)*, then see what they say before we tackle the other projects. was (Author: janhoy): Important to note that the mirrors and the main Apache dist site http://www.apache.org/dist/lucene/ have a {{.htaccess}} redirect for mahout, nutch and tika, and do not contain hadoop at all. So it is only those that dig for archived versions inside dist/lucene that will ever land here, no route from the TLP sites... The Hadoop TLP issue is https://issues.apache.org/jira/browse/INFRA-1477 The Mahout TLP issue is https://issues.apache.org/jira/browse/INFRA-2643 The Tika TLP issue is https://issues.apache.org/jira/browse/INFRA-2692 but it does not mention archives The Nutch TLP issue is https://issues.apache.org/jira/browse/INFRA-2693, no discussion about archives Suggest I start by sending an email to private@ for Tika, then see what they say before we tackle the other projects. > Remove ancient projects from the dist area > ------------------------------------------ > > Key: LUCENE-7696 > URL: https://issues.apache.org/jira/browse/LUCENE-7696 > Project: Lucene - Core > Issue Type: Task > Components: general/website > Reporter: Jan Høydahl > Labels: archive, dist, download > > In https://archive.apache.org/dist/lucene/ we have these folders: > {noformat} > [DIR] hadoop/ 2008-01-22 23:40 - > [DIR] java/ 2017-02-14 08:33 - > [DIR] mahout/ 2015-02-17 20:27 - > [DIR] nutch/ 2015-02-17 20:29 - > [DIR] pylucene/ 2017-02-13 22:00 - > [DIR] solr/ 2017-02-14 08:33 - > [DIR] tika/ 2015-02-17 20:29 - > [ ] KEYS 2016-08-30 09:59 148K > {noformat} > Nobody will expect to find hadoop, mahout, nutch and tika here anymore, so > why not clean up? > I double checked, and both https://archive.apache.org/dist/hadoop/core/ and > https://archive.apache.org/dist/mahout/ have a full copy of all releases, so > we lose nothing. > For https://archive.apache.org/dist/nutch/, they do not have 0.6-0.8 releases > that we have under lucene, and https://archive.apache.org/dist/tika/ do not > have v0.2-0.7 that only exists with us. For these two projects we could ask > their PMC to copy over the early versions and then we nuk'em? > Any other reason to keep these in the lucene area? -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org