dave2wave commented on issue #222:
URL: 
https://github.com/apache/tooling-trusted-releases/issues/222#issuecomment-3693425599

   The daily 
[cronjob](https://svn.apache.org/repos/asf/comdev/projects.apache.org/trunk/scripts/cronjobs/parsereleases.py)
 run to support `projects.apache.org` reads the `find_ls` file previously 
created on `downloads.apache.org` and classifies the current source releases. 
Look at:
   
   ```
   def cleanFilename(filename):
       """
           Attempts to determine the release id to which a file belongs
           Strips extensions such as .tgz etc, then suffixes such as -sources
           Replaces qualifiers such as -assembly- by '-'
           Returns the simplified filename .
   
           N.B. the ordering is significant, as the list is only scanned once
       """
       for suffix in ['-all', '-src', '_src', '.src', '-sources', '_sources', '-source', '-bin', '-dist',
                      '-source-release', '-source-relase', '-tests', '-test', '-debug', '-uber',
                      '-macosx', '-distribution', '-example', '-native', '-win', '-win32', '-linux', '-pack', '-packaged', '-current', '-embedded',
                      '-py', '-py2', '-py2.6', '-py2.7', '-no', 'unix-distro', 'windows-distro', 'with', '-dep', '-standalone', '-webapp', '-dom', '-om',
                      '-32bit', '-i386', '_i386', '.i386', '-minimal', '-jettyconfig', '-py2.py3-none-any', 'newkey', 'oldkey', 'jars', '-jre13', '-hadoop1', '-hadoop2', '-project',
                      '-with-dependencies', '-client', '-server', 'server-webapps', '-full', '-all', '-standard', '-for-javaee', '-for-tomcat',
                      'hadoop1-scala2', '-deployer', '-fulldocs', '-embed', '-apps', '-app', '-ref', '-installer', '-bundle', '-java']:
           # The above list could be simplified further
           if filename.endswith(suffix):
               filename = filename[0:len(filename)-len(suffix)]
       # COMDEV-535: No longer strip '-parent-', as that may be needed
       for repl in ['-assembly-', '-minimal-', '-doc-', '-src-', '-webapp-', '-standalone-', '-project-', '-win32-']:
           filename = filename.replace(repl, '-')
       return filename
   ```
   
   In this function you will find suffixes to consider for the "sources" rules. 
See also 
[`releases.json`](https://svn.apache.org/repos/asf/comdev/projects.apache.org/trunk/site/json/foundation/releases.json)
 and 
[`releases-files.json`](https://svn.apache.org/repos/asf/comdev/projects.apache.org/trunk/site/json/foundation/releases-files.json).
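
To illustrate how the suffix rules in `cleanFilename` behave, here is a minimal sketch. It is not the p.a.o script: `SUFFIXES` and `REPLACEMENTS` are deliberately trimmed copies of the lists quoted above, `clean_filename` is a hypothetical stand-in name, and the artifact names are made up, with the archive extension assumed to be already stripped.

```python
# Trimmed, illustrative copies of the lists in cleanFilename (assumption:
# only a handful of entries are kept to make the behaviour easy to follow).
SUFFIXES = ['-all', '-src', '_src', '.src', '-sources', '_sources',
            '-source', '-bin', '-dist', '-source-release']
REPLACEMENTS = ['-assembly-', '-minimal-', '-doc-', '-src-']


def clean_filename(filename):
    """Scan SUFFIXES once, in order, then collapse infix qualifiers to '-'."""
    for suffix in SUFFIXES:
        if filename.endswith(suffix):
            filename = filename[:len(filename) - len(suffix)]
    for repl in REPLACEMENTS:
        filename = filename.replace(repl, '-')
    return filename


# Hypothetical artifact names, extensions already removed.
examples = [
    'foo-1.2.3-src',
    'foo-1.2.3-bin',
    'foo-1.2.3-source-release',
    'foo-src-1.2.3',
    'foo-1.2.3-src-all',
    'foo-1.2.3-all-src',
]
for name in examples:
    print(name, '->', clean_filename(name))
```

With this trimmed list, the first five names all reduce to `foo-1.2.3`, but `foo-1.2.3-all-src` only reduces to `foo-1.2.3-all`: `-all` is tested before `-src` and the list is scanned just once. That is the ordering caveat from the docstring, and it appears to be why the full list quoted above contains `-all` a second time further down.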
   
   The weakness of this p.a.o code is that it does not identify sub-projects 
well (look at airflow_providers, beam, etc.), but it is close.
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

