dave2wave commented on issue #222: URL: https://github.com/apache/tooling-trusted-releases/issues/222#issuecomment-3693425599
The daily [cronjob](https://svn.apache.org/repos/asf/comdev/projects.apache.org/trunk/scripts/cronjobs/parsereleases.py) run to support `projects.apache.org` uses previously created on `downloads.apache.org` `find_ls` file and classifies current source releases. By looking at: ``` def cleanFilename(filename): """ Attempts to determine the release id to which a file belongs Strips extensions such as .tgz etc, then suffixes such as -sources Replaces qualifiers such as -assembly- by '-' Returns the simplified filename . N.B. the ordering is significant, as the list is only scanned once """ for suffix in ['-all', '-src', '_src', '.src', '-sources', '_sources', '-source', '-bin', '-dist', '-source-release', '-source-relase', '-tests', '-test', '-debug', '-uber', '-macosx', '-distribution', '-example', '-native', '-win', '-win32', '-linux', '-pack', '-packaged', '-current', '-embedded', '-py', '-py2', '-py2.6', '-py2.7', '-no', 'unix-distro', 'windows-distro', 'with', '-dep', '-standalone', '-webapp', '-dom', '-om', '-32bit', '-i386', '_i386', '.i386', '-minimal', '-jettyconfig', '-py2.py3-none-any', 'newkey', 'oldkey', 'jars', '-jre13', '-hadoop1', '-hadoop2', '-project', '-with-dependencies', '-client', '-server', 'server-webapps', '-full', '-all', '-standard', '-for-javaee', '-for-tomcat', 'hadoop1-scala2', '-deployer', '-fulldocs', '-embed', '-apps', '-app', '-ref', '-installer', '-bundle', '-java']: # The above list could be simplified further if filename.endswith(suffix): filename = filename[0:len(filename)-len(suffix)] # COMDEV-535: No longer strip '-parent-', as that may be needed for repl in ['-assembly-', '-minimal-', '-doc-', '-src-', '-webapp-', '-standalone-', '-project-', '-win32-']: filename = filename.replace(repl, '-') return filename ``` You will find suffixes to consider "sources" rules. [`releases.json`](https://svn.apache.org/repos/asf/comdev/projects.apache.org/trunk/site/json/foundation/releases.json) and [`releases-files.json`](https://svn.apache.org/repos/asf/comdev/projects.apache.org/trunk/site/json/foundation/releases-files.json) The weakness in this p.a.o code is that it does not do well in identifying sub-projects. Look at airflow_providers, beam, etc, but this is close. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
