Re: [jira] [Commented] (RAT-363) Introduce project description
I would like to see us try to develop DOAP maintenance software. Once we identify sources for the values in the DOAP, it should be doable, possibly under a different Creadur project.

On Sat, May 4, 2024 at 5:10 PM Claude Warren (Jira) wrote:

> [ https://issues.apache.org/jira/browse/RAT-363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843445#comment-17843445 ]
>
> Claude Warren commented on RAT-363:
> -----------------------------------
>
> DOAP schema is available at https://github.com/ewilderj/doap
>
> > Introduce project description
> > -----------------------------
> >
> >                 Key: RAT-363
> >                 URL: https://issues.apache.org/jira/browse/RAT-363
> >             Project: Apache Rat
> >          Issue Type: Task
> >            Reporter: Philipp Ottlinger
> >            Priority: Major
> >             Fix For: 0.17
> >
> > Create a project description (DOAP) - possible templates can be derived from:
> > * [https://github.com/apache/juneau/blob/master/doap_Juneau.rdf]
> > * [https://github.com/apache/airflow/blob/main/doap_airflow.rdf]
> > * [https://github.com/apache/kafka/blob/trunk/doap_Kafka.rdf]
>
> --
> This message was sent by Atlassian Jira
> (v8.20.10#820010)

--
LinkedIn: http://www.linkedin.com/in/claudewarren
Re: Web site contents
I was only looking at a small part of the documentation. I was hoping that there is a tag in the apt or vm processing that would allow us to insert text. We could then figure out how to load that text from the application itself.

There are several outputs that would be good to generate. I have also thought about how to revamp the entire process of building and maintaining the documentation, but I am not a documentation expert, so I am hoping we can find someone with the experience to assist.

On Sat, May 4, 2024 at 9:11 PM P. Ottlinger wrote:

> Hi,
>
> On 04.05.24 at 12:40, Claude Warren wrote:
> > Is there a way to execute a Java application, capture the output, and
> > insert it into an index.apt.vm file? If so I would like to generate the
> > help for the CLI and insert it during the build process.
>
> As there are so many XML/TXT/maven-site files with the help contents, I
> thought of generating these files as part of the build.
>
> Is that the same as what you intended to do?
>
> I'm not sure whether this could be accomplished via an antrun or
> maven-exec step in the site build:
> https://www.mojohaus.org/exec-maven-plugin/usage.html
>
> If I understood you correctly, we would like to call the "--help" option
> and pipe its output to a file that is included in the site build, right?
>
> I filed
> https://issues.apache.org/jira/browse/RAT-374
> for the issue.
>
> Thanks for bringing it up!
>
> Cheers,
> Phil
[jira] [Updated] (RAT-369) Add SpotBugs and checkstyle to the pom files.
[ https://issues.apache.org/jira/browse/RAT-369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Philipp Ottlinger updated RAT-369:
----------------------------------
    Description:
Currently we have a number of issues that SpotBugs will detect. I ran a quick test this afternoon and it found 60 potential bugs. This ticket is to add SpotBugs to the Maven build, ensure that no bugs are found, and ensure that the build will fail if bugs are found.
Add checkstyle as well.
As most of the reported errors cannot be fixed with JDK8, add exclusions for these "old"/JDK8 classes that do not allow setting a proper encoding and thus rely on the default system encoding.

  was:
Currently we have a number of issues that SpotBugs will detect. I ran a quick test this afternoon and it found 60 potential bugs. This ticket is to add SpotBugs to the Maven build, ensure that no bugs are found, and ensure that the build will fail if bugs are found.

> Add SpotBugs and checkstyle to the pom files.
> ---------------------------------------------
>
>                 Key: RAT-369
>                 URL: https://issues.apache.org/jira/browse/RAT-369
>             Project: Apache Rat
>          Issue Type: Improvement
>          Components: Client - ant, Client - cli, Client - maven, core engine
>            Reporter: Claude Warren
>            Assignee: Philipp Ottlinger
>            Priority: Major
>             Fix For: 0.17
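The encoding remark in the description can be illustrated with a small Java sketch. It assumes the findings are of the kind SpotBugs reports as reliance on the default encoding (DM_DEFAULT_ENCODING); the ticket itself does not name the bug pattern. The class and method names are hypothetical and not taken from RAT.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper for illustration only; not part of RAT.
final class EncodingExample {

    /**
     * Reads the first line of a file as UTF-8.
     * On JDK 8, new FileReader(File) has no charset parameter and always uses
     * the platform default, so the charset is passed explicitly through
     * InputStreamReader instead.
     */
    public static String firstLineUtf8(Path file) throws IOException {
        try (Reader in = new InputStreamReader(Files.newInputStream(file), StandardCharsets.UTF_8);
             BufferedReader reader = new BufferedReader(in)) {
            return reader.readLine();
        }
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("encoding-example", ".txt");
        try {
            // Two CJK characters, written as escapes so the source file stays ASCII.
            Files.write(tmp, "// \u4e2d\u6587".getBytes(StandardCharsets.UTF_8));
            System.out.println(firstLineUtf8(tmp));
        } finally {
            Files.delete(tmp);
        }
    }
}
```

Where a JDK 8 class cannot be swapped for a charset-aware alternative like this, an exclusion in the SpotBugs configuration, as the ticket proposes, is the remaining option.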
[jira] [Updated] (RAT-374) Automatically output RAT CLI's help options during the build and add it to the project webpage
[ https://issues.apache.org/jira/browse/RAT-374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Philipp Ottlinger updated RAT-374: -- Fix Version/s: 0.17 > Automatically output RAT CLI's help options during the build and add it to > the project webpage > -- > > Key: RAT-374 > URL: https://issues.apache.org/jira/browse/RAT-374 > Project: Apache Rat > Issue Type: Improvement >Affects Versions: 0.16.1 >Reporter: Philipp Ottlinger >Priority: Major > Fix For: 0.17 > > > At the moment there are plenty of places that need to be changed when a new > CLI option is added or an existing one is changed, e.g. > * https://github.com/apache/creadur-rat/blob/master/apache-rat/README-CLI.txt > * https://github.com/apache/creadur-rat/blob/master/apache-rat/README-ANT.txt > * > https://github.com/apache/creadur-rat/blob/master/apache-rat/src/site/apt/index.apt.vm > Generating the current version of CLI options directly from RAT's sources > would be much easier for further development and to keep the docs in sync > with the code: > https://creadur.apache.org/rat/apache-rat/index.html > Not sure if this task can be accomplished with the help of > * maven-antrun > * maven-exec > or other means, but the desired state is less manual change in case of > CLI/option changes. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (WHISKER-24) Introduce project description
[ https://issues.apache.org/jira/browse/WHISKER-24?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Philipp Ottlinger updated WHISKER-24: - Description: Create a project description (DOAP) - possible templates can be derived from: * [https://github.com/apache/juneau/blob/master/doap_Juneau.rdf] * [https://github.com/apache/airflow/blob/main/doap_airflow.rdf] * [https://github.com/apache/kafka/blob/trunk/doap_Kafka.rdf] was: Create a project description (DOAF) - possible templates can be derived from: * [https://github.com/apache/juneau/blob/master/doap_Juneau.rdf] * [https://github.com/apache/airflow/blob/main/doap_airflow.rdf] * [https://github.com/apache/kafka/blob/trunk/doap_Kafka.rdf] > Introduce project description > - > > Key: WHISKER-24 > URL: https://issues.apache.org/jira/browse/WHISKER-24 > Project: Apache Whisker > Issue Type: Task >Reporter: Philipp Ottlinger >Priority: Major > > Create a project description (DOAP) - possible templates can be derived from: > * [https://github.com/apache/juneau/blob/master/doap_Juneau.rdf] > * [https://github.com/apache/airflow/blob/main/doap_airflow.rdf] > * [https://github.com/apache/kafka/blob/trunk/doap_Kafka.rdf] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (RAT-150) RAT should use Apache Tika to simply guess ignored [application/X] file types and focus on the [text/Y] family as a sensible default
[ https://issues.apache.org/jira/browse/RAT-150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843497#comment-17843497 ] ASF subversion and git services commented on RAT-150: - Commit 46eaed82aeea3ae17a03da30875e1fce789d375b in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=46eaed82 ] RAT-150: Code cleanup > RAT should use Apache Tika to simply guess ignored [application/X] file types > and focus on the [text/Y] family as a sensible default > > > Key: RAT-150 > URL: https://issues.apache.org/jira/browse/RAT-150 > Project: Apache Rat > Issue Type: New Feature > Components: mime-meta-data, scan >Affects Versions: 0.8 >Reporter: Chris A. Mattmann >Assignee: Claude Warren >Priority: Major > Fix For: 0.17 > > > RAT could use Apache Tika to automatically guess file types, obviating the > need to specify an explicit white list or black list. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843496#comment-17843496 ]

ASF subversion and git services commented on RAT-301:
-----------------------------------------------------

Commit 1b919deed53553ae1d5eb08808c13727af15a335 in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger
[ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=1b919dee ]

RAT-301: Fix test setup after renaming file

> Rat check file identification error,java files with Chinese characters are recognized as binary files
> ------------------------------------------------------------------------------------------------------
>
>                 Key: RAT-301
>                 URL: https://issues.apache.org/jira/browse/RAT-301
>             Project: Apache Rat
>          Issue Type: Bug
>    Affects Versions: 0.13
>         Environment: Window
> Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; 2015-11-11T00:41:47+08:00)
>            Reporter: Chen Xia
>            Assignee: Claude Warren
>            Priority: Major
>             Fix For: 0.17
>
> {code:java}
> // code placeholder
> org.apache.rat
> apache-rat-plugin
> 0.13
> rat-validate
> validate
> check
> **/*.versionsBackup
> **/.idea/
> **/*.iml
> **/*.txt
> **/*.json
> web/.editorconfig
> web/.env
> web/.eslintignore
> web/.jshintrc
> web/public/favicon.ico
> web/dist/**
> web/node_modules/**
> web/apache-linkis-*-web-bin.tar.gz
> **/*.md
> .git/
> .gitignore
> **/.settings/*
> **/.classpath
> **/.project
> **/target/**
> **/out/**
> **/*.log
> CONTRIBUTING.md
> CONTRIBUTING_CN.md
> DISCLAIMER
> DISCLAIMER
> README.md
> **/META-INF/**
> .github/**
> compiler/**
> **/generated/**
> {code}
> This is the result of {{mvn apache-rat:check}}
> {code:java}
> Summary
> ---
> Generated at: 2022-05-06T09:56:39+08:00
> Notes: 0
> Binaries: 1
> Archives: 0
> Standards: 13
> Apache Licensed: 13
> Generated Documents: 0
> JavaDocs are generated, thus a license header is optional.
> Generated files do not require license headers.
> 0 Unknown Licenses
> *
> Files with Apache License headers will be marked AL
> Binary files (which do not require any license headers) will be marked B
> B  D:/DataSphere/linkis_svn/1.1.1-RC1/apache-linkis-1.1.1-incubating-src/apache-linkis-1.1.1-incubating-src/linkis-public-enhancements/linkis-publicservice/linkis-udf/linkis-udf-common/src/main/java/org/apache/linkis/udf/entity/UDFVersion.java
> AL D:/DataSphere/linkis_svn/1.1.1-RC1/apache-linkis-1.1.1-incubating-src/apache-linkis-1.1.1-incubating-src/linkis-public-enhancements/linkis-publicservice/linkis-udf/linkis-udf-common/src/main/java/org/apache/linkis/udf/excepiton/UDFException.java
>
> *
> {code}
> UDFVersion.java is recognized as a binary file
> source code: https://github.com/casionone/incubator-linkis/tree/dev-1.1.1-rat
[jira] [Commented] (RAT-54) MIME Detection Using Tika
[ https://issues.apache.org/jira/browse/RAT-54?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843498#comment-17843498 ] ASF subversion and git services commented on RAT-54: Commit 8fcb1cf497701ca0a3d12ebe61a2abbb516f6ac6 in creadur-rat's branch refs/heads/feature/RAT-369 from Claude Warren [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=8fcb1cf4 ] Merge pull request #240 from Claudenw/tika_based_document_analyzer RAT-54: Tika based document analyzer > MIME Detection Using Tika > - > > Key: RAT-54 > URL: https://issues.apache.org/jira/browse/RAT-54 > Project: Apache Rat > Issue Type: New Feature >Affects Versions: 0.7 >Reporter: Robert Burrell Donkin >Assignee: Claude Warren >Priority: Major > Fix For: 0.17 > > > Tika provides sophisticated and comprehensive MIME detection. Add support for > a Tika based implementation. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843494#comment-17843494 ]

ASF subversion and git services commented on RAT-301:
-----------------------------------------------------

Commit d0534b8b14769e5e2ae610884ba458426b006d73 in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger
[ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=d0534b8b ]

RAT-301: Change example to non-meaningful Java to not find this class in IDE searches in RAT's codebase
[jira] [Commented] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843495#comment-17843495 ]

ASF subversion and git services commented on RAT-301:
-----------------------------------------------------

Commit e11553d30d531e8fbf6b4b988c45dc82b35cfae6 in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger
[ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=e11553d3 ]

RAT-301: Add changelog after integrating the file as test
[jira] [Commented] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843493#comment-17843493 ]

ASF subversion and git services commented on RAT-301:
-----------------------------------------------------

Commit e84ebdb739b5fabb0bb370c94edb5a90aecbd131 in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger
[ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=e84ebdb7 ]

RAT-301: Ensure Chinese characters are not categorized as binary
Re: Web site contents
Hi,

On 04.05.24 at 12:40, Claude Warren wrote:
> Is there a way to execute a Java application, capture the output, and
> insert it into an index.apt.vm file? If so I would like to generate the
> help for the CLI and insert it during the build process.

As there are so many XML/TXT/maven-site files with the help contents, I thought of generating these files as part of the build.

Is that the same as what you intended to do?

I'm not sure whether this could be accomplished via an antrun or maven-exec step in the site build:
https://www.mojohaus.org/exec-maven-plugin/usage.html

If I understood you correctly, we would like to call the "--help" option and pipe its output to a file that is included in the site build, right?

I filed
https://issues.apache.org/jira/browse/RAT-374
for the issue.

Thanks for bringing it up!

Cheers,
Phil
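The "call --help and pipe its output to a file" idea from this message could be sketched in Java as follows. This is only a sketch under assumptions: the class HelpCapture, its method names, and the printed usage line are hypothetical, and the lambda stands in for whatever RAT entry point would actually print the help text, which the thread does not show.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.PrintStream;
import java.io.UnsupportedEncodingException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper for illustration only; not part of RAT.
final class HelpCapture {

    /** Runs the action with System.out redirected and returns what it printed. */
    public static String captureStdout(Runnable action) {
        PrintStream original = System.out;
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        // The String-based constructor is used because it also exists on JDK 8.
        try (PrintStream capture = new PrintStream(buffer, true, "UTF-8")) {
            System.setOut(capture);
            action.run();
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException("UTF-8 is always supported", e);
        } finally {
            // Always restore the real stdout, even if the action throws.
            System.setOut(original);
        }
        return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for invoking the real CLI with "--help" (assumption).
        String help = captureStdout(() -> System.out.println("usage: hypothetical-cli [options]"));
        if (args.length > 0) {
            // Write the captured text to the file named on the command line.
            Files.write(Paths.get(args[0]), help.getBytes(StandardCharsets.UTF_8));
        } else {
            System.out.print(help);
        }
    }
}
```

A build step such as the exec-maven-plugin mentioned above might run a class like this before the site is generated, so that the written file can be included in the site sources. How the text would be pulled into index.apt.vm is left open here, as it is in the thread.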
[jira] [Created] (RAT-374) Automatically output RAT CLI's help options during the build and add it to the project webpage
Philipp Ottlinger created RAT-374: - Summary: Automatically output RAT CLI's help options during the build and add it to the project webpage Key: RAT-374 URL: https://issues.apache.org/jira/browse/RAT-374 Project: Apache Rat Issue Type: Improvement Affects Versions: 0.16.1 Reporter: Philipp Ottlinger At the moment there are plenty of places that need to be changed when a new CLI option is added or an existing one is changed, e.g. * https://github.com/apache/creadur-rat/blob/master/apache-rat/README-CLI.txt * https://github.com/apache/creadur-rat/blob/master/apache-rat/README-ANT.txt * https://github.com/apache/creadur-rat/blob/master/apache-rat/src/site/apt/index.apt.vm Generating the current version of CLI options directly from RAT's sources would be much easier for further development and to keep the docs in sync with the code: https://creadur.apache.org/rat/apache-rat/index.html Not sure if this task can be accomplished with the help of * maven-antrun * maven-exec or other means, but the desired state is less manual change in case of CLI/option changes. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (RAT-369) Add SpotBugs and checkstyle to the pom files.
[ https://issues.apache.org/jira/browse/RAT-369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Philipp Ottlinger updated RAT-369: -- Summary: Add SpotBugs and checkstyle to the pom files. (was: Add SpotBugs to the pom files.) > Add SpotBugs and checkstyle to the pom files. > - > > Key: RAT-369 > URL: https://issues.apache.org/jira/browse/RAT-369 > Project: Apache Rat > Issue Type: Improvement > Components: Client - ant, Client - cli, Client - maven, core engine >Reporter: Claude Warren >Assignee: Philipp Ottlinger >Priority: Major > Fix For: 0.17 > > > Currently we have a number of issues that Spotbugs will detect. I ran a > quick test this afternoon and it found 60 potential bugs. This ticket is to > add spotbugs to the maven build, ensure that no bugs are found and ensure > that the build will fail if bugs are found. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Resolved] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Claude Warren resolved RAT-301.
-------------------------------
    Resolution: Fixed

Resolved with pull request #240 https://github.com/apache/creadur-rat/pull/240
[jira] [Resolved] (RAT-211) Generated rat-output.xml must be well-formed, even if BinaryGuesser fails
[ https://issues.apache.org/jira/browse/RAT-211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Claude Warren resolved RAT-211.
-------------------------------
    Resolution: Fixed

Resolved with pull request #240 https://github.com/apache/creadur-rat/pull/240

> Generated rat-output.xml must be well-formed, even if BinaryGuesser fails
> -------------------------------------------------------------------------
>
>                 Key: RAT-211
>                 URL: https://issues.apache.org/jira/browse/RAT-211
>             Project: Apache Rat
>          Issue Type: Bug
>            Reporter: Konstantin Kolinko
>            Assignee: Claude Warren
>            Priority: Major
>             Fix For: 0.17
>
>         Attachments: rat-output.xml
>
> This issue was originally reported by the Infrastructure team while running RAT
> over the Apache Tomcat source code, see thread
> "Files to exclude from buildbot rat tests" (started 2016-02-15) at dev "at"
> tomcat.apache.org mailing list. (1)
>
> The issue:
> ===
> 1. Buildbot at ASF is configured to run the RAT tool over tomcat-trunk, tomcat-8,
> tomcat-7 source code.
> 2. Tomcat has *.bmp and *.dia files in its source code (images used by the Windows
> installer, diagrams in documentation) that RAT failed to recognize as binary.
> 3. RAT generated a rat-output.xml file that included header-sample fragments of
> those *.bmp and *.dia files. Those fragments are actually binary garbage.
> The result is that a broken XML file was generated.
> 4. The XSLT transformation from rat-output.xml into rat-output.html failed.
> I have not seen the actual error printed by the XSLT processor, but I confirmed
> that the file is broken by downloading rat-output.xml and opening it in
> Firefox. Firefox reported a syntax error.
>
> Workaround:
> ===
> The rat-excludes.txt file in the Tomcat source code was updated to exclude
> **/*.bmp
> **/*.dia
>
> References:
> ===
> 1. "Files to exclude from buildbot rat tests" (started 2016-02-15) at dev
> "at" tomcat.apache.org mailing list.
> http://markmail.org/message/rhrm54ch5omjalt4
> 2. Apache Tomcat links to Buildbot results:
> http://tomcat.apache.org/ci.html#Buildbot
> 3. Apache Tomcat source code
> http://tomcat.apache.org/svn.html
>
> Notes:
> - The RAT excludes file in the Tomcat source code is at
> res/rat/rat-excludes.txt
> - I know that Buildbot uses Ant to run RAT. The Ant project file for that is
> not in the Tomcat sources, but in the Infrastructure configuration (I do not have a
> link). It can be seen in the "shell_5 RAT Report Complete" step during a build run.
> E.g. here:
> https://ci.apache.org/builders/tomcat-trunk/builds/1061
> - I do not know what version of RAT is used by that build slave on Buildbot.
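Why binary header samples break the report can be shown with a short Java sketch: XML 1.0 only permits certain code points, and raw bytes from a *.bmp file typically contain others, such as NUL. The helper below is hypothetical and is not how RAT resolved this issue (the resolution above points to pull request #240); it only illustrates one way to keep arbitrary text inside the allowed character range. Escaping of characters like & and < would still be needed separately.

```java
// Hypothetical helper for illustration only; not RAT's actual fix.
final class XmlText {

    /** True if the code point matches the Char production of XML 1.0. */
    public static boolean isXmlChar(int cp) {
        return cp == 0x9 || cp == 0xA || cp == 0xD
                || (cp >= 0x20 && cp <= 0xD7FF)
                || (cp >= 0xE000 && cp <= 0xFFFD)
                || (cp >= 0x10000 && cp <= 0x10FFFF);
    }

    /** Replaces every code point that XML 1.0 forbids with U+FFFD. */
    public static String sanitize(String text) {
        StringBuilder out = new StringBuilder(text.length());
        text.codePoints().forEach(cp -> out.appendCodePoint(isXmlChar(cp) ? cp : 0xFFFD));
        return out.toString();
    }

    public static void main(String[] args) {
        // "BM" is how a bitmap file starts; the control characters stand for binary data.
        System.out.println(sanitize("BM\u0000\u0001 header"));
    }
}
```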
[jira] [Resolved] (RAT-147) binary guesser design improvement
[ https://issues.apache.org/jira/browse/RAT-147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Claude Warren resolved RAT-147.
-------------------------------
    Resolution: Fixed

Resolved with pull request #240 https://github.com/apache/creadur-rat/pull/240

> binary guesser design improvement
> ---------------------------------
>
>                 Key: RAT-147
>                 URL: https://issues.apache.org/jira/browse/RAT-147
>             Project: Apache Rat
>          Issue Type: Improvement
>    Affects Versions: 0.8
>            Reporter: Marshall Schor
>            Assignee: Claude Warren
>            Priority: Minor
>             Fix For: 0.17
>
>         Attachments: unix-newlines.txt.bin, windows-newlines.txt.bin
>
> A release manager cut a release; RAT was run, all was OK. Another user tried
> building from source / tag, and RAT complained of 2 files missing headers.
> This was traced to the "binary guesser", which read the first 200 bytes of a
> file and "guessed" if it was binary. The file in question had a UTF-8
> byte-order mark at the beginning, and was, in fact, plain ASCII after that.
> The reason for 2 different results: the release manager's OS had a default
> file encoding set to US-ASCII (as determined by running a small Java program
> that prints out the value of System.getProperty("file.encoding")). This encoding
> is for 7-bit ASCII, so the guesser, when decoding this, gets a malformed-input
> exception on the 3 bytes at the beginning of the file. This causes the
> guesser to conclude this is a "binary" file which doesn't need to be
> RAT-checked. The other user was on a Windows 7 machine, which has the
> file.encoding defaulting to Cp1252, which does have code points defined for
> the first 3 bytes and therefore doesn't throw any exception. This makes the
> guesser guess that this isn't a binary file, and it checks the file and
> reports a missing header (the file is test data...).
>
> Workaround - add the file to the explicit excludes.
>
> Potential problem - on a machine with default encoding US-ASCII, RAT will
> improperly skip checking files which perhaps should have headers, if they
> have a UTF-8 byte-order mark.
>
> Potential problem #2 - RAT is dependent on the default file encoding setting
> for part of its behavior, causing differences in what it checks.
>
> I'm not sure what a good solution would be here. It might range from
> eliminating the binary "guesser" that looks at the first 200 bytes of a file,
> to forcing UTF-8 as the charset to use.
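The platform dependence described in this issue can be reproduced with a small Java sketch. It is not RAT's guesser code; the class and method names are hypothetical. It decodes a UTF-8 byte-order mark followed by ASCII with a strict decoder for several charsets. ISO-8859-1 is used as a stand-in for Cp1252 because it is guaranteed to be present in every JVM and, like Cp1252, defines characters for the three BOM bytes.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class for illustration only; not part of RAT.
final class BomDecodeDemo {

    /** True if the bytes decode under the charset without malformed or unmappable input. */
    public static boolean decodesCleanly(byte[] bytes, Charset charset) {
        try {
            charset.newDecoder()
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // UTF-8 byte-order mark (EF BB BF) followed by plain ASCII.
        byte[] sample = {(byte) 0xEF, (byte) 0xBB, (byte) 0xBF, 'h', 'i'};
        System.out.println("default file.encoding: " + System.getProperty("file.encoding"));
        System.out.println("US-ASCII:   " + decodesCleanly(sample, StandardCharsets.US_ASCII));
        System.out.println("ISO-8859-1: " + decodesCleanly(sample, StandardCharsets.ISO_8859_1));
        System.out.println("UTF-8:      " + decodesCleanly(sample, StandardCharsets.UTF_8));
    }
}
```

The same bytes are rejected under US-ASCII and accepted under the single-byte and UTF-8 charsets, which is the kind of difference the two users observed. Passing an explicit charset, as in the last suggestion of the issue, removes the dependence on the machine's default.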
[jira] [Resolved] (RAT-150) RAT should use Apache Tika to simply guess ignored [application/X] file types and focus on the [text/Y] family as a sensible default
[ https://issues.apache.org/jira/browse/RAT-150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Claude Warren resolved RAT-150. --- Resolution: Fixed Resolved with pull request #240 https://github.com/apache/creadur-rat/pull/240 > RAT should use Apache Tika to simply guess ignored [application/X] file types > and focus on the [text/Y] family as a sensible default > > > Key: RAT-150 > URL: https://issues.apache.org/jira/browse/RAT-150 > Project: Apache Rat > Issue Type: New Feature > Components: mime-meta-data, scan >Affects Versions: 0.8 >Reporter: Chris A. Mattmann >Assignee: Claude Warren >Priority: Major > Fix For: 0.17 > > > RAT could use Apache Tika to automatically guess file types, obviating the > need to specify an explicit white list or black list. -- This message was sent by Atlassian Jira (v8.20.10#820010)
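The proposal in RAT-150 amounts to a decision on the MIME family once a type has been detected. As a rough sketch of that decision only, assuming the MIME type string has already been produced by a detector (Apache Tika in the proposal), the rule "check text/Y, ignore application/X" could look like the following. `MimeFamilyFilter` and `shouldCheckForHeader` are hypothetical names and do not come from the RAT codebase.

```java
import java.util.Locale;

/** Hypothetical helper for illustration; not RAT's actual implementation. */
final class MimeFamilyFilter {

    /**
     * Decides whether a file should be scanned for a license header,
     * based only on the family of its detected MIME type.
     */
    static boolean shouldCheckForHeader(String mimeType) {
        if (mimeType == null) {
            return false; // assumption: an undetected type is ignored rather than scanned
        }
        String normalized = mimeType.trim().toLowerCase(Locale.ROOT);
        // Sensible default from the ticket: focus on the text/* family.
        return normalized.startsWith("text/");
    }

    public static void main(String[] args) {
        System.out.println("text/x-java-source -> " + shouldCheckForHeader("text/x-java-source"));
        System.out.println("application/zip    -> " + shouldCheckForHeader("application/zip"));
    }
}
```

A production rule would probably need exceptions, since some text-based formats are registered under `application/` (for example `application/xml`); the sketch deliberately leaves those out.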
[jira] [Resolved] (RAT-54) MIME Detection Using Tika
[ https://issues.apache.org/jira/browse/RAT-54?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Claude Warren resolved RAT-54. -- Resolution: Fixed Fixed with pull request #240 https://github.com/apache/creadur-rat/pull/240 > MIME Detection Using Tika > - > > Key: RAT-54 > URL: https://issues.apache.org/jira/browse/RAT-54 > Project: Apache Rat > Issue Type: New Feature >Affects Versions: 0.7 >Reporter: Robert Burrell Donkin >Assignee: Claude Warren >Priority: Major > Fix For: 0.17 > > > Tika provides sophisticated and comprehensive MIME detection. Add support for > a Tika based implementation. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843450#comment-17843450 ] ASF subversion and git services commented on RAT-301: - Commit e84ebdb739b5fabb0bb370c94edb5a90aecbd131 in creadur-rat's branch refs/heads/master from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=e84ebdb7 ] RAT-301: Ensure Chinese characters are not categorized as binary > Rat check file identification error,java files with Chinese characters are > recognized as binary files > - > > Key: RAT-301 > URL: https://issues.apache.org/jira/browse/RAT-301 > Project: Apache Rat > Issue Type: Bug >Affects Versions: 0.13 > Environment: Window > Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; > 2015-11-11T00:41:47+08:00) >Reporter: Chen Xia >Assignee: Claude Warren >Priority: Major > Fix For: 0.17 > > > {code:java} > // code placeholder > > > org.apache.rat > apache-rat-plugin > 0.13 > > > rat-validate > validate > > check > > > > > > **/*.versionsBackup > **/.idea/ > **/*.iml > **/*.txt > **/*.json > web/.editorconfig > web/.env > web/.eslintignore > web/.jshintrc > web/public/favicon.ico > web/dist/** > web/node_modules/** > web/apache-linkis-*-web-bin.tar.gz > **/*.md > .git/ > .gitignore > **/.settings/* > **/.classpath > **/.project > **/target/** > **/out/** > **/*.log > CONTRIBUTING.md > CONTRIBUTING_CN.md > DISCLAIMER > DISCLAIMER > README.md > **/META-INF/** > .github/** > compiler/** > **/generated/** > > > {code} > This is the result of {{mvn apache-rat:check}} > {code:java} > Summary > --- > Generated at: 2022-05-06T09:56:39+08:00 > Notes: 0 > Binaries: 1 > Archives: 0 > Standards: 13 > Apache Licensed: 13 > Generated Documents: 0 > JavaDocs are generated, thus a license header is optional. > Generated files do not require license headers. 
> 0 Unknown Licenses > * > Files with Apache License headers will be marked AL > Binary files (which do not require any license headers) will be marked B > B > D:/DataSphere/linkis_svn/1.1.1-RC1/apache-linkis-1.1.1-incubating-src/apache-linkis-1.1.1-incubating-src/linkis-public-enhancements/linkis-publicservice/linkis-udf/linkis-udf-common/src/main/java/org/apache/linkis/udf/entity/UDFVersion.java > AL > D:/DataSphere/linkis_svn/1.1.1-RC1/apache-linkis-1.1.1-incubating-src/apache-linkis-1.1.1-incubating-src/linkis-public-enhancements/linkis-publicservice/linkis-udf/linkis-udf-common/src/main/java/org/apache/linkis/udf/excepiton/UDFException.java > > * {code} > UDFVersion.java is recognized as a binary file > source code: https://github.com/casionone/incubator-linkis/tree/dev-1.1.1-rat -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (RAT-150) RAT should use Apache Tika to simply guess ignored [application/X] file types and focus on the [text/Y] family as a sensible default
[ https://issues.apache.org/jira/browse/RAT-150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843454#comment-17843454 ] ASF subversion and git services commented on RAT-150: Commit 46eaed82aeea3ae17a03da30875e1fce789d375b in creadur-rat's branch refs/heads/master from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=46eaed82 ] RAT-150: Code cleanup
[jira] [Commented] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843453#comment-17843453 ] ASF subversion and git services commented on RAT-301: Commit 1b919deed53553ae1d5eb08808c13727af15a335 in creadur-rat's branch refs/heads/master from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=1b919dee ] RAT-301: Fix test setup after renaming file
[jira] [Commented] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843452#comment-17843452 ] ASF subversion and git services commented on RAT-301: Commit e11553d30d531e8fbf6b4b988c45dc82b35cfae6 in creadur-rat's branch refs/heads/master from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=e11553d3 ] RAT-301: Add changelog after integrating the file as test
[jira] [Commented] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843451#comment-17843451 ] ASF subversion and git services commented on RAT-301: Commit d0534b8b14769e5e2ae610884ba458426b006d73 in creadur-rat's branch refs/heads/master from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=d0534b8b ] RAT-301: Change example to non-meaningful Java to not find this class in IDE searches in RAT's codebase
[jira] [Commented] (RAT-54) MIME Detection Using Tika
[ https://issues.apache.org/jira/browse/RAT-54?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843455#comment-17843455 ] ASF subversion and git services commented on RAT-54: Commit 8fcb1cf497701ca0a3d12ebe61a2abbb516f6ac6 in creadur-rat's branch refs/heads/master from Claude Warren [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=8fcb1cf4 ] Merge pull request #240 from Claudenw/tika_based_document_analyzer RAT-54: Tika based document analyzer
Re: [PR] RAT-54: Tika based document analyzer [creadur-rat]
Claudenw merged PR #240: URL: https://github.com/apache/creadur-rat/pull/240 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@creadur.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org
[jira] [Commented] (RAT-363) Introduce project description
[ https://issues.apache.org/jira/browse/RAT-363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843445#comment-17843445 ] Claude Warren commented on RAT-363: --- DOAP schema is available at https://github.com/ewilderj/doap > Introduce project description > - > > Key: RAT-363 > URL: https://issues.apache.org/jira/browse/RAT-363 > Project: Apache Rat > Issue Type: Task >Reporter: Philipp Ottlinger >Priority: Major > Fix For: 0.17 > > > Create a project description (DOAP) - possible templates can be derived from: > * [https://github.com/apache/juneau/blob/master/doap_Juneau.rdf] > * [https://github.com/apache/airflow/blob/main/doap_airflow.rdf] > * [https://github.com/apache/kafka/blob/trunk/doap_Kafka.rdf] -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Updated] (RAT-363) Introduce project description
[ https://issues.apache.org/jira/browse/RAT-363?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Claude Warren updated RAT-363: Description: Create a project description (DOAP) - possible templates can be derived from: * [https://github.com/apache/juneau/blob/master/doap_Juneau.rdf] * [https://github.com/apache/airflow/blob/main/doap_airflow.rdf] * [https://github.com/apache/kafka/blob/trunk/doap_Kafka.rdf] was: Create a project description (DOAF) - possible templates can be derived from: * https://github.com/apache/juneau/blob/master/doap_Juneau.rdf * https://github.com/apache/airflow/blob/main/doap_airflow.rdf * https://github.com/apache/kafka/blob/trunk/doap_Kafka.rdf
Re: [RAT][DISCUSS] Process Archive Files
Ticket RAT-372 [1] has been opened to track the changes based on this discussion. Pull request 246 [2] is open to work on issue RAT-372. [1] https://issues.apache.org/jira/browse/RAT-372 [2] https://github.com/apache/creadur-rat/pull/246 On Fri, May 3, 2024 at 1:04 PM P. Ottlinger wrote: > Cool, good to know that everything works automatically! > > Phil > > On 03.05.24 at 12:59, Claude Warren wrote: > > If the XML looks like: > > > > " > > > > > > > > > > The output as defined in the default.xslt would be: > > > > A src/test/resources/elements/dummy.jar > > MIT MIT The MIT License > > AL AL Apache License Version 2.0 > > > > So all the enclosed licenses are reported for the jar itself. If you want > > a report of what is in the jar you can run Rat with the jar as an argument. > > There is no change to the XML format (resources don't have child resources) > > and all reports that have been modified for multiple license reporting > > should work without modification. > > > > Claude > > -- LinkedIn: http://www.linkedin.com/in/claudewarren
Re: [PR] RAT-372: Create archive processor [creadur-rat]
Claudenw commented on PR #246: URL: https://github.com/apache/creadur-rat/pull/246#issuecomment-2094238845 This change also updated the Help routines and the CLI usage of commons-cli. New help screen now looks like:
```
usage: java -jar apache-rat/target/apache-rat-CURRENT-VERSION.jar [options] [DIR|TARBALL]

== Available Options ==
 -a                         (deprecated) Add the default license header to any file with an
                            unknown license. Use '-A' or ---addLicense instead.
 -A,--addLicense            Add the default license header to any file with an unknown
                            license that is not in the exclusion list. By default new files
                            will be created with the license header, to force the
                            modification of existing files use the --force option.
 --archive                  Specifies how ARCHIVE processing will be handled.
                            (default is NOTIFICATION)
 -c,--copyright             The copyright message to use in the license headers, usually in
                            the form of "Copyright 2008 Foo"
 -d,--dir                   (deprecated, use '--') Used to indicate source when using
                            --exclude.
 --dry-run                  If set do not update the files but generate the reports.
 -e,--exclude               Excludes files matching wildcard . May be followed by multiple
                            arguments Note that '--' or a following option is required when
                            using this parameter.
 -E,--exclude-file          Excludes files matching regular expression in the input file.
 -f,--force                 Forces any changes in files to be written directly to the source
                            files (i.e. new files are not created).
 -h,--help                  Print help for the RAT command line interface and exit.
 --licenses                 File names or URLs for license definitions
 --list-families            List the defined license families (default is NONE). Valid
                            options are: ALL, APPROVED, NONE
 --list-licenses            List the defined licenses (default is NONE). Valid options are:
                            ALL, APPROVED, NONE
 --log-level                sets the log level.
 --no-default-licenses      Ignore default configuration. By default all approved default
                            licenses are used
 -o,--out                   Define the output file where to write a report to (default is
                            System.out).
 -s,--stylesheet            XSLT stylesheet to use when creating the report. Not compatible
                            with -x. Either an external xsl file may be specified or one of
                            the internal named sheets: plain-rat (default), missing-headers,
                            or unapproved-licenses
 --scan-hidden-directories  Scan hidden directories
 -x,--xml                   Output the report in raw XML format. Not compatible with -s

== Argument Types ==
 A director or archive file to scan
 A wildcard file matching pattern. example: *-test-*.txt
 A file name or URI
 A defined filter for the licenses to include. Valid values: ALL, APPROVED, NONE.
 The log level to use. Valid values DEBUG, INFO, WARN, ERROR, OFF.
 Specifies how to process file types. Valid values are:
   NOTIFICATION: List archive as present
   PRESENCE: List any licenses found
   ABSENCE: List licenses found and any unknown licences
 Either an external xsl file may be one of the internal named sheets: missing-headers,
 plain-rat, unapproved-licenses.

== Notes ==
 1. Rat highlights possible issues.
 2. Rat reports require interpretation.
 3. Rat often requires some tuning before it runs well against a project.
 4. Rat relies on heuristics: it may miss issues
```
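The three values accepted by `--archive` in the help text above map naturally onto an enum. The sketch below shows one way such an option value could be parsed with NOTIFICATION as the default; it is an illustration under assumptions, not the code in pull request #246. The names `ArchiveOptionSketch`, `ArchiveProcessing`, and `parse` are hypothetical, and the case-insensitive matching is an assumption that the help text does not confirm.

```java
import java.util.Locale;

/** Hypothetical sketch of the --archive option values; not RAT's actual implementation. */
final class ArchiveOptionSketch {

    /** Processing modes and descriptions as listed in the help text. */
    enum ArchiveProcessing {
        NOTIFICATION("List archive as present"),
        PRESENCE("List any licenses found"),
        ABSENCE("List licenses found and any unknown licences");

        private final String description;

        ArchiveProcessing(String description) {
            this.description = description;
        }

        String description() {
            return description;
        }
    }

    /**
     * Parses the option value. A missing value yields NOTIFICATION, the documented default.
     * Throws IllegalArgumentException for an unknown value.
     */
    static ArchiveProcessing parse(String value) {
        if (value == null || value.trim().isEmpty()) {
            return ArchiveProcessing.NOTIFICATION;
        }
        // Assumption: values are matched case-insensitively.
        return ArchiveProcessing.valueOf(value.trim().toUpperCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        for (ArchiveProcessing mode : ArchiveProcessing.values()) {
            System.out.println(mode + ": " + mode.description());
        }
        System.out.println("default: " + parse(null));
    }
}
```

Keeping the description next to each constant means a help formatter could generate the "Valid values are" list from the enum instead of repeating it by hand.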
Re: [PR] RAT-369: Add checkstyle and spotbugs to build and generate a report [creadur-rat]
ottlinger commented on code in PR #238: URL: https://github.com/apache/creadur-rat/pull/238#discussion_r1590002176 ## apache-rat-plugin/src/main/java/org/apache/rat/mp/util/ignore/GlobIgnoreMatcher.java: ## @@ -25,11 +25,7 @@ import java.io.File; import java.io.FileReader; import java.io.IOException; -import java.util.ArrayList; -import java.util.Arrays; -import java.util.Collection; -import java.util.List; -import java.util.Optional; +import java.util.*; Review Comment: Fixed.
[jira] [Commented] (RAT-369) Add SpotBugs to the pom files.
[ https://issues.apache.org/jira/browse/RAT-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843441#comment-17843441 ] ASF subversion and git services commented on RAT-369: - Commit 7081bdd920615e94c2336498667a963abe29f78b in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=7081bdd9 ] RAT-369: Add changelog > Add SpotBugs to the pom files. > -- > > Key: RAT-369 > URL: https://issues.apache.org/jira/browse/RAT-369 > Project: Apache Rat > Issue Type: Improvement > Components: Client - ant, Client - cli, Client - maven, core engine >Reporter: Claude Warren >Assignee: Philipp Ottlinger >Priority: Major > Fix For: 0.17 > > > Currently we have a number of issues that Spotbugs will detect. I ran a > quick test this afternoon and it found 60 potential bugs. This ticket is to > add spotbugs to the maven build, ensure that no bugs are found and ensure > that the build will fail if bugs are found. -- This message was sent by Atlassian Jira (v8.20.10#820010)
[jira] [Commented] (RAT-369) Add SpotBugs to the pom files.
[ https://issues.apache.org/jira/browse/RAT-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843440#comment-17843440 ] ASF subversion and git services commented on RAT-369: Commit 0a62cb2ca5dbe6732522c1e4fbb7a9e92159c55e in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=0a62cb2c ] RAT-369: Add checkstyle to build and maven site
[jira] [Commented] (RAT-369) Add SpotBugs to the pom files.
[ https://issues.apache.org/jira/browse/RAT-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843439#comment-17843439 ] ASF subversion and git services commented on RAT-369: Commit 85cb1a09f9025acc96e2efea0171c3b0089ce807 in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=85cb1a09 ] RAT-369: Fix import
Re: [PR] WIP: RAT-369: Add checkstyle and spotbugs to build and generate a report [creadur-rat]
ottlinger commented on PR #238: URL: https://github.com/apache/creadur-rat/pull/238#issuecomment-2094212508 Could we merge this branch and fix the remaining errors step-by-step? Personally I'd prefer to have shorter-living branches ;) @Claudenw
[jira] [Commented] (RAT-369) Add SpotBugs to the pom files.
[ https://issues.apache.org/jira/browse/RAT-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843438#comment-17843438 ] ASF subversion and git services commented on RAT-369: Commit 2136362d9f4834d6f341b5e6adc7fb6d4c9eb91e in creadur-rat's branch refs/heads/feature/RAT-369 from Philipp Ottlinger [ https://gitbox.apache.org/repos/asf?p=creadur-rat.git;h=2136362d ] RAT-369: Revert star-import
Re: [PR] RAT-372: Create archive processor [creadur-rat]
ottlinger commented on PR #246: URL: https://github.com/apache/creadur-rat/pull/246#issuecomment-2094207899 Let's review this after https://github.com/apache/creadur-rat/pull/240 is merged into master - thanks.
Re: [PR] RAT-54: Tika based document analyzer [creadur-rat]
ottlinger commented on PR #240: URL: https://github.com/apache/creadur-rat/pull/240#issuecomment-2094202616 @Claudenw pls review my latest additions concerning RAT-301, after that go ahead with the merge. Thanks for your work and the cool addition of more functionality to RAT #kudos
Re: [PR] RAT-54: Tika based document analyzer [creadur-rat]
ottlinger commented on code in PR #240: URL: https://github.com/apache/creadur-rat/pull/240#discussion_r1589986059 ## src/changes/changes.xml: ## @@ -72,6 +72,22 @@ https://maven.apache.org/plugins/maven-changes-plugin/xsd/changes-1.0.0.xsd --> + +MIME Detection Using Tika + + +Changed to detecting binary by content not name. + + +Change to detect non UTF-8 text as text not binary. Review Comment: @Claudenw I brought in changes related to RAT-301 - is that fine for you or should the Chinese character example go somewhere else? Thanks
[jira] [Updated] (RAT-301) Rat check file identification error,java files with Chinese characters are recognized as binary files
[ https://issues.apache.org/jira/browse/RAT-301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Philipp Ottlinger updated RAT-301:
---------------------------------
    Fix Version/s: 0.17

> Rat check file identification error,java files with Chinese characters are recognized as binary files
> -----------------------------------------------------------------------------------------------------
>
>                 Key: RAT-301
>                 URL: https://issues.apache.org/jira/browse/RAT-301
>             Project: Apache Rat
>          Issue Type: Bug
>    Affects Versions: 0.13
>         Environment: Window
> Apache Maven 3.3.9 (bb52d8502b132ec0a5a3f4c09453c07478323dc5; 2015-11-11T00:41:47+08:00)
>            Reporter: Chen Xia
>            Assignee: Claude Warren
>            Priority: Major
>             Fix For: 0.17
>
>
> {code:java}
> // code placeholder
> <plugin>
>   <groupId>org.apache.rat</groupId>
>   <artifactId>apache-rat-plugin</artifactId>
>   <version>0.13</version>
>   <executions>
>     <execution>
>       <id>rat-validate</id>
>       <phase>validate</phase>
>       <goals>
>         <goal>check</goal>
>       </goals>
>     </execution>
>   </executions>
>   <configuration>
>     <excludes>
>       <exclude>**/*.versionsBackup</exclude>
>       <exclude>**/.idea/</exclude>
>       <exclude>**/*.iml</exclude>
>       <exclude>**/*.txt</exclude>
>       <exclude>**/*.json</exclude>
>       <exclude>web/.editorconfig</exclude>
>       <exclude>web/.env</exclude>
>       <exclude>web/.eslintignore</exclude>
>       <exclude>web/.jshintrc</exclude>
>       <exclude>web/public/favicon.ico</exclude>
>       <exclude>web/dist/**</exclude>
>       <exclude>web/node_modules/**</exclude>
>       <exclude>web/apache-linkis-*-web-bin.tar.gz</exclude>
>       <exclude>**/*.md</exclude>
>       <exclude>.git/</exclude>
>       <exclude>.gitignore</exclude>
>       <exclude>**/.settings/*</exclude>
>       <exclude>**/.classpath</exclude>
>       <exclude>**/.project</exclude>
>       <exclude>**/target/**</exclude>
>       <exclude>**/out/**</exclude>
>       <exclude>**/*.log</exclude>
>       <exclude>CONTRIBUTING.md</exclude>
>       <exclude>CONTRIBUTING_CN.md</exclude>
>       <exclude>DISCLAIMER</exclude>
>       <exclude>DISCLAIMER</exclude>
>       <exclude>README.md</exclude>
>       <exclude>**/META-INF/**</exclude>
>       <exclude>.github/**</exclude>
>       <exclude>compiler/**</exclude>
>       <exclude>**/generated/**</exclude>
>     </excludes>
>   </configuration>
> </plugin>
> {code}
> This is the result of {{mvn apache-rat:check}}
> {code:java}
> Summary
> -------
> Generated at: 2022-05-06T09:56:39+08:00
> Notes: 0
> Binaries: 1
> Archives: 0
> Standards: 13
> Apache Licensed: 13
> Generated Documents: 0
> JavaDocs are generated, thus a license header is optional.
> Generated files do not require license headers.
> 0 Unknown Licenses
> *****************************************************
> Files with Apache License headers will be marked AL
> Binary files (which do not require any license headers) will be marked B
>  B     D:/DataSphere/linkis_svn/1.1.1-RC1/apache-linkis-1.1.1-incubating-src/apache-linkis-1.1.1-incubating-src/linkis-public-enhancements/linkis-publicservice/linkis-udf/linkis-udf-common/src/main/java/org/apache/linkis/udf/entity/UDFVersion.java
>  AL    D:/DataSphere/linkis_svn/1.1.1-RC1/apache-linkis-1.1.1-incubating-src/apache-linkis-1.1.1-incubating-src/linkis-public-enhancements/linkis-publicservice/linkis-udf/linkis-udf-common/src/main/java/org/apache/linkis/udf/excepiton/UDFException.java
>
> *****************************************************
> {code}
> UDFVersion.java is recognized as a binary file
> source code: https://github.com/casionone/incubator-linkis/tree/dev-1.1.1-rat

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
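To illustrate the difference between classifying a file by its name and by its content, here is a minimal sketch. It is not Rat's implementation (PR #240 delegates detection to Tika); the class `ContentSniffer` and its method are hypothetical, and the heuristic covers only UTF-8. It treats data as text when it contains no NUL byte and decodes cleanly as UTF-8, so a Java source file with Chinese characters in its comments counts as text.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, for illustration only; not part of Apache Rat.
class ContentSniffer {

    /** Returns true when the bytes contain no NUL and decode cleanly as UTF-8. */
    static boolean looksLikeText(byte[] data) {
        for (byte b : data) {
            if (b == 0) {
                return false; // a NUL byte is a common hint for binary content
            }
        }
        // REPORT makes the decoder throw instead of silently replacing bad input.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(data));
            return true;
        } catch (CharacterCodingException e) {
            return false; // not valid UTF-8; a fuller detector would try other charsets
        }
    }
}
```

Text in other encodings (GBK, for example) would still be rejected by this sketch, which is the kind of case a detection library is meant to handle.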
[PR] RAT-372: Create archive processor [creadur-rat]
Claudenw opened a new pull request, #246:
URL: https://github.com/apache/creadur-rat/pull/246

Initial implementation; see https://issues.apache.org/jira/browse/RAT-372

See discussion: https://lists.apache.org/thread/gyp2tfsw3w66x0wy95tv1bcwb667h866

This pull request builds on the Tika changes in RAT-54.
[jira] [Created] (RAT-373) Collect sample files
Claude Warren created RAT-373:
------------------------------
             Summary: Collect sample files
                 Key: RAT-373
                 URL: https://issues.apache.org/jira/browse/RAT-373
             Project: Apache Rat
          Issue Type: Improvement
          Components: core engine
    Affects Versions: 0.16.1
            Reporter: Claude Warren

Find sample files to prove parsing of the following file types:

test.m3u
test-audio.mpa
test-audio.wma
Re: [PR] RAT-54: Tika based document analyzer [creadur-rat]
Claudenw commented on PR #240:
URL: https://github.com/apache/creadur-rat/pull/240#issuecomment-2094116895

I updated the checklist.
Web site contents
Is there a way to execute a Java application, capture the output, and insert it into an index.apt.vm file? If so I would like to generate the help for the CLI and insert it during the build process.
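One possible shape for the capture step, as a minimal sketch: run the CLI's help routine in-process, collect what it prints to standard output, and write that text to a file the site build could pick up. The class `HelpCapture`, its methods, and the idea of passing the help routine as a `Runnable` are assumptions for illustration; nothing here is Rat's or Maven's API, and how the generated file is pulled into index.apt.vm is left open. The `PrintStream` constructor used requires Java 10 or later.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.PrintStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical helper, for illustration only.
class HelpCapture {

    /** Runs the given code and returns everything it printed to System.out. */
    static String captureStdout(Runnable app) {
        PrintStream original = System.out;
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (PrintStream capture = new PrintStream(buffer, true, StandardCharsets.UTF_8)) {
            System.setOut(capture);
            app.run();
        } finally {
            System.setOut(original); // always restore the real stdout
        }
        return new String(buffer.toByteArray(), StandardCharsets.UTF_8);
    }

    /** Writes the captured text to a file for a later build step to include. */
    static void writeSnippet(Path target, String text) {
        try {
            Files.write(target, text.getBytes(StandardCharsets.UTF_8));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

A build step could call `captureStdout` with whatever prints the CLI help and then `writeSnippet` with a path under the build directory. Running the application as a separate process during the build is an alternative to this in-process approach.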
[jira] [Created] (RAT-372) Process Archive Files
Claude Warren created RAT-372:
------------------------------
             Summary: Process Archive Files
                 Key: RAT-372
                 URL: https://issues.apache.org/jira/browse/RAT-372
             Project: Apache Rat
          Issue Type: Bug
          Components: core engine
    Affects Versions: 0.16.1
            Reporter: Claude Warren

Process the contents of Archive files to determine the licenses associated with the Archive file.

See discussion: [https://lists.apache.org/thread/gyp2tfsw3w66x0wy95tv1bcwb667h866]

Also add tests for the ArchiveWalker as noted in pull request #240
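As a rough illustration of what "processing the contents" of an archive involves, the sketch below lists the file entries of a ZIP stream using only the JDK. The class `ArchiveLister` is hypothetical and is not the ArchiveWalker mentioned above; a real processor would hand each entry's bytes to the license matchers rather than just collecting names. `sampleZip()` exists only to make the example self-contained.

```java
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper, for illustration only; not part of Apache Rat.
class ArchiveLister {

    /** Returns the names of all non-directory entries, in archive order. */
    static List<String> fileEntries(InputStream archive) {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(archive)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (!entry.isDirectory()) {
                    names.add(entry.getName()); // entry content could be read from zip here
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return names;
    }

    /** Builds a small in-memory ZIP so the example needs no files on disk. */
    static byte[] sampleZip() {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry("LICENSE"));
            zip.write("Apache License".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
            zip.putNextEntry(new ZipEntry("src/")); // directory entry, skipped by fileEntries
            zip.closeEntry();
            zip.putNextEntry(new ZipEntry("src/Main.java"));
            zip.write("class Main {}".getBytes(StandardCharsets.UTF_8));
            zip.closeEntry();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }
}
```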
[jira] [Assigned] (RAT-7) RAT should distinguish the different AL header versions
[ https://issues.apache.org/jira/browse/RAT-7?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Claude Warren reassigned RAT-7:
-------------------------------
    Assignee: Claude Warren

> RAT should distinguish the different AL header versions
> -------------------------------------------------------
>
>                 Key: RAT-7
>                 URL: https://issues.apache.org/jira/browse/RAT-7
>             Project: Apache Rat
>          Issue Type: Bug
>          Components: license-meta-data
>            Reporter: Sebb
>            Assignee: Claude Warren
>            Priority: Major
>
> RAT currently does not seem to care which version of the AL header is present.
> This is a bit misleading; IMO it should distinguish between current AL and
> obsolete AL headers.
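A minimal sketch of how header versions could be told apart, assuming the header names its version in the usual "Apache License, Version X.Y" or "Apache Software License, Version X.Y" wording. The class `ApacheLicenseVersion` is hypothetical and is not how Rat's matchers are built; it also ignores comment prefixes (such as " * ") that may split the phrase across lines in real source files.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only; not part of Apache Rat.
class ApacheLicenseVersion {

    // Matches "Apache License, Version 2.0" and "Apache Software License, Version 1.1".
    private static final Pattern VERSION = Pattern.compile(
            "Apache\\s+(?:Software\\s+)?License,?\\s+Version\\s+(\\d+\\.\\d+)",
            Pattern.CASE_INSENSITIVE);

    /** Returns the version named in the header, or empty when none is found. */
    static Optional<String> detect(String header) {
        Matcher m = VERSION.matcher(header);
        return m.find() ? Optional.of(m.group(1)) : Optional.empty();
    }

    /** True only for headers that name version 2.0, the current Apache License. */
    static boolean isCurrent(String header) {
        return detect(header).map("2.0"::equals).orElse(false);
    }
}
```

With this split, a report could mark "2.0" headers as current and anything else that matches as an obsolete AL header, rather than lumping both under one label.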
[jira] [Commented] (RAT-5) Check Copyright dates
[ https://issues.apache.org/jira/browse/RAT-5?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17843419#comment-17843419 ]

Claude Warren commented on RAT-5:
---------------------------------

We now have a Copyright matcher. It may be possible for the builder to use a system-derived date to set the date. [~sebb], if you want to look at it, I would be happy to help.

> Check Copyright dates
> ---------------------
>
>                 Key: RAT-5
>                 URL: https://issues.apache.org/jira/browse/RAT-5
>             Project: Apache Rat
>          Issue Type: New Feature
>          Components: core engine
>            Reporter: Sebb
>            Priority: Major
>
> The Copyright years in the NOTICE file(s) could be checked against:
> - current year
> - the Maven tag
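The check against the current year could look roughly like the sketch below. It is not Rat's Copyright matcher; the class `CopyrightYearCheck` and the accepted notice formats ("Copyright 2024 ..." and "Copyright 2008-2024 ...", optionally with "(c)") are assumptions for illustration. The expected year is passed in by the caller, so it could come from the system clock or from a build property.

```java
import java.util.OptionalInt;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only; not part of Apache Rat.
class CopyrightYearCheck {

    // Captures the single year, or the end year of a range such as 2008-2024.
    private static final Pattern COPYRIGHT = Pattern.compile(
            "Copyright\\s+(?:\\(c\\)\\s+)?(?:\\d{4}\\s*-\\s*)?(\\d{4})",
            Pattern.CASE_INSENSITIVE);

    /** Returns the latest copyright year found in the text, if any. */
    static OptionalInt latestYear(String notice) {
        Matcher m = COPYRIGHT.matcher(notice);
        int latest = -1;
        while (m.find()) {
            latest = Math.max(latest, Integer.parseInt(m.group(1)));
        }
        return latest < 0 ? OptionalInt.empty() : OptionalInt.of(latest);
    }

    /** True when the latest copyright year equals the expected year. */
    static boolean isUpToDate(String notice, int expectedYear) {
        OptionalInt year = latestYear(notice);
        return year.isPresent() && year.getAsInt() == expectedYear;
    }
}
```

A caller wanting the system-derived date could pass `java.time.Year.now().getValue()` as `expectedYear`.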