Thank you for sharing more details and suggestions, Vlad and Herman.

Additionally, I found an interesting thread in which an Apache Spark PMC member
asked `legal-discuss@` about exactly the `TestSerDe.jar` file mentioned
above.

The thread at https://lists.apache.org/list?legal-disc...@apache.org
- https://lists.apache.org/thread/xmbgpgt30n7fdd99pnbg7983qzzrx24k
  (June 25, 2018)
  (Re: LICENSE and NOTICE file content)

Apache Spark 1.0 (jar)
- https://github.com/apache/spark/blob/branch-1.0/sql/hive/src/test/resources/data/files/TestSerDe.jar

Apache Hive 1.2 (Original Source Code)
- https://github.com/apache/hive/blob/rel/release-1.2.2/itests/test-serde/src/main/java/org/apache/hadoop/hive/serde2/TestSerDe.java
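
For anyone who wants to re-check the jar list quoted below, here is a minimal
Python sketch that walks an extracted source release and reports the jars with
no LICENSE entry. The function name `jars_missing_license` and the rule "any
entry whose name starts with `META-INF/LICENSE`" are my own illustrative
assumptions, not an ASF-defined check, and the sketch does not look inside
MANIFEST.MF.

```python
import zipfile
from pathlib import Path


def jars_missing_license(root):
    """Return sorted paths of .jar files under `root` that have no
    META-INF/LICENSE* entry (assumed matching rule, case-insensitive)."""
    missing = []
    for jar in sorted(Path(root).rglob("*.jar")):
        # A jar is a zip archive, so the standard zipfile module can list it.
        with zipfile.ZipFile(jar) as zf:
            names = zf.namelist()
        has_license = any(
            name.upper().startswith("META-INF/LICENSE") for name in names
        )
        if not has_license:
            missing.append(str(jar))
    return missing
```

Calling `jars_missing_license("spark-3.5.5")` on an unpacked source tarball
(the directory name here is hypothetical) would return the paths to review.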

Dongjoon.

On 2025/02/26 04:59:56 "Rozov, Vlad" wrote:
> AFAIK, there are 3 ways to satisfy ASF source release policy:
> 
> 1. Build the test jars from source code and remove the jars from the source
> release; they may still remain in the GitHub repo (preferable).
> 2. Publish jars as artifacts in Maven central and pull them as a dependency.
> 3. Repackage jars to include LICENSE.
> 
> Thank you,
> 
> Vlad
> 
> On Feb 25, 2025, at 7:18 PM, Herman van Hovell 
> <her...@databricks.com.INVALID> wrote:
> 
> https://github.com/apache/spark/blob/master/sql/connect/client/jvm/src/test/resources/StubClassDummyUdf.scala
>  (commit 408c3df) is the source code for the following jars:
> 
>   *   ./connector/connect/server/src/test/resources/udf_noA.jar
>   *   ./connector/connect/client/jvm/src/test/resources/udf2.13.jar
>   *   ./connector/connect/client/jvm/src/test/resources/udf2.12.jar
> 
> https://github.com/apache/spark/blob/master/sql/hive/src/test/resources/regression-test-SPARK-8489/Main.scala
>  and 
> https://github.com/apache/spark/blob/master/sql/hive/src/test/resources/regression-test-SPARK-8489/MyCoolClass.scala
>  are the source files for the following jars:
> 
>   *   ./sql/hive/src/test/resources/regression-test-SPARK-8489/test-2.12.jar
>   *   ./sql/hive/src/test/resources/regression-test-SPARK-8489/test-2.13.jar
>   *   ./sql/hive/src/test/resources/regression-test-SPARK-8489/test-2.11.jar
> 
> ./sql/core/src/test/resources/SPARK-33084.jar was added in 
> https://github.com/apache/spark/commit/771c538620e66be2d0fb0e383e4aa37b4d29f7eb.
>  The commit contains the source code.
> 
> I will get the source code for the following jars checked in:
> 
>   *   ./core/src/test/resources/TestHelloV3_2.12.jar
>   *   ./core/src/test/resources/TestHelloV2_2.12.jar
>   *   ./core/src/test/resources/TestHelloV2_2.13.jar
>   *   ./core/src/test/resources/TestHelloV3_2.13.jar
>   *   ./connector/connect/common/src/test/resources/artifact-tests/junitLargeJar.jar
>   *   ./connector/connect/common/src/test/resources/artifact-tests/smallJar.jar
>   *   ./connector/connect/client/jvm/src/test/resources/TestHelloV2_2.12.jar
>   *   ./connector/connect/client/jvm/src/test/resources/TestHelloV2_2.13.jar
>   *   ./data/artifact-tests/junitLargeJar.jar
>   *   ./data/artifact-tests/smallJar.jar
> 
> 
> On Tue, Feb 25, 2025 at 9:22 PM Rozov, Vlad <vro...@amazon.com.invalid> wrote:
> Right, the issue does not seem to be new for 3.5 and it is not new for 3.5.5. 
> Here is the list of all jars I found in the source release:
> 
> ./core/src/test/resources/TestHelloV3_2.12.jar
> ./core/src/test/resources/TestHelloV2_2.12.jar
> ./core/src/test/resources/TestHelloV2_2.13.jar
> ./core/src/test/resources/TestHelloV3_2.13.jar
> ./core/src/test/resources/TestUDTF.jar
> ./connector/connect/server/src/test/resources/udf_noA.jar
> ./connector/connect/common/src/test/resources/artifact-tests/junitLargeJar.jar
> ./connector/connect/common/src/test/resources/artifact-tests/smallJar.jar
> ./connector/connect/client/jvm/src/test/resources/TestHelloV2_2.12.jar
> ./connector/connect/client/jvm/src/test/resources/TestHelloV2_2.13.jar
> ./connector/connect/client/jvm/src/test/resources/udf2.13.jar
> ./connector/connect/client/jvm/src/test/resources/udf2.12.jar
> ./data/artifact-tests/junitLargeJar.jar
> ./data/artifact-tests/smallJar.jar
> ./sql/core/src/test/resources/SPARK-33084.jar
> ./sql/hive/src/test/noclasspath/hive-test-udfs.jar
> ./sql/hive/src/test/resources/regression-test-SPARK-8489/test-2.12.jar
> ./sql/hive/src/test/resources/regression-test-SPARK-8489/test-2.13.jar
> ./sql/hive/src/test/resources/regression-test-SPARK-8489/test-2.11.jar
> ./sql/hive/src/test/resources/SPARK-21101-1.0.jar
> ./sql/hive/src/test/resources/data/files/TestSerDe.jar
> ./sql/hive/src/test/resources/TestUDTF.jar
> ./sql/hive-thriftserver/src/test/resources/TestUDTF.jar
> 
> Thank you,
> 
> Vlad
> 
> > On Feb 25, 2025, at 3:03 PM, Dongjoon Hyun <dongj...@apache.org> wrote:
> >
> > Just for the record, the `TestUDTF.jar` test resource seems to be one of the
> > oldest examples of what Vlad pointed out.
> >
> > https://github.com/apache/spark/blob/v2.0.0/core/src/test/resources/TestUDTF.jar
> > (Added 9 years ago)
> >
> > $ tar tvf spark-2.0.0.tgz | grep TestUDTF.jar
> > -rw-r--r--  0 jenkins jenkins    1328 Jul 19  2016 spark-2.0.0/sql/hive/src/test/resources/TestUDTF.jar
> > -rw-r--r--  0 jenkins jenkins    1328 Jul 19  2016 spark-2.0.0/sql/hive-thriftserver/src/test/resources/TestUDTF.jar
> > -rw-r--r--  0 jenkins jenkins    1328 Jul 19  2016 spark-2.0.0/core/src/test/resources/TestUDTF.jar
> >
> > Dongjoon.
> >
> >
> > On 2025/02/25 22:47:07 Dongjoon Hyun wrote:
> >> Thank you for the reply, Vlad.
> >>
> >> Do you mean SPARK-44246 has violated ASF policy since Apache Spark 3.5.0 by
> >> adding an unlicensed test jar, `TestHelloV2.jar`?
> >>
> >> https://issues.apache.org/jira/browse/SPARK-44246
> >> https://github.com/apache/spark/pull/41789
> >>
> >> And did it then spread to `TestHelloV3_2.13.jar` via SPARK-44297 in the same
> >> way?
> >>
> >> https://issues.apache.org/jira/browse/SPARK-44297
> >> https://github.com/apache/spark/pull/41852
> >>
> >>
> >> Dongjoon.
> >>
> >> On 2025/02/25 19:53:45 "Rozov, Vlad" wrote:
> >>> I am not sure if this was already discussed and noted, so I want to confirm
> >>> with PMC members:
> >>>
> >>> I see several (test) JAR files included in the ASF source release that
> >>> do not have a LICENSE in the MANIFEST or META-INF and do not have the
> >>> source code, for example core/src/test/resources/TestHelloV3_2.13.jar.
> >>> AFAIK, it may be against ASF source release policy to include such jars
> >>> in the source release (see "Does the policy apply to binary/object
> >>> files, such as executables or JAR files?" in
> >>> https://www.apache.org/legal/src-headers.html).
> >>>
> >>> Thank you,
> >>>
> >>> Vlad
> >>>
> >>> On Feb 23, 2025, at 4:37 PM, Dongjoon Hyun
> >>> <dongjoon.h...@gmail.com> wrote:
> >>>
> >>> Please vote on releasing the following candidate as Apache Spark version 
> >>> 3.5.5.
> >>>
> >>> The vote is open until February 27th 1AM (PST) and passes if a majority
> >>> of +1 PMC votes are cast, with a minimum of 3 +1 votes.
> >>>
> >>> [ ] +1 Release this package as Apache Spark 3.5.5
> >>> [ ] -1 Do not release this package because ...
> >>>
> >>> To learn more about Apache Spark, please see https://spark.apache.org/
> >>>
> >>> The tag to be voted on is v3.5.5-rc1 (commit 
> >>> 7c29c664cdc9321205a98a14858aaf8daaa19db2)
> >>> https://github.com/apache/spark/tree/v3.5.5-rc1
> >>>
> >>> The release files, including signatures, digests, etc. can be found at:
> >>> https://dist.apache.org/repos/dist/dev/spark/v3.5.5-rc1-bin/
> >>>
> >>> Signatures used for Spark RCs can be found in this file:
> >>> https://dist.apache.org/repos/dist/dev/spark/KEYS
> >>>
> >>> The staging repository for this release can be found at:
> >>> https://repository.apache.org/content/repositories/orgapachespark-1476/
> >>>
> >>> The documentation corresponding to this release can be found at:
> >>> https://dist.apache.org/repos/dist/dev/spark/v3.5.5-rc1-docs/
> >>>
> >>> The list of bug fixes going into 3.5.5 can be found at the following URL:
> >>> https://issues.apache.org/jira/projects/SPARK/versions/12355518
> >>>
> >>> This release is using the release script of the tag v3.5.5-rc1.
> >>>
> >>> FAQ
> >>>
> >>> =========================
> >>> How can I help test this release?
> >>> =========================
> >>>
> >>> If you are a Spark user, you can help us test this release by taking
> >>> an existing Spark workload and running on this release candidate, then
> >>> reporting any regressions.
> >>>
> >>> If you're working in PySpark, you can set up a virtual env, install
> >>> the current RC, and see if anything important breaks. In Java/Scala,
> >>> you can add the staging repository to your project's resolvers and test
> >>> with the RC (make sure to clean up the artifact cache before/after so
> >>> you don't end up building with an out-of-date RC going forward).
> >>>
> >>> ===========================================
> >>> What should happen to JIRA tickets still targeting 3.5.5?
> >>> ===========================================
> >>>
> >>> The current list of open tickets targeted at 3.5.5 can be found at:
> >>> https://issues.apache.org/jira/projects/SPARK and search for "Target 
> >>> Version/s" = 3.5.5
> >>>
> >>> Committers should look at those and triage. Extremely important bug
> >>> fixes, documentation, and API tweaks that impact compatibility should
> >>> be worked on immediately. Everything else please retarget to an
> >>> appropriate release.
> >>>
> >>> ==================
> >>> But my bug isn't fixed?
> >>> ==================
> >>>
> >>> In order to make timely releases, we will typically not hold the
> >>> release unless the bug in question is a regression from the previous
> >>> release. That being said, if there is a regression that has not been
> >>> correctly targeted, please ping me or a committer to help target the
> >>> issue.
> >>>
> >>>
> >>
> >> ---------------------------------------------------------------------
> >> To unsubscribe e-mail: 
> >> dev-unsubscr...@spark.apache.org<mailto:dev-unsubscr...@spark.apache.org>
> >>
> >>
> >
> 
> 
> 

