William1104 commented on PR #45103:
URL: https://github.com/apache/spark/pull/45103#issuecomment-1948854042

   Hi @dongjoon-hyun,
   
   Thank you for providing the script. If I understand correctly, the script is 
designed to check if any dependencies have been updated in an unexpected manner 
for every profile. This functionality is extremely valuable as it helps us 
avoid making careless mistakes and provides the reviewer with a clearer 
understanding of how dependencies will be updated.
   
   It appears that several Spark modules are using libraries without proper 
declaration, or they have declared certain libraries but are not actually using 
them. To generate a report highlighting these issues, I ran the following 
command:
   
   ```
    ./build/mvn dependency:analyze | sed -n '/<<< dependency:.*:analyze/,/>>> 
dependency:.*:analyze/p' > dependency-analyze
   ```
   
   In the "dependency-analyze" report, let's take the module 
"spark-common-utils" as an example. It currently has a compile scope dependency 
on "commons-io," which could be changed to the test scope to avoid unnecessary 
transitive dependencies. Here are the relevant excerpts from the report:
   
   ```
   [INFO] --- dependency:3.6.1:analyze (default-cli) @ spark-common-utils_2.13 
---
   [WARNING] Used undeclared dependencies found:
   [WARNING]    
com.fasterxml.jackson.core:jackson-annotations:jar:2.16.1:compile
   [WARNING]    org.apache.commons:commons-lang3:jar:3.14.0:compile
   [WARNING]    com.fasterxml.jackson.core:jackson-core:jar:2.16.1:compile
   [WARNING]    org.scala-lang:scala-library:jar:2.13.12:compile
   [WARNING]    org.scalatest:scalatest-funsuite_2.13:jar:3.2.17:test
   [WARNING]    org.scalactic:scalactic_2.13:jar:3.2.17:test
   [WARNING]    org.scalatest:scalatest-compatible:jar:3.2.17:test
   [WARNING]    org.scalatest:scalatest-core_2.13:jar:3.2.17:test
   [WARNING] Unused declared dependencies found:
   [WARNING]    
com.fasterxml.jackson.module:jackson-module-scala_2.13:jar:2.16.1:compile
   [WARNING]    oro:oro:jar:2.0.8:compile
   [WARNING]    org.slf4j:jul-to-slf4j:jar:2.0.11:compile
   [WARNING]    org.slf4j:jcl-over-slf4j:jar:2.0.11:compile
   [WARNING]    org.apache.logging.log4j:log4j-slf4j2-impl:jar:2.22.1:compile
   [WARNING]    org.apache.logging.log4j:log4j-1.2-api:jar:2.22.1:compile
   [WARNING]    org.spark-project.spark:unused:jar:1.0.0:compile
   [WARNING]    org.scalatest:scalatest_2.13:jar:3.2.17:test
   [WARNING]    org.scalatestplus:scalacheck-1-17_2.13:jar:3.2.17.0:test
   [WARNING]    org.scalatestplus:mockito-4-11_2.13:jar:3.2.17.0:test
   [WARNING]    org.scalatestplus:selenium-4-12_2.13:jar:3.2.17.0:test
   [WARNING]    org.junit.jupiter:junit-jupiter:jar:5.9.3:test
   [WARNING]    net.aichler:jupiter-interface:jar:0.11.1:test
   [WARNING] Non-test scoped test only dependencies found:
   [WARNING]    commons-io:commons-io:jar:2.15.1:compile
   [INFO]
   ```
   This information suggests that there are both used undeclared dependencies 
and unused declared dependencies. Additionally, there are non-test scoped 
test-only dependencies, such as "commons-io:commons-io:jar:2.15.1:compile." 
These findings can help us identify areas where we can optimize and refine the 
dependency management within the "spark-common-utils" module.
   
   I would like to create PR to fix the dependency scope. 
   
   Thanks and regards, 
   William


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to