[
https://issues.apache.org/jira/browse/SPARK-14765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dongjoon Hyun resolved SPARK-14765.
-----------------------------------
Resolution: Invalid
As [~joshrosen] said, currently, Jenkins uses Github merged commit and this
should be enough. This observed situation seems to happen sometimes.
> Jenkins should run tests only based on the PR contents
> ------------------------------------------------------
>
> Key: SPARK-14765
> URL: https://issues.apache.org/jira/browse/SPARK-14765
> Project: Spark
> Issue Type: Bug
> Components: Project Infra
> Reporter: Dongjoon Hyun
>
> h4. Background
> Yesterday, when I exposed a function to Python/R, only Python and R module
> are changed. At the first successful build, it took 25 minutes, but the next
> one took 125 minutes.
> * [25
> minutes|https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56291/consoleFull]
> {code}
> [info] Found the following changed modules: pyspark-sql, sparkr
> {code}
> * [125
> minutes|https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/56298/]
> {code}
> [info] Found the following changed modules: pyspark-sql, sparkr, mllib, hive,
> sql, root
> {code}
> h4. Problem
> `identify_changed_files_from_git_commits` function in `run-tests.py` simply
> runs `git diff` to the master. It means **Newly updated master code** also
> considered as a changed file in the PR.
> h4. How to fix
> Add `git rebase` command before running `git diff`.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]