Well done. Great work.
> On 19 Jun 2018, at 16:55, Cyril Ferlicot D. <[email protected]> wrote:
>
> Hi,
>
> For months now there have been a lot of random failures on the CI,
> making it hard to work.
>
> There are different kinds of failures:
> - Network problems
> - Failing tests
> - Incomprehensible problems
>
> I no longer see many failures due to the network. I suppose the Inria
> infrastructure improved.
>
> Failing tests have been fixed over the past months and we see fewer and
> fewer of them.
>
> Now the big problem is the incomprehensible crashes such as "The
> workspace was not found" or "FileDoesNotExistException" or "pharo-vm/ is
> already present".
>
> We just found the problem :)
>
> During the validation of the Bootstrap, multiple tests are launched on
> OSX/Windows/Linux in parallel. Each task runs on a different Jenkins
> slave. But we discovered that two slaves can share the same disk.
> Usually this causes no trouble, since a job is run by only one slave.
> But in this particular case, two slaves can be used by the same job and
> interfere with each other's resources.
>
> We highlighted the problem by adding logs to the CI. Now when we launch
> tests we create a file with the name of the task.
>
> Today we got a crash and in the log we see that the same workspace has
> two of those files, proving that they are executed on the same disk, in
> the same folder:
>
> […]
> -rw-rw-r-- 1 ci ci 0 Jun 19 16:01 Kernel-tests-unix-32
> […]
> -rw-rw-r-- 1 ci ci 0 Jun 19 16:01 Tests-unix-32
>
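
For anyone who wants to reproduce this diagnostic outside Jenkins, here is a minimal Python sketch of the marker-file idea described above. It is not the actual CI code: the function names `drop_marker` and `tasks_in_workspace` are hypothetical, and it assumes task names are valid file names. Each task drops an empty file named after itself; finding more than one marker in a workspace means several tasks are writing to the same folder.

```python
import tempfile
from pathlib import Path
from typing import Iterable, List

# Task names taken from the log excerpt quoted above.
TASKS = ("Kernel-tests-unix-32", "Tests-unix-32")


def drop_marker(workspace: Path, task_name: str) -> Path:
    """Create an empty file named after the task (hypothetical helper)."""
    marker = workspace / task_name
    marker.touch()
    return marker


def tasks_in_workspace(workspace: Path, known_tasks: Iterable[str]) -> List[str]:
    """Return the sorted names of known task markers found in the workspace."""
    known = set(known_tasks)
    return sorted(
        entry.name
        for entry in workspace.iterdir()
        if entry.is_file() and entry.name in known
    )


def is_shared(workspace: Path, known_tasks: Iterable[str]) -> bool:
    """True when more than one task left its marker in the same workspace."""
    return len(tasks_in_workspace(workspace, known_tasks)) > 1


if __name__ == "__main__":
    # Simulate two tasks landing in the same folder on a shared disk.
    with tempfile.TemporaryDirectory() as tmp:
        workspace = Path(tmp)
        for task in TASKS:
            drop_marker(workspace, task)
        print(tasks_in_workspace(workspace, TASKS), is_shared(workspace, TASKS))
```

A single marker is the normal case; two markers correspond to the `ls` output quoted above.
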
> As a solution we will execute the tests inside a subfolder with the name
> of the task and it should reduce a lot the number of problems.
>
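
The per-task subfolder fix can be sketched in the same way. This is only an illustration in Python, not the real Jenkins pipeline: `task_dir` and `run_in_task_dir` are hypothetical helpers, the command being run is a stand-in for the real test launcher, and task names are assumed to be valid directory names. Each task runs with its own working directory, so relative paths written by one task cannot collide with another's even when the workspace is shared.

```python
import subprocess
import sys
import tempfile
from pathlib import Path
from typing import List


def task_dir(workspace: Path, task_name: str) -> Path:
    """Return (and create if needed) the subfolder reserved for one task."""
    folder = workspace / task_name
    folder.mkdir(parents=True, exist_ok=True)
    return folder


def run_in_task_dir(
    workspace: Path, task_name: str, command: List[str]
) -> subprocess.CompletedProcess:
    """Run a command with the task's own subfolder as working directory."""
    return subprocess.run(
        command,
        cwd=task_dir(workspace, task_name),
        check=True,
        capture_output=True,
        text=True,
    )


# Stand-in for the test launcher: writes a result file with a relative path.
FAKE_TEST_COMMAND = [sys.executable, "-c", "open('result.xml', 'w').close()"]

if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        workspace = Path(tmp)
        for task in ("Kernel-tests-unix-32", "Tests-unix-32"):
            run_in_task_dir(workspace, task, FAKE_TEST_COMMAND)
        # Each task now has its own result.xml; nothing lands in the root.
        print(sorted(p.relative_to(workspace).as_posix()
                     for p in workspace.rglob("result.xml")))
```

Both tasks write a file with the same relative name, yet the files end up in separate folders.
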
> Have a nice day :)
>
> --
> Cyril Ferlicot
> https://ferlicot.fr
>