Re: [petsc-dev] cuda failures of tests in master

Dominic Meiser Sat, 08 Aug 2015 08:04:16 -0700

With the current implementation the following can happen (v is of typeVECCUSP):

- Originally data on GPU, v.valid_GPU_array == PETSC_CUSP_GPU

- a call to VecPlaceArray(v, arr) unplaces the data on the host and setsv.valid_CPU_array=CPU. Note that the GPU data does not get stashed.- subsequent accesses of the GPU data will clobber the data that wasthere before VecPlaceArray.


I think there are two possible solutions:

- In VecPlaceArray_SeqCUSP we allocate a new array on the GPU and stashthe current values.- We do a GPU->CPU synchronization in VecPlaceArray_SeqCUSP to make surethat the data on the CPU is up to date.

It's a space/time tradeoff. Also, the first option further complicatesthe caching mechanism. I think the caching mechanism is already toocomplicated (nearly every bug I encounter with the CUDA stuff is relatedto caching). The second option allows us to more easily reuseVecPlaceArray_Seq. And we don't have to juggle a GPU unplaced array inaddition to the host side unplaced array. I'd therefore propose that wetake the hit of a GPU->CPU data synchronization. Not that thissynchronization only incurs in a PCIe data transfer if the data on theCPU is stale.

But all of this is only needed if the semantics ofVecPlaceArray/VecResetArray array is to preserve the contents of theunplaced array.


Cheers,
Dominic

On 08/07/2015 09:23 PM, Barry Smith wrote:

On Aug 7, 2015, at 7:50 PM, Dominic Meiser <dmei...@txcorp.com> wrote:

FYI I've opened a pull request that addresses this issue.

While going through the code I ran into a question regarding the semantics of 
VecPlaceArray and VecResetArray: Is the contents of the "unplaced" array 
supposed to be preserved so that the vector is completely restored upon calling 
VecResetArray? With the current implementation of VecPlaceArray_SeqCUSP and 
VecResetArray_SeqCUSP a situation can occur where the contents of the unplaced array gets 
clobbered. Does this need to be fixed?


    Hmm,  I think I always assumed the "unplaced" array was inaccessible during 
the time it is unplaced (since there is no public pointer to the array), this would mean 
that the values there shouldn't change. What are the exact details of how they can get 
changed?

    Thanks

    Barry


Cheers,
Dominic


On 08/07/2015 02:42 PM, Barry Smith wrote:


   Guardians of CUDA/GPUs

http://ftp.mcs.anl.gov/pub/petsc/nightlylogs/archive/2015/08/06/examples_master_arch-cuda-double_bb-proxy.log
http://ftp.mcs.anl.gov/pub/petsc/nightlylogs/archive/2015/08/06/examples_master_arch-cuda_bb-proxy.log

search for ex2_bjacobi

note that this example does not fail in non CUDA builds.

For some reason the iterative solver thinks it converges in 0 iterations but 
the answer is completely wrong.

   Barry


--
Dominic Meiser
Tech-X Corporation
5621 Arapahoe Avenue
Boulder, CO 80303
USA
Telephone: 303-996-2036
Fax: 303-448-7756
www.txcorp.com


--
Dominic Meiser
Tech-X Corporation
5621 Arapahoe Avenue
Boulder, CO 80303
USA
Telephone: 303-996-2036
Fax: 303-448-7756
www.txcorp.com

Re: [petsc-dev] cuda failures of tests in master

Reply via email to