Thomas Hellström wrote:
> Dave Airlie wrote:
> 
>> On 10/31/07, Thomas Hellström <[EMAIL PROTECTED]> wrote:
>>  
>>
>>> Dave Airlie wrote:
>>>    
>>>
>>>> On 10/31/07, Thomas Hellström <[EMAIL PROTECTED]> wrote:
>>>>
>>>>      
>>>>
>>>>> Dave,
>>>>>
>>>>> When starting out with TTM i did look a little at AGP caching issues and
>>>>> there was an issue with cached memory and speculative pre-fetching that
>>>>> may affect the mapped-cached memory,
>>>>> and that we need to know about but perhaps ignore.
>>>>>
>>>>> Suppose you bind a page to the AGP aperture, but don't change the kernel
>>>>> linear map caching policy.
>>>>> Then a speculatively prefetching processor may read the memory into its
>>>>> cache and then decide it doesn't
>>>>> want to use it, and actually write it back.
>>>>> Meanwhile the GPU may have changed the contents of the page and that
>>>>> change will be overwritten. Apparently there were big problems with AMD
>>>>> Athlons actually doing this. Linux people claiming it was an Athlon bug
>>>>> and AMD people claiming it was within specs.
>>>>>
>>>>>
>>>>>        
>>>>>
>>>> http://www.linuxjournal.com/files/linuxjournal.com/linuxjournal/articles/061/6148/6148s1.html
>>>>
>>>> Is what I believe you are talking about, I'll add something to the
>>>> comment mentioning this..
>>>>
>>>>
>>>>      
>>>>
>>> Yup. In the end I believe the "change_page_attr(), global_flush_tlb()"
>>> sequence was the final outcome of this, but as I understand it,
>>> with your new code we never write through the GTT, which makes the only
>>> possible problem overwrites of GPU written data.
>>>
>>>    
>>>
>> Well so far we've only dealt with Intel CPU/GPU combinations which
>> hopefully don't suffer from this issue.. I'll put a comment in but tbh
>> there are lots of ways to mess up things with the current APIs..
>>
>> Try allocating a snooped batchbuffer, or a snooped private back buffer
>> or anything involved in a blit.... I'm going to add checks for some of
>> the more stupid things in the Intel superioctl code...
>>  
>>
> Well i915 (and friends') snooped memory is, as you say not very useful, 
> but it works fine AFAICT for things like the flip move from VRAM (which 
> has been disabled ATM due to HW lock issues). Should also be fine for 
> readpixels and zero copy texturing, although I doubt that there is any 
> performance gain in the latter.

FWIW, zero copy texturing is a good win when the texture is used only 
once, eg for video.  The streamingrect application (when it was working) 
showed a very good improvement with the cow-pbo hack (also when it was 
working).

To make it work transparently is quite difficult & fragile though.

Keith


-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems?  Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
--
_______________________________________________
Dri-devel mailing list
Dri-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dri-devel

Reply via email to