On Apr 28, 2009, at 3:19 PM, Julian Bangert wrote:

Hello,

I am currently trying to work a bit on the remaining "missing feature" that NVIDIA requires ( http://wiki.freebsd.org/NvidiaFeatureRequests or a back post in this ML) - the improved mmap system call. For now, I am trying to extend the current system call and implementation to add cache control ( the type of memory caching used) . This feature inherently is very architecture specific- but it can lead to enormous performance improvements for memmapped devices ( useful for drivers, etc). I would do this at the user site by adding 3 flags to the mmap system call (MEM_CACHE__ATTR1 to MEM_CACHE__ATTR3 ) which are a single octal digit corresponding to the various caching options ( like Uncacheable,Write Combining, etc... ) with the same numbers as the PAT_* macros from i386/include/ specialreg.h except that the value 0 ( PAT_UNCACHEABLE ) is replaced with value 2 ( undefined), whereas value 0 ( all 3 flags cleared) is assigned the meaning "feature not used, use default cache control". For each cache behaviour there would of course also be a macro expanding to the rigth combination of these flags for enhanced useability.

The mmap system call would, if any of these flags are set, decode them and get a corresponding PAT_* value, perform the mapping and then call into the pmap module to modify the cache attributes for every page.

Have you looked at mem(4) yet?

Several architectures allow attributes to be associated with ranges of physical memory. These attributes can be manipulated via ioctl() calls performed on /dev/mem. Declarations and data types are to be found in
     <sys/memrange.h>.

The specific attributes, and number of programmable ranges may vary
     between architectures.  The full set of supported attributes is:

     MDF_UNCACHEABLE
             The region is not cached.

     MDF_WRITECOMBINE
Writes to the region may be combined or performed out of order.

     MDF_WRITETHROUGH
             Writes to the region are committed synchronously.

     MDF_WRITEBACK
             Writes to the region are committed asynchronously.

     MDF_WRITEPROTECT
             The region cannot be written to.

This requires knowledge of the physical addresses, but I believe that's probably already necessary for what it sounds like you're trying to accomplish.

Back in the FreeBSD-3.0 days, I was writing a custom driver for an AGP graphics controller, and setting the MTRR flags for the exposed buffer was a definite improvement (200-1200% faster in most cases).

-- Kevin

_______________________________________________
[email protected] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[email protected]"

Reply via email to