Re: alignment on stack-allocated arrays/structs

Trass3r Wed, 18 Nov 2009 09:20:10 -0800

Don schrieb:

Well, sort of.
It's impossible to align stack-allocated structs with any alignmentgreater than the alignment of the stack itself (which is 4 bytes).Anything larger than that and you HAVE to use the heap or alloca().


So how do other compilers supporting that alignment syntax do it?

Nothing on x86 benefits from more than 16 byte alignment, AFAIK, andit's never mandatory to use more than 8 byte alignment. I don't know somuch about the recent GPUs, though -- do they really require 16 bytealignment or more?

I'm not sure how exactly this works and why they require alignment.Couldn't find anything about that in the clEnqueueWriteBufferdescription where data gets written into GPU memory.



The specification for the OpenCL C language itself only states:

A data item declared to be a data type in memory is always aligned tothe size of the data type in bytes. For example, a float4 variable willbe aligned to a 16-byte boundary, a char2 variable will be aligned to a2-byte boundary.

A built-in data type that is not a power of two bytes in size must bealigned to the next larger power of two. This rule applies to built-intypes only, not structs or unions.




They also strangely state:

The components of vector data types with 1 ... 4 components can beaddressed as <vector_data_type>.xyzw.


float4 c, a, b;

c.xyzw = (float4)(1.0f, 2.0f, 3.0f, 4.0f);
c.z = 1.0f;         // is a float
c.xy = (float2)(3.0f, 4.0f); // is a float2

So I wonder why they used arrays in the headers and not structs to beconsistent with this.

Re: alignment on stack-allocated arrays/structs

Reply via email to