[GitHub] [tvm-rfcs] vinx13 commented on a diff in pull request #63: RFC: clarifying buffer declaration and access

GitBox Fri, 08 Apr 2022 13:36:33 -0700


vinx13 commented on code in PR #63:
URL: https://github.com/apache/tvm-rfcs/pull/63#discussion_r846458898



##########
rfcs/0063-clarifying-buffer-declaration-and-access.md:
##########
@@ -0,0 +1,232 @@
+- Feature Name: Clarifying Buffer Declaration and Access
+- Author: Wuwei Lin (@vinx13), Eric Lunderberg (@Lunderberg)
+- Start Date: 2022-03-18
+- RFC PR: [apache/tvm-rfcs#63](https://github.com/apache/tvm-rfcs/pull/63)
+- GitHub Issue: [apache/tvm#10505](https://github.com/apache/tvm/issues/10505)
+
+# Summary 
+[summary]: #summary
+
+In https://github.com/apache/tvm/pull/9727 and
+[RFC#39](https://github.com/apache/tvm-rfcs/blob/main/rfcs/0039-buffer-physical-layout.md),
 we
+deprecated `Load` and `Store` to use `BufferLoad` and `BufferStore` instead in 
order to support
+generalized multi-dimensional physical buffer access. Here we document 
necessary clarifications,
+implications about the new buffer convention, as well as the post-hoc pass 
checklist.
+
+# Motivation
+[motivation]: #motivation
+
+The main goal of this RFC is to summarize the existing buffer convention and 
the IR changes in
+https://github.com/apache/tvm/pull/9727 which have a broader impact. There are 
no new semantics
+proposed in this RFC.
+
+# Reference - level explanation
+[reference-level-explanation]: #reference-level-explanation
+
+**What’s a buffer?**
+
+Buffer is a compile-time object that groups runtime objects(data pointer and 
shape variables)
+structurally. A Buffer needs to be declared and allocated before it can be 
used.
+
+**Declaration of buffer**
+
+Buffer can be declared in the following ways:
+
+- Buffer map of `PrimFunc`. This specifies how opaque parameters of type 
`T.handle` should be
+interpreted as a buffer inside the `PrimFunc`. `T.handle` represents an opaque 
pointer (`void *`).
+- `T.alloc_buffer` is used `S-TIR` to create and allocate a buffer.
+- `T.buffer_decl` can be used to create a buffer alias by specifying the 
underlying data variable to
+reuse the data from another buffer. It can also be used to reinterpret the 
data type of the buffer.
+`T.buffer_decl` can also be used to create a buffer alias with a different 
`elem_offset`.
+`elem_offset` should be handled during the lowering process.
+
+Examples of `T.buffer_decl` is shown below.
+```
[email protected]_func
+def buffer_alias(A: T.Buffer[(16,), "float"]):
+    A_vector = T.buffer_decl([4], "float32x4", data=A.data)
+
[email protected]_func
+def buffer_alloc():
+    A = T.buffer_decl([4, 4], "float32")
+    Allocate(A.data, [16], "float32")
+```
+
+**Allocation of buffer**
+
+In low-level TIR, `Allocate` is used to allocate a data variable with given 
shapes. `Allocate`
+doesn’t operate on the buffer-level. The result of `Allocate` is a data 
variable, which may be
+reinterpreted with a different shape or data type in buffer declaration.
+
+**Explicit `DeclBuffer` IR construct**
+
+`T.buffer_decl` is not an explicit statement in TIR - There is no such node in 
TIR.
+The current behavior of `TVMScriptPrinter` is to implicitly print a 
`T.buffer_decl` at the beginning
+of `PrimFunc` for any undefined buffers. The implicit behavior can be 
error-prone. In light of the
+migration, we should consider an explicit `DeclBuffer` as part of the IR. This 
will be furthur
+discussed in a separate RFC.
+
+**Buffer Aliasing**
+
+`T.buffer_decl` creates a buffer alias if the underlying data variable 
(`.data` field) overlaps with
+another buffer. Buffer created via `alloc_buffer` always do not alias. Buffer 
aliases do not need
+`Allocate` to create the data variable. Each data variable can be allocated 
via `Allocate` once.
+If a transformation would produce multiple allocations of the same buffer var 
(e.g. unrolling a loop
+that contains an allocation), the transform should update the allocations to 
be unique using
+`tvm::tir::ConvertSSA`
+
+Buffers should not alias each other unless necessary, because aliased buffers 
increase complexity
+for TIR transformations.  Passes that rewrite buffers should clearly indicate 
how aliased buffers
+are handled.  For example, when changing the underlying layout of stored 
elements in a buffer, all
+buffer aliases must also be updated.  While buffer aliasing is typically free 
at runtime, this
+imposes a cost for buffer aliasing both to compile times and development 
complexity. 

Review Comment:
   we don't have any alias analysis right now



##########
rfcs/0063-clarifying-buffer-declaration-and-access.md:
##########
@@ -0,0 +1,232 @@
+- Feature Name: Clarifying Buffer Declaration and Access
+- Author: Wuwei Lin (@vinx13), Eric Lunderberg (@Lunderberg)
+- Start Date: 2022-03-18
+- RFC PR: [apache/tvm-rfcs#63](https://github.com/apache/tvm-rfcs/pull/63)
+- GitHub Issue: [apache/tvm#10505](https://github.com/apache/tvm/issues/10505)
+
+# Summary 
+[summary]: #summary
+
+In https://github.com/apache/tvm/pull/9727 and
+[RFC#39](https://github.com/apache/tvm-rfcs/blob/main/rfcs/0039-buffer-physical-layout.md),
 we
+deprecated `Load` and `Store` to use `BufferLoad` and `BufferStore` instead in 
order to support
+generalized multi-dimensional physical buffer access. Here we document 
necessary clarifications,
+implications about the new buffer convention, as well as the post-hoc pass 
checklist.
+
+# Motivation
+[motivation]: #motivation
+
+The main goal of this RFC is to summarize the existing buffer convention and 
the IR changes in
+https://github.com/apache/tvm/pull/9727 which have a broader impact. There are 
no new semantics
+proposed in this RFC.
+
+# Reference - level explanation
+[reference-level-explanation]: #reference-level-explanation
+
+**What’s a buffer?**
+
+Buffer is a compile-time object that groups runtime objects(data pointer and 
shape variables)
+structurally. A Buffer needs to be declared and allocated before it can be 
used.
+
+**Declaration of buffer**
+
+Buffer can be declared in the following ways:
+
+- Buffer map of `PrimFunc`. This specifies how opaque parameters of type 
`T.handle` should be
+interpreted as a buffer inside the `PrimFunc`. `T.handle` represents an opaque 
pointer (`void *`).
+- `T.alloc_buffer` is used `S-TIR` to create and allocate a buffer.
+- `T.buffer_decl` can be used to create a buffer alias by specifying the 
underlying data variable to
+reuse the data from another buffer. It can also be used to reinterpret the 
data type of the buffer.
+`T.buffer_decl` can also be used to create a buffer alias with a different 
`elem_offset`.
+`elem_offset` should be handled during the lowering process.
+
+Examples of `T.buffer_decl` is shown below.
+```
[email protected]_func
+def buffer_alias(A: T.Buffer[(16,), "float"]):
+    A_vector = T.buffer_decl([4], "float32x4", data=A.data)
+
[email protected]_func
+def buffer_alloc():
+    A = T.buffer_decl([4, 4], "float32")
+    Allocate(A.data, [16], "float32")
+```
+
+**Allocation of buffer**
+
+In low-level TIR, `Allocate` is used to allocate a data variable with given 
shapes. `Allocate`
+doesn’t operate on the buffer-level. The result of `Allocate` is a data 
variable, which may be
+reinterpreted with a different shape or data type in buffer declaration.
+
+**Explicit `DeclBuffer` IR construct**
+
+`T.buffer_decl` is not an explicit statement in TIR - There is no such node in 
TIR.
+The current behavior of `TVMScriptPrinter` is to implicitly print a 
`T.buffer_decl` at the beginning
+of `PrimFunc` for any undefined buffers. The implicit behavior can be 
error-prone. In light of the
+migration, we should consider an explicit `DeclBuffer` as part of the IR. This 
will be furthur
+discussed in a separate RFC.
+
+**Buffer Aliasing**
+
+`T.buffer_decl` creates a buffer alias if the underlying data variable 
(`.data` field) overlaps with
+another buffer. Buffer created via `alloc_buffer` always do not alias. Buffer 
aliases do not need
+`Allocate` to create the data variable. Each data variable can be allocated 
via `Allocate` once.
+If a transformation would produce multiple allocations of the same buffer var 
(e.g. unrolling a loop
+that contains an allocation), the transform should update the allocations to 
be unique using
+`tvm::tir::ConvertSSA`
+
+Buffers should not alias each other unless necessary, because aliased buffers 
increase complexity
+for TIR transformations.  Passes that rewrite buffers should clearly indicate 
how aliased buffers
+are handled.  For example, when changing the underlying layout of stored 
elements in a buffer, all
+buffer aliases must also be updated.  While buffer aliasing is typically free 
at runtime, this
+imposes a cost for buffer aliasing both to compile times and development 
complexity. 
+
+**Discussion: When it is safe to transform a buffer**
+
+We would like to discuss some examples of when it is safe to transform a 
buffer w.r.t. aliasing rules:
+
+(1) reshape, (2) layout transform (e.g. swap indices), (3) compact. 
+
+(1) is fine under aliasing as long as the low level memory is shared. (2) and 
(3) would need more
+cares. (2) requires all the aliases be changed together. (3) requires to 
compute the compact buffer
+shape and then rewrite the buffer shape. This need us to take all alias into 
consideration and then
+rewrite their shapes together.
+
+**Generalizing buffer accesses**
+
+Previously we used `Load` and `Store` to represent low-level buffer accesses. 
`Load` and `Store`
+consist of data variable, data type and index, which can be directly 
translated to pointer cast and
+accesses in runtime. Note that data type in `Load` / `Store` can be different 
from the actual data
+variable type. For example,
+
+```python
+A = T.buffer_decl(shape=(16,), dtype='float')
+T.load("float4", A.data, T.ramp(4, 1, 4))
+```
+
+can be translated to
+
+```cpp
+*((float4*)(A + 4))
+```
+
+in C codegen. 
+
+`BufferLoad` and `BufferStore` themselves can not reinterpret a buffer to a 
different shape or
+data type. They rely on the underlying buffer object. This is the fundamental 
difference between
+`Load/Store` and `BufferLoad/BufferStore` that we need to deal with carefully. 
+
+Vectorized access is achieved by using `Ramp` as index in `Load/Store`. 
Vectorized buffer access
+via `BufferLoad`/`BufferStore` can be achieved either by using a scalar index 
to access a buffer
+that has a vectorized type, or by using `Ramp` as an index into a buffer that 
has a scalar type.
+For N-D buffer indices, it is possible that `Ramp` being used in multiple 
dimensions
+(e.g. `A[Ramp(...), ..., Ramp(...)]` ). In this case the number of lanes of 
the data type of such
+value is the product of each `Ramp`. We may want to limit `Ramp` to only the 
last dimension as

Review Comment:
   @Lunderberg has already sent follow up PR to limit `Ramp` to the last 
dimension. I'll update this sentence 



##########
rfcs/0063-clarifying-buffer-declaration-and-access.md:
##########
@@ -0,0 +1,232 @@
+- Feature Name: Clarifying Buffer Declaration and Access
+- Author: Wuwei Lin (@vinx13), Eric Lunderberg (@Lunderberg)
+- Start Date: 2022-03-18
+- RFC PR: [apache/tvm-rfcs#63](https://github.com/apache/tvm-rfcs/pull/63)
+- GitHub Issue: [apache/tvm#10505](https://github.com/apache/tvm/issues/10505)
+
+# Summary 
+[summary]: #summary
+
+In https://github.com/apache/tvm/pull/9727 and
+[RFC#39](https://github.com/apache/tvm-rfcs/blob/main/rfcs/0039-buffer-physical-layout.md),
 we
+deprecated `Load` and `Store` to use `BufferLoad` and `BufferStore` instead in 
order to support
+generalized multi-dimensional physical buffer access. Here we document 
necessary clarifications,
+implications about the new buffer convention, as well as the post-hoc pass 
checklist.
+
+# Motivation
+[motivation]: #motivation
+
+The main goal of this RFC is to summarize the existing buffer convention and 
the IR changes in
+https://github.com/apache/tvm/pull/9727 which have a broader impact. There are 
no new semantics
+proposed in this RFC.
+
+# Reference - level explanation
+[reference-level-explanation]: #reference-level-explanation
+
+**What’s a buffer?**
+
+Buffer is a compile-time object that groups runtime objects(data pointer and 
shape variables)
+structurally. A Buffer needs to be declared and allocated before it can be 
used.
+
+**Declaration of buffer**
+
+Buffer can be declared in the following ways:
+
+- Buffer map of `PrimFunc`. This specifies how opaque parameters of type 
`T.handle` should be
+interpreted as a buffer inside the `PrimFunc`. `T.handle` represents an opaque 
pointer (`void *`).
+- `T.alloc_buffer` is used `S-TIR` to create and allocate a buffer.
+- `T.buffer_decl` can be used to create a buffer alias by specifying the 
underlying data variable to

Review Comment:
   Yes. In TVM script, it is `T.buffer_decl` and we have plan to rename it to 
`T.decl_buffer`



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [tvm-rfcs] vinx13 commented on a diff in pull request #63: RFC: clarifying buffer declaration and access

Reply via email to