[PATCH] D46915: [Fixed Point Arithmetic] Fixed Point Precision Bits and Fixed Point Literals

Bevin Hansson via Phabricator via cfe-commits Mon, 18 Jun 2018 08:36:36 -0700

ebevhan added inline comments.


================
Comment at: lib/Sema/SemaExpr.cpp:1248
+  bool RHSFixed = RHSType->isFixedPointType();
+
+  if (LHSFixed && RHSFixed) {
----------------
leonardchan wrote:
> leonardchan wrote:
> > ebevhan wrote:
> > > ebevhan wrote:
> > > > leonardchan wrote:
> > > > > ebevhan wrote:
> > > > > > leonardchan wrote:
> > > > > > > ebevhan wrote:
> > > > > > > > I don't see how these semantics work properly. The 
> > > > > > > > specification requires that operations be done in the full 
> > > > > > > > precision of both types. You cannot convert the types before 
> > > > > > > > performing the operation like this, since the operation will 
> > > > > > > > not be done in full precision in that case.
> > > > > > > > 
> > > > > > > > The operator semantics of Embedded-C require the operand types 
> > > > > > > > of binary operators to be different. It's only when you've 
> > > > > > > > performed the operation that you are allowed to convert the 
> > > > > > > > result to the resulting type.
> > > > > > > Initially the idea was to convert both sides to fixed point 
> > > > > > > types, then perform standard binary operations between the fixed 
> > > > > > > point types.
> > > > > > > 
> > > > > > > For the example, a `fract * int` would have the int converted to 
> > > > > > > a fixed point type by left shifting it by the scale of the fract, 
> > > > > > > multiplying, then right shifting by the scale again to get the 
> > > > > > > resulting fract. The only unhandled thing is overflow, but the 
> > > > > > > precision of the fract remains the same. The operands would also 
> > > > > > > be casted up beforehand so there was enough space to store the 
> > > > > > > result, which was casted down back to the original fract after 
> > > > > > > performing the right shift by the scale.
> > > > > > > 
> > > > > > > Operations between fixed point types would follow a similar 
> > > > > > > process of casting both operands to the higher rank fixed point 
> > > > > > > type, and depending on the operation, more underlying shifting 
> > > > > > > and casting would be done to retain full precision of the higher 
> > > > > > > ranked type.
> > > > > > > 
> > > > > > > Though I will admit that I did not realize until now that 
> > > > > > > multiplying a fixed point type by an integer does not require 
> > > > > > > shifting the integer.
> > > > > > I see how you've reasoned; this is how C normally works. The 
> > > > > > `fract` is of higher rank than `int` and therefore is the 'common 
> > > > > > type' of the operation. However, even though it is higher rank 
> > > > > > there is no guarantee that you can perform the operation without 
> > > > > > overflowing. And overflow matters here; the spec says that it must 
> > > > > > be done in the full precision (integral + fractional) of both types.
> > > > > > 
> > > > > > > The only unhandled thing is overflow, but the precision of the 
> > > > > > > fract remains the same. The operands would also be casted up 
> > > > > > > beforehand so there was enough space to store the result, which 
> > > > > > > was casted down back to the original fract after performing the 
> > > > > > > right shift by the scale.
> > > > > > 
> > > > > > The precision remains the same (and while it doesn't have to be the 
> > > > > > same to perform an operation, it makes the implementation more 
> > > > > > regular; things like addition and subtraction 'just work'), but you 
> > > > > > cannot perform a conversion to `fract` *before* the operation 
> > > > > > itself, since if you do, there's nothing to 'cast up'. Casting up 
> > > > > > is needed for things like `fract * fract` to prevent overflow, but 
> > > > > > for `fract * int` you need to cast to a type that can fit both all 
> > > > > > values of the int and all values of the fract, and *then* you can 
> > > > > > cast up before doing the multiplication.
> > > > > > 
> > > > > > > Operations between fixed point types would follow a similar 
> > > > > > > process of casting both operands to the higher rank fixed point 
> > > > > > > type, and depending on the operation, more underlying shifting 
> > > > > > > and casting would be done to retain full precision of the higher 
> > > > > > > ranked type.
> > > > > > 
> > > > > > This might work, but I feel there could be edge cases. The E-C 
> > > > > > fixed-point ranks are very odd as they don't reflect reality; 
> > > > > > `short _Accum` cannot be considered strictly 'above' `long _Fract`, 
> > > > > > but the former has a higher rank than the latter. Depending on how 
> > > > > > the types are specified for a target, implicit casts between 
> > > > > > fixed-point types might inadvertantly discard bits, even though the 
> > > > > > spec says that operations must be done in full precision.
> > > > > I see, so just to confirm, something like a `fract * int` would not 
> > > > > result in any implicit casting between either operand, but any 
> > > > > special arithmetic, like intermediate storage types or saturation 
> > > > > handling, would be handled by the underlying IR?
> > > > > 
> > > > > So should really no conversions/implicit type casting should be 
> > > > > performed here and instead all handling of arithmetic operations 
> > > > > should happen somewhere during the codegen stage?
> > > > > 
> > > > > I see, so just to confirm, something like a fract * int would not 
> > > > > result in any implicit casting between either operand, but any 
> > > > > special arithmetic, like intermediate storage types or saturation 
> > > > > handling, would be handled by the underlying IR?
> > > > 
> > > > Yes, for operations which require precision that cannot be provided by 
> > > > any of the existing types, there must be an 'invisible' implicit 
> > > > conversion to a type which can represent all of the values of either 
> > > > operand. This conversion cannot be represented in the AST as it is 
> > > > today.
> > > > 
> > > > The simplest solution is indeed to not have any implicit cast at all in 
> > > > the AST and resolve these conversions when needed (CodeGen and 
> > > > consteval are the locations I can think of), but ultimately it feels a 
> > > > bit dirty... I think that the best solution AST-wise is to define a 
> > > > completely new type class (perhaps FullPrecisionFixedPointType) that 
> > > > represents a fixed-point type with arbitrary width, scale, signedness 
> > > > and saturation. Then you can define ImplicitCasts to an instance of 
> > > > this type that can fit both the `int` and the `fract`. I don't know if 
> > > > adding this is acceptable upstream, though.
> > > > 
> > > > I think all of these rules must apply to fixed-fixed operations as 
> > > > well; a `short accum * long fract` must be done as a type that does not 
> > > > exist, similar to fixed-int. It's not clear how saturation should work 
> > > > here either...
> > > > 
> > > > I also noticed now that the spec says in regards to comparison 
> > > > operators, `When comparing fixed-point values with fixed-point values 
> > > > or integer values, the values are compared directly; the values of the 
> > > > operands are not converted before the comparison is made.` I'm not sure 
> > > > what this means.
> > > In any case, to clarify, I think there are two paths to consider. Either:
> > > 
> > >   - Add a new type class to the type system that encapsulates an 
> > > arbitrary-precision fixed-point type that can be used for implicit casts 
> > > when operating on fixed-point and integer types. This is in my opinion 
> > > the cleaner solution, since it retains invariants on the types of 
> > > operators and simplifies any logic that deals with operators; or,
> > >   - Leave the operands of these operations uncasted. This is in some way 
> > > simpler, since it doesn't require adding a whole new type, but it 
> > > complicates other parts of the code. Anything that wants to deal with 
> > > fixed-point operators will need to know how to do fixed-point conversion 
> > > as well, which isn't a very good separation of responsibility IMO. It 
> > > also breaks the C invariant of operands of arithmetic types being in a 
> > > common type, which might be surprising to people.
> > > 
> > > 
> > I'm actually more of a fan for the second case. Aside, aside from the 
> > literal parsing in NumericLieralParser, wouldn't the only other place that 
> > would actually need to know about fixed point conversion be 
> > `ScalarExprEmitter` under CodeGen/CGExprScalar.cpp?
> > 
> > It seems that it's this class that creates the binary operations and other 
> > code gen classes like CodeGenFunction just make underlying calls to 
> > ScalarExprEmitter, so the actual conversion logic may just be contained 
> > here. Most of the implicit casting handled under UsualArithmeticConversions 
> > seems to be handled by `VisitCastExpr` under ScalarExprEmitter also, so 
> > adding another casting type would in the end just result in another case in 
> > the switch statement there, which in turn may just result in another call 
> > to ScalarExprEmitter.
> > 
> > I can see how it might be weird at first that these types don't fall under 
> > usual arithmetic, but the standard does specify that it wouldn't.
> Regarding comparison operators, my guess is that it means during comparison 
> operations specifically, the actual underlying values of each operand are 
> compared instead of having the special type conversions take place. That is, 
> `1.0k != 1` but `1.0k == 128` (assuming scale of 7). If this is the case, we 
> could actually save a few operations not having to do a shift on the integer.
> 
> I also can't seem to find a test case used by GCC  where they explicitly 
> compare a fixed point type against an integer. Normally, they instead assign 
> the FP literal to an integral type, then compare that against another integer.
> 
> I'm referring to `CONV_ACCUM_INT` in
> https://github.com/gcc-mirror/gcc/blob/e11be3ea01eaf8acd8cd86d3f9c427621b64e6b4/gcc/testsuite/gcc.dg/fixed-point/convert.h
> I'm actually more of a fan for the second case. Aside, aside from the literal 
> parsing in NumericLieralParser, wouldn't the only other place that would 
> actually need to know about fixed point conversion be ScalarExprEmitter under 
> CodeGen/CGExprScalar.cpp?

ExprConstant (consteval) would also have to know, since the input expressions 
would be these 'unbalanced' binary operations. I'm not sure why it would affect 
literal parsing, though?

Regarding VisitCastExpr; in the first case, I'm not talking about adding a new 
CastKind, I'm talking about adding a whole new type altogether. This type would 
be just as much a fixed-point type as the builtin ones, just with a 
configurable width and scale. Then, something like this:
```
  int * fract
```
where int is 32 bits and fract is 16 bits Q15, would become
```
 (fract)((FullPrecFixedPoint<32+16, 0+15>)int * (FullPrecFixedPoint<32+16, 
0+15>)fract)
```
The cast on the `int` is a `CK_IntegerToFixedPointCast`, and the cast on the 
`fract` is a `CK_FixedPointCast`. All values and operations are self-consistent 
and fully representable in the AST. Converting to and from a FullPrecFixedPoint 
type is no different from converting to and from, say, `fract`. They are both 
fixed-point types with width and scale; one is just built-in and the other is 
'artificial'. The multiplication is performed like any other fixed-point 
operation, just in a higher width and (possibly higher) scale than either of 
the operands.

The issue I have with the second case is that the AST is somehow left 
'unfinished'. There *are* casts there, but they are just not representable in 
the AST. In order to represent them, you would need to add these 
arbitrary-precision types.


================
Comment at: lib/Sema/SemaExpr.cpp:1248
+  bool RHSFixed = RHSType->isFixedPointType();
+
+  if (LHSFixed && RHSFixed) {
----------------
ebevhan wrote:
> leonardchan wrote:
> > leonardchan wrote:
> > > ebevhan wrote:
> > > > ebevhan wrote:
> > > > > leonardchan wrote:
> > > > > > ebevhan wrote:
> > > > > > > leonardchan wrote:
> > > > > > > > ebevhan wrote:
> > > > > > > > > I don't see how these semantics work properly. The 
> > > > > > > > > specification requires that operations be done in the full 
> > > > > > > > > precision of both types. You cannot convert the types before 
> > > > > > > > > performing the operation like this, since the operation will 
> > > > > > > > > not be done in full precision in that case.
> > > > > > > > > 
> > > > > > > > > The operator semantics of Embedded-C require the operand 
> > > > > > > > > types of binary operators to be different. It's only when 
> > > > > > > > > you've performed the operation that you are allowed to 
> > > > > > > > > convert the result to the resulting type.
> > > > > > > > Initially the idea was to convert both sides to fixed point 
> > > > > > > > types, then perform standard binary operations between the 
> > > > > > > > fixed point types.
> > > > > > > > 
> > > > > > > > For the example, a `fract * int` would have the int converted 
> > > > > > > > to a fixed point type by left shifting it by the scale of the 
> > > > > > > > fract, multiplying, then right shifting by the scale again to 
> > > > > > > > get the resulting fract. The only unhandled thing is overflow, 
> > > > > > > > but the precision of the fract remains the same. The operands 
> > > > > > > > would also be casted up beforehand so there was enough space to 
> > > > > > > > store the result, which was casted down back to the original 
> > > > > > > > fract after performing the right shift by the scale.
> > > > > > > > 
> > > > > > > > Operations between fixed point types would follow a similar 
> > > > > > > > process of casting both operands to the higher rank fixed point 
> > > > > > > > type, and depending on the operation, more underlying shifting 
> > > > > > > > and casting would be done to retain full precision of the 
> > > > > > > > higher ranked type.
> > > > > > > > 
> > > > > > > > Though I will admit that I did not realize until now that 
> > > > > > > > multiplying a fixed point type by an integer does not require 
> > > > > > > > shifting the integer.
> > > > > > > I see how you've reasoned; this is how C normally works. The 
> > > > > > > `fract` is of higher rank than `int` and therefore is the 'common 
> > > > > > > type' of the operation. However, even though it is higher rank 
> > > > > > > there is no guarantee that you can perform the operation without 
> > > > > > > overflowing. And overflow matters here; the spec says that it 
> > > > > > > must be done in the full precision (integral + fractional) of 
> > > > > > > both types.
> > > > > > > 
> > > > > > > > The only unhandled thing is overflow, but the precision of the 
> > > > > > > > fract remains the same. The operands would also be casted up 
> > > > > > > > beforehand so there was enough space to store the result, which 
> > > > > > > > was casted down back to the original fract after performing the 
> > > > > > > > right shift by the scale.
> > > > > > > 
> > > > > > > The precision remains the same (and while it doesn't have to be 
> > > > > > > the same to perform an operation, it makes the implementation 
> > > > > > > more regular; things like addition and subtraction 'just work'), 
> > > > > > > but you cannot perform a conversion to `fract` *before* the 
> > > > > > > operation itself, since if you do, there's nothing to 'cast up'. 
> > > > > > > Casting up is needed for things like `fract * fract` to prevent 
> > > > > > > overflow, but for `fract * int` you need to cast to a type that 
> > > > > > > can fit both all values of the int and all values of the fract, 
> > > > > > > and *then* you can cast up before doing the multiplication.
> > > > > > > 
> > > > > > > > Operations between fixed point types would follow a similar 
> > > > > > > > process of casting both operands to the higher rank fixed point 
> > > > > > > > type, and depending on the operation, more underlying shifting 
> > > > > > > > and casting would be done to retain full precision of the 
> > > > > > > > higher ranked type.
> > > > > > > 
> > > > > > > This might work, but I feel there could be edge cases. The E-C 
> > > > > > > fixed-point ranks are very odd as they don't reflect reality; 
> > > > > > > `short _Accum` cannot be considered strictly 'above' `long 
> > > > > > > _Fract`, but the former has a higher rank than the latter. 
> > > > > > > Depending on how the types are specified for a target, implicit 
> > > > > > > casts between fixed-point types might inadvertantly discard bits, 
> > > > > > > even though the spec says that operations must be done in full 
> > > > > > > precision.
> > > > > > I see, so just to confirm, something like a `fract * int` would not 
> > > > > > result in any implicit casting between either operand, but any 
> > > > > > special arithmetic, like intermediate storage types or saturation 
> > > > > > handling, would be handled by the underlying IR?
> > > > > > 
> > > > > > So should really no conversions/implicit type casting should be 
> > > > > > performed here and instead all handling of arithmetic operations 
> > > > > > should happen somewhere during the codegen stage?
> > > > > > 
> > > > > > I see, so just to confirm, something like a fract * int would not 
> > > > > > result in any implicit casting between either operand, but any 
> > > > > > special arithmetic, like intermediate storage types or saturation 
> > > > > > handling, would be handled by the underlying IR?
> > > > > 
> > > > > Yes, for operations which require precision that cannot be provided 
> > > > > by any of the existing types, there must be an 'invisible' implicit 
> > > > > conversion to a type which can represent all of the values of either 
> > > > > operand. This conversion cannot be represented in the AST as it is 
> > > > > today.
> > > > > 
> > > > > The simplest solution is indeed to not have any implicit cast at all 
> > > > > in the AST and resolve these conversions when needed (CodeGen and 
> > > > > consteval are the locations I can think of), but ultimately it feels 
> > > > > a bit dirty... I think that the best solution AST-wise is to define a 
> > > > > completely new type class (perhaps FullPrecisionFixedPointType) that 
> > > > > represents a fixed-point type with arbitrary width, scale, signedness 
> > > > > and saturation. Then you can define ImplicitCasts to an instance of 
> > > > > this type that can fit both the `int` and the `fract`. I don't know 
> > > > > if adding this is acceptable upstream, though.
> > > > > 
> > > > > I think all of these rules must apply to fixed-fixed operations as 
> > > > > well; a `short accum * long fract` must be done as a type that does 
> > > > > not exist, similar to fixed-int. It's not clear how saturation should 
> > > > > work here either...
> > > > > 
> > > > > I also noticed now that the spec says in regards to comparison 
> > > > > operators, `When comparing fixed-point values with fixed-point values 
> > > > > or integer values, the values are compared directly; the values of 
> > > > > the operands are not converted before the comparison is made.` I'm 
> > > > > not sure what this means.
> > > > In any case, to clarify, I think there are two paths to consider. 
> > > > Either:
> > > > 
> > > >   - Add a new type class to the type system that encapsulates an 
> > > > arbitrary-precision fixed-point type that can be used for implicit 
> > > > casts when operating on fixed-point and integer types. This is in my 
> > > > opinion the cleaner solution, since it retains invariants on the types 
> > > > of operators and simplifies any logic that deals with operators; or,
> > > >   - Leave the operands of these operations uncasted. This is in some 
> > > > way simpler, since it doesn't require adding a whole new type, but it 
> > > > complicates other parts of the code. Anything that wants to deal with 
> > > > fixed-point operators will need to know how to do fixed-point 
> > > > conversion as well, which isn't a very good separation of 
> > > > responsibility IMO. It also breaks the C invariant of operands of 
> > > > arithmetic types being in a common type, which might be surprising to 
> > > > people.
> > > > 
> > > > 
> > > I'm actually more of a fan for the second case. Aside, aside from the 
> > > literal parsing in NumericLieralParser, wouldn't the only other place 
> > > that would actually need to know about fixed point conversion be 
> > > `ScalarExprEmitter` under CodeGen/CGExprScalar.cpp?
> > > 
> > > It seems that it's this class that creates the binary operations and 
> > > other code gen classes like CodeGenFunction just make underlying calls to 
> > > ScalarExprEmitter, so the actual conversion logic may just be contained 
> > > here. Most of the implicit casting handled under 
> > > UsualArithmeticConversions seems to be handled by `VisitCastExpr` under 
> > > ScalarExprEmitter also, so adding another casting type would in the end 
> > > just result in another case in the switch statement there, which in turn 
> > > may just result in another call to ScalarExprEmitter.
> > > 
> > > I can see how it might be weird at first that these types don't fall 
> > > under usual arithmetic, but the standard does specify that it wouldn't.
> > Regarding comparison operators, my guess is that it means during comparison 
> > operations specifically, the actual underlying values of each operand are 
> > compared instead of having the special type conversions take place. That 
> > is, `1.0k != 1` but `1.0k == 128` (assuming scale of 7). If this is the 
> > case, we could actually save a few operations not having to do a shift on 
> > the integer.
> > 
> > I also can't seem to find a test case used by GCC  where they explicitly 
> > compare a fixed point type against an integer. Normally, they instead 
> > assign the FP literal to an integral type, then compare that against 
> > another integer.
> > 
> > I'm referring to `CONV_ACCUM_INT` in
> > https://github.com/gcc-mirror/gcc/blob/e11be3ea01eaf8acd8cd86d3f9c427621b64e6b4/gcc/testsuite/gcc.dg/fixed-point/convert.h
> > I'm actually more of a fan for the second case. Aside, aside from the 
> > literal parsing in NumericLieralParser, wouldn't the only other place that 
> > would actually need to know about fixed point conversion be 
> > ScalarExprEmitter under CodeGen/CGExprScalar.cpp?
> 
> ExprConstant (consteval) would also have to know, since the input expressions 
> would be these 'unbalanced' binary operations. I'm not sure why it would 
> affect literal parsing, though?
> 
> Regarding VisitCastExpr; in the first case, I'm not talking about adding a 
> new CastKind, I'm talking about adding a whole new type altogether. This type 
> would be just as much a fixed-point type as the builtin ones, just with a 
> configurable width and scale. Then, something like this:
> ```
>   int * fract
> ```
> where int is 32 bits and fract is 16 bits Q15, would become
> ```
>  (fract)((FullPrecFixedPoint<32+16, 0+15>)int * (FullPrecFixedPoint<32+16, 
> 0+15>)fract)
> ```
> The cast on the `int` is a `CK_IntegerToFixedPointCast`, and the cast on the 
> `fract` is a `CK_FixedPointCast`. All values and operations are 
> self-consistent and fully representable in the AST. Converting to and from a 
> FullPrecFixedPoint type is no different from converting to and from, say, 
> `fract`. They are both fixed-point types with width and scale; one is just 
> built-in and the other is 'artificial'. The multiplication is performed like 
> any other fixed-point operation, just in a higher width and (possibly higher) 
> scale than either of the operands.
> 
> The issue I have with the second case is that the AST is somehow left 
> 'unfinished'. There *are* casts there, but they are just not representable in 
> the AST. In order to represent them, you would need to add these 
> arbitrary-precision types.
> Regarding comparison operators, my guess is that it means during comparison 
> operations specifically, the actual underlying values of each operand are 
> compared instead of having the special type conversions take place. That is, 
> 1.0k != 1 but 1.0k == 128 (assuming scale of 7). If this is the case, we 
> could actually save a few operations not having to do a shift on the integer.

Right... That seems incredibly dangerous to me; I really hope this isn't what 
the spec means. 1.0 is by no means the same thing as 128. On top of that, it 
means that comparisons between fixed-point and integer can vary depending on 
the scale of the fixed-point type; this feels really shaky to me. Heck, if 
SameFBits is false, for a scale of 7, `0.5r == 64`, but `0.5ur != 64`. It might 
even be the case that `0.5r != 0.5ur`. Absolutely bizarre, and incredibly 
confusing for programmers!

It might have been done this way to make it easy to inspect the raw bits of a 
fixed-point number, but why not just do a bit-preserving conversion and compare 
as an integer in that case?

DSP-C simply prohibits 'ambiguous' type conversions such as these to prevent 
this confusion from happening.


Repository:
  rC Clang

https://reviews.llvm.org/D46915



_______________________________________________
cfe-commits mailing list
[email protected]
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

[PATCH] D46915: [Fixed Point Arithmetic] Fixed Point Precision Bits and Fixed Point Literals

Reply via email to